Skip to content

Actions: openai/evals

Actions

Run new evals

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
128 workflow runs
128 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Upgrade openai to >=1.0.0
Run new evals #2172: Pull request #1420 synchronize by etr2460
December 5, 2023 00:47 1m 59s erik/openai-1.0
December 5, 2023 00:47 1m 59s
Upgrade openai to >=1.0.0
Run new evals #2171: Pull request #1420 synchronize by etr2460
November 29, 2023 22:59 1m 53s erik/openai-1.0
November 29, 2023 22:59 1m 53s
Upgrade openai to >=1.0.0
Run new evals #2170: Pull request #1420 synchronize by etr2460
November 29, 2023 01:09 1m 55s erik/openai-1.0
November 29, 2023 01:09 1m 55s
Upgrade openai to >=1.0.0
Run new evals #2169: Pull request #1420 synchronize by etr2460
November 28, 2023 23:51 1m 50s erik/openai-1.0
November 28, 2023 23:51 1m 50s
Upgrade openai to >=1.0.0
Run new evals #2168: Pull request #1420 synchronize by etr2460
November 28, 2023 23:45 1m 51s erik/openai-1.0
November 28, 2023 23:45 1m 51s
Upgrade openai to >=1.0.0
Run new evals #2167: Pull request #1420 synchronize by etr2460
November 28, 2023 23:39 1m 59s erik/openai-1.0
November 28, 2023 23:39 1m 59s
Upgrade openai to >=1.0.0
Run new evals #2166: Pull request #1420 opened by etr2460
November 28, 2023 23:15 2m 3s erik/openai-1.0
November 28, 2023 23:15 2m 3s
Bluff eval
Run new evals #2165: Pull request #1402 synchronize by johny-b
November 15, 2023 12:10 2m 14s johny-b:bluff
November 15, 2023 12:10 2m 14s
Bluff eval
Run new evals #2164: Pull request #1402 synchronize by johny-b
November 15, 2023 07:55 1m 54s johny-b:bluff
November 15, 2023 07:55 1m 54s
Bluff eval
Run new evals #2163: Pull request #1402 synchronize by johny-b
November 15, 2023 07:52 2m 6s johny-b:bluff
November 15, 2023 07:52 2m 6s
Bluff eval
Run new evals #2162: Pull request #1402 synchronize by johny-b
November 15, 2023 07:47 2m 35s johny-b:bluff
November 15, 2023 07:47 2m 35s
Bluff eval
Run new evals #2161: Pull request #1402 synchronize by andrew-openai
November 15, 2023 02:56 2m 3s johny-b:bluff
November 15, 2023 02:56 2m 3s
Sandbagging eval
Run new evals #2160: Pull request #1409 synchronize by ojaffe
November 14, 2023 11:54 1m 49s ojaffe:ollie/Sandbagging-v1
November 14, 2023 11:54 1m 49s
Sandbagging eval
Run new evals #2159: Pull request #1409 synchronize by ojaffe
November 14, 2023 11:24 1m 48s ojaffe:ollie/Sandbagging-v1
November 14, 2023 11:24 1m 48s
Sandbagging eval
Run new evals #2158: Pull request #1409 synchronize by ojaffe
November 14, 2023 10:59 1m 40s ojaffe:ollie/Sandbagging-v1
November 14, 2023 10:59 1m 40s
Sandbagging eval
Run new evals #2157: Pull request #1409 opened by ojaffe
November 14, 2023 10:59 1m 46s ojaffe:ollie/Sandbagging-v1
November 14, 2023 10:59 1m 46s
MMP v2 eval
Run new evals #2156: Pull request #1403 synchronize by ojaffe
November 14, 2023 10:45 1m 46s ojaffe:ollie/MMP_v2
November 14, 2023 10:45 1m 46s
Add theory of mind eval
Run new evals #2155: Pull request #1405 synchronize by inwaves
November 14, 2023 10:05 2m 19s inwaves:feature/theory_of_mind
November 14, 2023 10:05 2m 19s
Add theory of mind eval
Run new evals #2154: Pull request #1405 synchronize by inwaves
November 14, 2023 10:03 1m 57s inwaves:feature/theory_of_mind
November 14, 2023 10:03 1m 57s
Add theory of mind eval
Run new evals #2153: Pull request #1405 synchronize by inwaves
November 14, 2023 09:57 1m 42s inwaves:feature/theory_of_mind
November 14, 2023 09:57 1m 42s
Migrate from openai==0.28.1 to openai==1.2.4
Run new evals #2152: Pull request #1407 opened by johny-b
November 14, 2023 09:52 1m 42s johny-b:migrate
November 14, 2023 09:52 1m 42s
Self-Prompting eval
Run new evals #2151: Pull request #1401 synchronize by JunShern
November 14, 2023 09:13 2m 27s JunShern:jun/self-prompting-eval
November 14, 2023 09:13 2m 27s
Bluff eval
Run new evals #2150: Pull request #1402 synchronize by johny-b
November 14, 2023 08:54 1m 57s johny-b:bluff
November 14, 2023 08:54 1m 57s
Bluff eval
Run new evals #2149: Pull request #1402 synchronize by johny-b
November 14, 2023 08:22 2m 33s johny-b:bluff
November 14, 2023 08:22 2m 33s
Add theory of mind eval
Run new evals #2148: Pull request #1405 opened by inwaves
November 10, 2023 14:24 2m 8s inwaves:feature/theory_of_mind
November 10, 2023 14:24 2m 8s