Skip to content

Commit

Permalink
Add aggregate metrics in leaderboard tasks
Browse files Browse the repository at this point in the history
  • Loading branch information
NathanHB committed Nov 1, 2024
1 parent cc532e5 commit 9e0a6d5
Show file tree
Hide file tree
Showing 4 changed files with 16 additions and 0 deletions.
4 changes: 4 additions & 0 deletions lm_eval/tasks/leaderboard/bbh_mc/_leaderboard_bbh.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -24,3 +24,7 @@ task:
- leaderboard_bbh_tracking_shuffled_objects_seven_objects
- leaderboard_bbh_tracking_shuffled_objects_three_objects
- leaderboard_bbh_web_of_lies
aggregate_metric_list:
- metric: acc_norm
aggregation: mean
weight_by_size: true
4 changes: 4 additions & 0 deletions lm_eval/tasks/leaderboard/gpqa/_leaderboard_gpqa.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -3,3 +3,7 @@ task:
- leaderboard_gpqa_diamond
- leaderboard_gpqa_extended
- leaderboard_gpqa_main
aggregate_metric_list:
- metric: acc_norm
aggregation: mean
weight_by_size: true
4 changes: 4 additions & 0 deletions lm_eval/tasks/leaderboard/math/_leaderboard_math.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -7,3 +7,7 @@ task:
- leaderboard_math_num_theory_hard
- leaderboard_math_prealgebra_hard
- leaderboard_math_precalculus_hard
aggregate_metric_list:
- metric: exact_match
aggregation: mean
weight_by_size: true
4 changes: 4 additions & 0 deletions lm_eval/tasks/leaderboard/musr/_musr.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -3,3 +3,7 @@ task:
- leaderboard_musr_murder_mysteries
- leaderboard_musr_object_placements
- leaderboard_musr_team_allocation
aggregate_metric_list:
- metric: acc_norm
aggregation: mean
weight_by_size: true

0 comments on commit 9e0a6d5

Please sign in to comment.