Jelajahi Sumber

adding mmlu to leaderboard confgs

Hamid Shojanazeri 1 tahun lalu
induk
melakukan
85c66acf21
2 mengubah file dengan 11 tambahan dan 1 penghapusan
  1. 2 1
      eval/eval.py
  2. 9 0
      eval/open_llm_leaderboard/mmlu_5shots.yaml

+ 2 - 1
eval/eval.py

@@ -74,7 +74,8 @@ def load_tasks(args):
             "hellaswag_10_shot",
             "truthfulqa_mc2",
             "winogrande_5_shot",
-            "gsm8k"
+            "gsm8k",
+            "mmlu",
         ]
     return args.tasks.split(",") if args.tasks else []
         

+ 9 - 0
eval/open_llm_leaderboard/mmlu_5shots.yaml

@@ -0,0 +1,9 @@
+include: {$EVAL_PATH}/lm_eval/tasks/mmlu/default/_mmlu.yaml
+task:
+  - mmlu_stem
+  - mmlu_other
+  - mmlu_social_sciences
+  - mmlu_humanities
+num_fewshot: 5
+metric_list:
+  - metric: acc