What is the correct way to evaluate the base model deepseek-math-7b-base? #25

adventuree-cyber · 2024-09-17T14:44:49Z

I used the config at opencompass/configs/datasets/MathBench/mathbench_2024_gen_1dc21d.py and set use_ppl_single_choice = True. I'm not sure whether this is the correct config for evaluating a base model like deepseek-math-7b-base.

liushz · 2024-10-09T05:55:26Z

I recommend using mathbench_2024_few_shot_mixed_4a3fd4.py for base model evaluation.
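
For readers unfamiliar with the use_ppl_single_choice flag mentioned in the question: perplexity-based (PPL) scoring of a single-choice item generally means scoring the prompt once per candidate answer and picking the candidate the model finds least surprising, rather than parsing a generated answer. The sketch below illustrates that general idea only; it is not OpenCompass code, and the function names and the log-probability values are hypothetical.

```python
import math


def perplexity(token_logprobs):
    """Perplexity of a sequence given its per-token natural-log probabilities."""
    return math.exp(-sum(token_logprobs) / len(token_logprobs))


def pick_by_ppl(option_logprobs):
    """Return the option label whose scored text has the lowest perplexity."""
    return min(option_logprobs, key=lambda label: perplexity(option_logprobs[label]))


# Hypothetical per-token log-probs a model might assign to the prompt
# completed with each candidate answer (made-up numbers, for illustration).
example = {
    "A": [-2.3, -1.9, -2.8],
    "B": [-0.4, -0.6, -0.5],
    "C": [-1.2, -3.1, -0.9],
    "D": [-2.0, -2.2, -2.1],
}

prediction = pick_by_ppl(example)
print(prediction)
```

Because this kind of scoring does not depend on the model following an output format, it is often used for base models that have not been instruction-tuned.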
