You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello, I want to reproduce the lm evaluation harness results reported in the blog. Since the prompts need to be formatted with the user, assistant, system, end tokens, the evaluation harness does not work out of the box. I'm wondering if the team can share the script used to report the results in the table!
The text was updated successfully, but these errors were encountered:
Hello, I want to reproduce the lm evaluation harness results reported in the blog. Since the prompts need to be formatted with the user, assistant, system, end tokens, the evaluation harness does not work out of the box. I'm wondering if the team can share the script used to report the results in the table!
The text was updated successfully, but these errors were encountered: