-
Notifications
You must be signed in to change notification settings - Fork 1
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
benchmark and evals changes for Llama 3.1 70B v0 drop testing
change log: - add benchmark_summary.py to give readable markdown summary stats and store .csv - update benchmark scripts for stats calculation and context length pairs - add setup to evals/run_evals.sh - update documentation for new v0 drop
- Loading branch information
Showing
16 changed files
with
538 additions
and
282 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.