benchmark

feat: add benchmarks_entrypoint.py (huggingface#34495 )

Dec 18, 2024

9a94dfe · Dec 18, 2024

This branch is 430 commits behind huggingface/transformers:main.

Name	Name	Last commit message	Last commit date
parent directory ..
config	config	[Benchmark] Reuse `optimum-benchmark` (huggingface#30615 )	May 21, 2024
README.md	README.md	feat: add `benchmarks_entrypoint.py` (huggingface#34495 )	Dec 18, 2024
__init__.py	__init__.py	[Benchmark] Reuse `optimum-benchmark` (huggingface#30615 )	May 21, 2024
benchmark.py	benchmark.py	Fix benchmark script (huggingface#32635 )	Aug 22, 2024
benchmarks_entrypoint.py	benchmarks_entrypoint.py	feat: add `benchmarks_entrypoint.py` (huggingface#34495 )	Dec 18, 2024
default.yml	default.yml	feat: add `benchmarks_entrypoint.py` (huggingface#34495 )	Dec 18, 2024
grafana_dashboard.json	grafana_dashboard.json	feat: add `benchmarks_entrypoint.py` (huggingface#34495 )	Dec 18, 2024
grafana_datasource.yaml	grafana_datasource.yaml	feat: add `benchmarks_entrypoint.py` (huggingface#34495 )	Dec 18, 2024
init_db.sql	init_db.sql	feat: add `benchmarks_entrypoint.py` (huggingface#34495 )	Dec 18, 2024
llama.py	llama.py	feat: add `benchmarks_entrypoint.py` (huggingface#34495 )	Dec 18, 2024
optimum_benchmark_wrapper.py	optimum_benchmark_wrapper.py	[Benchmark] Reuse `optimum-benchmark` (huggingface#30615 )	May 21, 2024
requirements.txt	requirements.txt	refactor: benchmarks (huggingface#33896 )	Oct 11, 2024

README.md

Benchmarks

You might want to add new benchmarks.

You will need to define a python function named run_benchmark in your python file and the file must be located in this benchmark/ directory.

The expected function signature is the following:

def run_benchmark(logger: Logger, branch: str, commit_id: str, commit_msg: str, num_tokens_to_generate=100):

Writing metrics to the database

MetricRecorder is thread-safe, in the sense of the python Thread. This means you can start a background thread to do the readings on the device measurements while not blocking the main thread to execute the model measurements.

cf llama.py to see an example of this in practice.

from benchmarks_entrypoint import MetricsRecorder
import psycopg2

def run_benchmark(logger: Logger, branch: str, commit_id: str, commit_msg: str, num_tokens_to_generate=100):
  metrics_recorder = MetricsRecorder(psycopg2.connect("dbname=metrics"), logger, branch, commit_id, commit_msg)
  benchmark_id = metrics_recorder.initialise_benchmark({"gpu_name": gpu_name, "model_id": model_id})
    # To collect device measurements
    metrics_recorder.collect_device_measurements(
        benchmark_id, cpu_util, mem_megabytes, gpu_util, gpu_mem_megabytes
    )
    # To collect your model measurements
    metrics_recorder.collect_model_measurements(
        benchmark_id,
        {
            "model_load_time": model_load_time,
            "first_eager_forward_pass_time_secs": first_eager_fwd_pass_time,
            "second_eager_forward_pass_time_secs": second_eager_fwd_pass_time,
            "first_eager_generate_time_secs": first_eager_generate_time,
            "second_eager_generate_time_secs": second_eager_generate_time,
            "time_to_first_token_secs": time_to_first_token,
            "time_to_second_token_secs": time_to_second_token,
            "time_to_third_token_secs": time_to_third_token,
            "time_to_next_token_mean_secs": mean_time_to_next_token,
            "first_compile_generate_time_secs": first_compile_generate_time,
            "second_compile_generate_time_secs": second_compile_generate_time,
            "third_compile_generate_time_secs": third_compile_generate_time,
            "fourth_compile_generate_time_secs": fourth_compile_generate_time,
        },
    )

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Files

benchmark

benchmark

README.md

Benchmarks

Writing metrics to the database

Files

benchmark

Directory actions

More options

Directory actions

More options

Latest commit

History

benchmark

Folders and files

parent directory

README.md

Benchmarks

Writing metrics to the database