fix the error of TTFT and TPOT while the bench target is chatqna_qlist_pubmed #238

leslieluyu · 2025-02-19T10:34:32Z

Description

missing streaming config of chatqna_qlist_pubmed
Lack of the config will cause the error of "next token latency"

example

End to End latency(ms),    P50: 25875.62,   P90: 34988.45,   P99: 56357.57,   Avg: 26382.29
Time to First Token-TTFT(ms),   P50: 25875.21,   P90: 34987.96,   P99: 56357.17,   Avg: 26381.87
Time Per Output Token-TPOT(ms),   P50: 0.00,   P90: 0.00,   P99: 0.01,   Avg: 0.00

Issues

List the issue or RFC link this PR is working on. If there is no such link, please mark it as n/a.

Type of change

List the type of change like below. Please delete options that are not relevant.

Bug fix (non-breaking change which fixes an issue)

Dependencies

List the newly introduced 3rd party dependency if exists.

Tests

Describe the tests that you ran to verify your changes.

deploy the chatqna v1.2
git pull the evel code
3.use locust command line to verify bench-target "chatqna_qlist_pubmed"

locust --locustfile ~/GenAIEval/evals/benchmark/stresscli/locust/aistress.py --host http://IP:PORT --run-time 60m  --max-output 128  --seed 1024 --processes 8 --users 128 --spawn-rate 100 --max-request 640 --bench-target chatqna_qlist_pubmed --llm-model meta-llama/Meta-Llama-3-8B-Instruct --stop-timeout 120 --csv /opt/opea/test_reports/test_locust/ --headless --only-summary --loglevel WARNING --json

Signed-off-by: leslieluyu <[email protected]>

for more information, see https://pre-commit.ci

fix the issue of streaming config of chatqna_qlist_pubmed

f8d7656

Signed-off-by: leslieluyu <[email protected]>

leslieluyu requested a review from lvliang-intel as a code owner February 19, 2025 10:34

[pre-commit.ci] auto fixes from pre-commit.com hooks

44d3dfa

for more information, see https://pre-commit.ci

chensuyue approved these changes Feb 20, 2025

View reviewed changes

lvliang-intel approved these changes Feb 20, 2025

View reviewed changes

lvliang-intel merged commit da04a9f into opea-project:main Feb 20, 2025
7 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix the error of TTFT and TPOT while the bench target is chatqna_qlist_pubmed #238

fix the error of TTFT and TPOT while the bench target is chatqna_qlist_pubmed #238

leslieluyu commented Feb 19, 2025

fix the error of TTFT and TPOT while the bench target is chatqna_qlist_pubmed #238

fix the error of TTFT and TPOT while the bench target is chatqna_qlist_pubmed #238

Conversation

leslieluyu commented Feb 19, 2025

Description

Issues

Type of change

Dependencies

Tests