Skip to content

Commit

Permalink
Set max batch size to 128
Browse files Browse the repository at this point in the history
  • Loading branch information
krishung5 committed Jan 17, 2025
1 parent 199266e commit fd078fd
Showing 1 changed file with 2 additions and 0 deletions.
2 changes: 2 additions & 0 deletions qa/L0_openai/test.sh
Original file line number Diff line number Diff line change
Expand Up @@ -73,8 +73,10 @@ function prepare_tensorrtllm() {
--dtype float16

# 3. Build engine
# max_batch_size set to 128 to avoid OOM errors
trtllm-build --checkpoint_dir ${CKPT_PATH} \
--gemm_plugin auto \
--max_batch_size 128 \
--output_dir ${ENGINE_PATH}

# 4. Prepare model repository
Expand Down

0 comments on commit fd078fd

Please sign in to comment.