Skip to content

Extend DeepSpeed inference initialization API with a 'quantize_groups' argument #8311

Extend DeepSpeed inference initialization API with a 'quantize_groups' argument

Extend DeepSpeed inference initialization API with a 'quantize_groups' argument #8311

Re-run triggered January 24, 2025 22:27
Status Success
Total duration 13m 20s
Artifacts

nv-mii.yml

on: pull_request
Fit to window
Zoom out
Zoom in