How to do sparsity and quantization-aware training? #122

Vieeo · 2025-01-23T04:01:01Z

How can I implement sparsification together with quantization-aware training (QAT) while keeping the sparse mask unchanged during quantization?

SilvesterHsu · 2025-02-06T07:26:21Z

I guess this will be helpful: https://www.nvidia.com/en-us/on-demand/session/gtcspring21-s31552/
Original paper: https://arxiv.org/abs/2104.08378
Blog: https://developer.nvidia.com/blog/sparsity-in-int8-training-workflow-and-best-practices-for-tensorrt-acceleration/
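For readers who want a feel for the 2:4 structured sparsity pattern that the paper above describes (two zeros in every group of four consecutive weights), here is a minimal sketch in plain Python. It assumes simple magnitude-based selection on a flat list of weights; the function name `two_four_mask` and the sample values are made up for illustration and are not taken from any NVIDIA tool.

```python
def two_four_mask(weights):
    """Return a 0/1 mask that keeps the 2 largest-magnitude values in each group of 4."""
    if len(weights) % 4 != 0:
        raise ValueError("number of weights must be a multiple of 4")
    mask = []
    for start in range(0, len(weights), 4):
        group = weights[start:start + 4]
        # Positions of the two largest magnitudes (ties are broken by position).
        keep = sorted(range(4), key=lambda i: -abs(group[i]))[:2]
        mask.extend(1 if i in keep else 0 for i in range(4))
    return mask


# Illustrative weights only (hypothetical values).
weights = [0.9, -0.1, 0.05, -0.7, 0.2, 0.3, -0.4, 0.01]
mask = two_four_mask(weights)
sparse_weights = [w * m for w, m in zip(weights, mask)]
```

The mask is computed once from the dense weights; fine-tuning and quantization would then operate on `sparse_weights` with that mask held fixed.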

Vieeo · 2025-02-07T08:42:00Z

> I guess this will be helpful: https://www.nvidia.com/en-us/on-demand/session/gtcspring21-s31552/
> Original paper: https://arxiv.org/abs/2104.08378
> Blog: https://developer.nvidia.com/blog/sparsity-in-int8-training-workflow-and-best-practices-for-tensorrt-acceleration/

Thanks, it can be done with your method. However, nvidia-modelopt doesn't seem to keep the sparse mask unchanged during quantization. Do you know this SDK?

yairb-gm · 2025-03-04T13:49:24Z

I have the same question: is it possible to freeze the sparse mask while doing QAT?
@Vieeo have you found a solution or workaround that can be applied?
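One generic way to think about freezing the mask, sketched below in plain Python, is to compute the mask once and re-apply it after every weight update. Symmetric quantization with a zero-point of 0 maps an exact zero to zero, so fake quantization itself does not disturb the pattern; only the gradient update can. This is not the nvidia-modelopt API: `fake_quantize`, `qat_step`, the learning rate, and the gradient values are all hypothetical, and the gradients are assumed to come from a straight-through estimator.

```python
def fake_quantize(weights, num_bits=8):
    """Symmetric per-tensor fake quantization: round to the integer grid, then dequantize."""
    qmax = 2 ** (num_bits - 1) - 1  # 127 for int8
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    return [max(-qmax, min(qmax, round(w / scale))) * scale for w in weights]


def qat_step(weights, grads, mask, lr=0.1):
    """One SGD step followed by re-applying the fixed sparsity mask."""
    updated = [w - lr * g for w, g in zip(weights, grads)]
    # Pruned positions are forced back to zero, so the mask never changes.
    return [w * m for w, m in zip(updated, mask)]


mask = [1, 0, 0, 1, 0, 1, 1, 0]                       # fixed 2:4 mask, computed before QAT
weights = [0.9, 0.0, 0.0, -0.7, 0.0, 0.3, -0.4, 0.0]  # already-sparse weights (made up)
grads = [0.5, -0.2, 0.1, 0.3, -0.6, 0.2, 0.1, 0.4]    # made-up gradients

for _ in range(3):
    weights = qat_step(weights, grads, mask)

quantized = fake_quantize(weights)
```

In a real framework the same idea would usually mean multiplying each pruned weight tensor by its stored mask after the optimizer step (or masking its gradient), rather than recomputing the mask during QAT.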

cjluo-nv assigned RalphMao Feb 5, 2025
