ochougul

Follow

🧪

Onkar Chougule ochougul

🧪

Follow

#quantization #LLMs #ComputerVision

1 follower · 3 following

Achievements

Achievements

Pinned Loading

quic/efficient-transformers quic/efficient-transformers Public

This library empowers users to seamlessly port pretrained models and checkpoints on the HuggingFace (HF) hub (developed using HF transformers library) into inference-ready formats that run efficien…

Python 57 36
QLLM QLLM Public

Forked from wejoncy/QLLM

A general 2-8 bits quantization toolbox with GPTQ/AWQ/HQQ, and export to onnx/onnx-runtime easily.

Python
wanda wanda Public

Forked from locuslab/wanda

A simple and effective LLM pruning approach.

Python