Yet another llm model format to support? AWQ #448

dreemur99 · 2023-09-24T12:09:17Z

I'm sure you've already seen it, but there's another new model format: AWQ. It claims to be "blazing-fast" with much lower VRAM requirements. Looks like an almost 45% reduction in requirements.

LostRuins · 2023-09-25T10:18:31Z

LostRuins
Sep 25, 2023
Maintainer

Probably won't be adding this on my own. If llama.cpp adds it, koboldcpp will inherit their support.

1 reply

dreemur99 Sep 25, 2023
Author

Makes sense.
