0.12.6
- Fixed a bug in
launch.py
that caused the model to be redownloaded even when the user opted not to download it again. - The model name is now returned by the API.
- The
quantizeF32toQ80
function now includes an implementation that uses AVX2 instructions.