Update qkeras converter for array and auto po2 scale #135

Open
wants to merge 13 commits into
base: main

@jurevreca12 jurevreca12 commented Aug 26, 2024

This PR updates the QKeras converter so that it can convert neural networks whose quantizers use tensor (array) scale factors, as well as scale factors set to "auto_po2".
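To make the "auto_po2" case concrete, here is a minimal sketch of rounding per-channel scale factors to the nearest power of two. It only illustrates the power-of-two constraint. The helper `round_to_po2` and the example values are hypothetical, and the sketch does not reproduce how QKeras derives the underlying scale from the weights.

```python
import math


def round_to_po2(scale: float) -> float:
    """Round a positive scale to the nearest power of two (in log2 space)."""
    if scale <= 0:
        raise ValueError("scale must be positive")
    return 2.0 ** round(math.log2(scale))


# Hypothetical tensor scale: one factor per output channel.
channel_scales = [0.9, 0.3, 0.13, 2.7]
po2_scales = [round_to_po2(s) for s in channel_scales]
print(po2_scales)
```

Because every resulting factor is a power of two, multiplying by it amounts to a bit shift in fixed-point hardware.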

Previously, the converter added a single "Quant" node for every "quantized_bits" function in QKeras. Now it also adds a "Mul" node after each Quant node. The extra node is needed because QKeras and QONNX handle dequantization differently.
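The Quant followed by Mul pattern can be sketched in plain Python as below. The `quant` function follows the usual quantize-then-dequantize form of the QONNX Quant operator. How the converter actually splits the QKeras scale between the Quant node and the Mul node is not stated here, so the split used in this sketch (unit scale in Quant, per-channel scale in Mul) is an assumption, as are all names and values.

```python
def quant(x, scale, zero_point, bitwidth, signed=True, narrow=False):
    """Quantize-dequantize a scalar in the style of the QONNX Quant operator."""
    if signed:
        q_min = -(2 ** (bitwidth - 1)) + (1 if narrow else 0)
        q_max = 2 ** (bitwidth - 1) - 1
    else:
        q_min = 0
        q_max = 2 ** bitwidth - 1 - (1 if narrow else 0)
    # Python's round() rounds half to even.
    q = round(x / scale + zero_point)
    q = max(q_min, min(q_max, q))
    return (q - zero_point) * scale


def quant_then_mul(values, quant_scale, mul_scales, bitwidth):
    """Apply Quant elementwise, then the extra per-channel Mul."""
    return [
        quant(x, quant_scale, 0, bitwidth) * m
        for x, m in zip(values, mul_scales)
    ]


# Assumed split: Quant uses a unit scale, Mul carries the per-channel scale.
activations = [0.6, -1.4, 3.2, 100.0]
mul_scales = [1.0, 0.5, 0.25, 1.0]
result = quant_then_mul(activations, 1.0, mul_scales, bitwidth=4)
print(result)
```

The last input saturates at the 4-bit signed maximum of 7 before the Mul is applied, which shows that clipping happens inside Quant and the Mul only rescales the result.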

This additional "Mul" node might be a problem for downstream ONNX consumers, but a transform can remove it whenever the scale factor is equal to one.
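As an illustration of such a cleanup, the sketch below removes Mul nodes whose constant operand is all ones from a toy graph. The dict-based graph representation and the function `remove_identity_mul` are hypothetical and are not the QONNX transformation API. The sketch assumes nodes are topologically sorted and does not handle a Mul that feeds a graph output directly.

```python
def remove_identity_mul(nodes, constants):
    """Drop Mul nodes whose constant operand is all ones and rewire consumers."""
    kept, rename = [], {}
    for node in nodes:
        # Redirect inputs that pointed at an already-removed Mul output.
        inputs = [rename.get(name, name) for name in node["inputs"]]
        const_inputs = [name for name in inputs if name in constants]
        is_identity_mul = (
            node["op"] == "Mul"
            and len(inputs) == 2
            and len(const_inputs) == 1
            and all(v == 1.0 for v in constants[const_inputs[0]])
        )
        if is_identity_mul:
            data_input = next(name for name in inputs if name not in constants)
            rename[node["output"]] = data_input
            continue
        kept.append({**node, "inputs": inputs})
    return kept


# Toy graph: the first Mul has scale one, the second does not.
nodes = [
    {"op": "Quant", "inputs": ["x"], "output": "q0"},
    {"op": "Mul", "inputs": ["q0", "s0"], "output": "m0"},
    {"op": "Relu", "inputs": ["m0"], "output": "r0"},
    {"op": "Quant", "inputs": ["r0"], "output": "q1"},
    {"op": "Mul", "inputs": ["q1", "s1"], "output": "m1"},
]
constants = {"s0": [1.0, 1.0], "s1": [0.5, 0.25]}
simplified = remove_identity_mul(nodes, constants)
print([n["op"] for n in simplified])
```

Only the Mul with a unit scale is removed, and the Relu is rewired to read the first Quant output directly.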

This PR builds on PR #134 ("Rounding mode new"), which introduces additional rounding modes. Because #134 is not yet merged into main, this PR also contains the rounding mode changes for now.
