Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This pull request adds support for MHSA with ITA by decoupling the four ITA cores in MemPool and supporting different input matrix shapes (without precisely modelling the tiling of the hardware). Furthermore, it introduces fetching the requantization parameter from memory and specifying the location of the computation result by the user.
Finally, to reduce the generated output when running Banshee in debug mode, I propose to remove the log outputs for aligned and unaligned
memcpy
operation as well as switching from debug to trace messages for some very frequently produced messages.Features
Changes
Fix
Important Note
The softmax values have a maximum value of 127 as
sumdot
modules of the hardware can only do signed-signed operations for now. This is a temporary fix untilsumdot
is fixed.