v1.2.18
What's Changed
-
Merged PR 3055: Move value unrolling to after function inlining and
loop simplification. [Mason Remy]Move value unrolling to after function inlining and loop simplification
This enables dynamically-sized inner functions that get inlined into
statically-sized regions to have loop unrolling affect their
actually-statically-sized loops when possible -
Merged PR 3053: Add package.build flags for building with higher-
precision FP vector ops. [Mason Remy]Add package.build flags for building with higher-precision FP vector ops
Setting this new flag prevents a vmulps -> vaddps sequence
from being contracted into a vfmaddps -
Merged PR 3052: Place heap allocations at the top level of the
function. [Mason Remy]Place heap allocations at the top level of the function
-
Merged PR 3050: [non-func, API] Change Nest.get_shape() to always
return a list. [Captain Jack Sparrow]Change Nest.get_shape() to always return a list
-
Merged PR 3030: Include acc-translate whenever accera is installed.
[Lisa Ong]Perhaps a longer-term fix is to merge the accera-gpu package into accera-compilers so we have one less package to maintain.
However, that adds constraints to the binary size of acc-opt (to not push us past the 100MB PyPI hard limit), so punting until we have cycles for this.
-
Merged PR 3035: [nfc] Adds my machine to targets.py. [Kern Handa]
Full Changelog: v1.2.17...v1.2.18