v1.2.18

@lisaong released this 26 Jan 00:22
· 13 commits to main since this release

What's Changed

  • Merged PR 3055: Move value unrolling to after function inlining and
    loop simplification. [Mason Remy]

    When a dynamically-sized inner function is inlined into a
    statically-sized region, its loops can end up with static bounds.
    Running value unrolling after inlining and loop simplification lets
    those loops be unrolled when possible.

  • Merged PR 3053: Add package.build flags for building with higher-
    precision FP vector ops. [Mason Remy]

    Setting this new flag prevents a vmulps -> vaddps sequence
    from being contracted into a vfmaddps
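
    The numerical effect of this contraction can be sketched in plain
    Python. This is only an illustration, not Accera code: it uses
    double precision rather than the single-precision lanes of
    vmulps/vaddps, and it emulates the fused operation with exact
    rational arithmetic (one rounding at the end) instead of calling a
    hardware FMA. The input values are chosen for the example.

```python
from fractions import Fraction

# Illustrative inputs: the exact product a*b is 1 - 2**-60, which is not
# representable as a double and rounds to 1.0.
a = 1.0 + 2.0**-30
b = 1.0 - 2.0**-30
c = -1.0

# Separate multiply then add: the product is rounded before the add,
# mirroring an uncontracted multiply -> add sequence.
unfused = a * b + c

# Emulated fused multiply-add: compute a*b + c exactly, round once.
fused = float(Fraction(a) * Fraction(b) + Fraction(c))

print(unfused)
print(fused)
```

    The two results differ, which is why a build that must reproduce
    the unfused multiply-then-add results needs contraction disabled.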

  • Merged PR 3052: Place heap allocations at the top level of the
    function. [Mason Remy]

  • Merged PR 3050: [non-func, API] Change Nest.get_shape() to always
    return a list. [Captain Jack Sparrow]

  • Merged PR 3030: Include acc-translate whenever accera is installed.
    [Lisa Ong]

    Perhaps a longer-term fix is to merge the accera-gpu package into accera-compilers so we have one less package to maintain.

    However, that would constrain the binary size of acc-opt (so as not to push us past the 100MB PyPI hard limit), so we are punting until we have cycles for this.

  • Merged PR 3035: [nfc] Adds my machine to targets.py. [Kern Handa]

Full Changelog: v1.2.17...v1.2.18