Runtime support for CPU hoist ops #2152

Draft · wants to merge 4 commits into main from vwells/cpu_hoist_runtime
Conversation

@vwellsTT (Contributor) commented Feb 7, 2025

Problem description

Add runtime support for executing hoisted CPU ops.

What's changed

Adds a mechanism for reading and opening dylibs from the flatbuffer, and for executing CPU ops by invoking the appropriate function within a given dylib.
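
For illustration, a minimal sketch of that flow (not this PR's actual code), assuming POSIX dlopen/dlsym and placeholder helper names:

```cpp
// Hypothetical sketch: spill the dylib bytes embedded in the flatbuffer to
// a temp file, dlopen it, and look up the hoisted function by symbol name.
// Only dlopen/dlsym are real APIs here; everything else is assumed.
#include <dlfcn.h>
#include <cstdint>
#include <fstream>
#include <string>
#include <vector>

// Signature the hoisted CPU functions are assumed to have.
using HoistedFn = void (*)(void **buffers, int64_t *sizesAndStrides);

void *openEmbeddedDylib(const std::vector<uint8_t> &dylibBytes,
                        const std::string &tmpPath) {
  // dlopen needs a file path, so write the embedded bytes out first.
  std::ofstream out(tmpPath, std::ios::binary);
  out.write(reinterpret_cast<const char *>(dylibBytes.data()),
            static_cast<std::streamsize>(dylibBytes.size()));
  out.close();
  return dlopen(tmpPath.c_str(), RTLD_NOW | RTLD_LOCAL);
}

HoistedFn lookupHoistedFn(void *dylibHandle, const std::string &funcName) {
  // Returns nullptr if the symbol isn't present in the dylib.
  return reinterpret_cast<HoistedFn>(dlsym(dylibHandle, funcName.c_str()));
}
```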


@vwellsTT changed the title from "copy changes from e2e branch" to "Runtime support for CPU hoist ops" on Feb 7, 2025
vwellsTT added a commit that referenced this pull request Feb 19, 2025
### Problem description
This PR adds support for CPU hoisting changes to our TTNNToFlatbuffer
translation.

### What's changed
This PR introduces new flatbuffer fields for CPU hoisted ops. It hooks
up the TTNNToFlatbuffer pass to properly utilize passes introduced in
other PRs so that a CPU dylib can be embedded in our flatbuffer.

Note: this PR is pretty much impossible to test standalone,
unfortunately. Two follow-up PRs will be strictly dependent on this PR
merging:
1. #2148, which adds earlier support for hoisting in the TTNNPipeline
for TTIRToTTNN. However, it generates IR that the TTNNToFlatbuffer pass
cannot parse until this PR lands. (So this TTNNToFlatbuffer PR is
useless without the TTIRToTTNN PR, but TTIRToTTNN breaks
ttmlir-translate without this PR 😄.)
2. #2152, the runtime PR, which will add support for actually executing
the new flatbuffers. This is obviously dependent on this PR and its
flatbuffer changes as well.
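
For context, a hedged sketch of how a compiled dylib could be embedded into a flatbuffer using the stock FlatBuffers C++ API; the schema-specific table and field names this PR actually adds are not reproduced here:

```cpp
// Sketch only: deep-copies raw .so bytes into a buffer under construction.
// Which field the resulting offset is stored into is defined by this PR's
// schema changes and is not shown here.
#include <cstdint>
#include <vector>
#include "flatbuffers/flatbuffers.h"

flatbuffers::Offset<flatbuffers::Vector<uint8_t>>
embedDylibBytes(flatbuffers::FlatBufferBuilder &fbb,
                const std::vector<uint8_t> &dylibBytes) {
  // CreateVector copies the bytes into the flatbuffer; the runtime later
  // reads them back out and dlopen()s them (see PR #2152).
  return fbb.CreateVector(dylibBytes);
}
```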
vwellsTT added a commit that referenced this pull request Feb 20, 2025
@vwellsTT force-pushed the vwells/cpu_hoist_runtime branch from 1ea72a8 to 2fbe384 on February 21, 2025 at 15:11
@vwellsTT force-pushed the vwells/cpu_hoist_runtime branch from 2fbe384 to d947859 on February 21, 2025 at 15:15
@@ -36,21 +37,6 @@ static std::string asJson(void const *fbb, uint8_t const *binarySchema,
return text;
}

static std::vector<uint32_t>
@vwellsTT (author) commented:
Moving this to a common header because I need it for my cpu.cpp file
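
The usual shape of that refactor, sketched with placeholder names (the real helper's signature is truncated in the diff above, so nothing here is taken from it):

```cpp
// common.h -- illustrative pattern only; the actual helper name and
// arguments are placeholders, not this PR's code.
#pragma once
#include <cstdint>
#include <vector>

// Formerly a file-local 'static' function in one translation unit; marked
// 'inline' so the definition can live in a shared header without ODR
// violations when both the translator and cpu.cpp include it.
inline std::vector<uint32_t> collectProgramIds(/* real args elided */) {
  return {};
}
```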

@@ -141,7 +140,12 @@ class ProgramContext {
      const std::unordered_map<uint32_t, ::ttnn::Tensor *> &liveTensors,
      const std::vector<uint32_t> &programInputs,
      const std::vector<uint32_t> &programOutputs,
-     ::ttnn::MeshDevice *parentMesh);
+     const DylibHandleMap *programDylibs, ::ttnn::MeshDevice *parentMesh)
@vwellsTT (author) commented:
IMO this ctor should be defined in the header, since 1. it's trivial and 2. the other ctors here are defined in the header (via = default, admittedly).
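
Concretely, defining it in the header would look something like this; the member layout and the shape of DylibHandleMap are assumptions, not this PR's code:

```cpp
// ProgramContext.h -- sketch under assumed member types; the real
// DylibHandleMap comes from this PR and is only approximated here.
#include <cstdint>
#include <unordered_map>
#include <vector>

namespace ttnn { class MeshDevice; }                         // stand-in decl
using DylibHandleMap = std::unordered_map<uint32_t, void *>; // assumed shape

class ProgramContext {
public:
  ProgramContext(const std::vector<uint32_t> &programInputs,
                 const std::vector<uint32_t> &programOutputs,
                 const DylibHandleMap *programDylibs,
                 ::ttnn::MeshDevice *parentMesh)
      : programInputs(programInputs), programOutputs(programOutputs),
        programDylibs(programDylibs), parentMesh(parentMesh) {}

private:
  std::vector<uint32_t> programInputs;
  std::vector<uint32_t> programOutputs;
  const DylibHandleMap *programDylibs; // non-owning
  ::ttnn::MeshDevice *parentMesh;      // non-owning
};
```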

sizes[j] = ins->Get(i)->desc()->shape()->Get(j);
}
std::vector<int64_t> strides = common::calculateStride(sizes);
int64_t *sizes_and_strides = new int64_t[2 * rank];
@vwellsTT (author) commented:
Don't like using manual new[] calls, but this is annoying because we can't use an std::vector in this function (since the memory needs to outlive this func). I guess we could instead pass in a 2D vector (one buffer for each of the input tensors) that lives in the calling func, but that's sort of ugly too imo.
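
For comparison, a sketch of that caller-owned alternative; the stride computation assumes row-major layout (matching what common::calculateStride presumably does), and all names here are illustrative:

```cpp
#include <cstdint>
#include <vector>

// Row-major strides: innermost dimension has stride 1.
static std::vector<int64_t>
rowMajorStrides(const std::vector<int64_t> &sizes) {
  std::vector<int64_t> strides(sizes.size(), 1);
  for (int64_t i = static_cast<int64_t>(sizes.size()) - 2; i >= 0; --i)
    strides[i] = strides[i + 1] * sizes[i + 1];
  return strides;
}

// Caller keeps 'storage' alive across the hoisted-op call, so no manual
// new[]/delete[] is needed; each entry packs [sizes..., strides...].
int64_t *packSizesAndStrides(std::vector<std::vector<int64_t>> &storage,
                             const std::vector<int64_t> &sizes) {
  std::vector<int64_t> packed(sizes);
  std::vector<int64_t> strides = rowMajorStrides(sizes);
  packed.insert(packed.end(), strides.begin(), strides.end());
  storage.push_back(std::move(packed));
  // The returned pointer stays valid even if 'storage' reallocates, since
  // the inner vector's heap buffer moves with it.
  return storage.back().data();
}
```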

@@ -0,0 +1,24 @@
// SPDX-FileCopyrightText: (c) 2024 Tenstorrent AI ULC
@vwellsTT (author) commented:
Not sure if some existing header here would be more appropriate than a standalone one for this--I would've said it should maybe go in workarounds.h, but I don't think we have any similar funcs there.
