[TMVA][SOFIE] Memory Optimization for Intermediate tensors - v1 #17532

sanjibansg · 2025-01-27T09:39:11Z

This PR brings the first version for the memory optimization for intermediate tensors.

Currently, in the inference code, all the intermediate tensors are initialized at first and are allocated memory, thus no memory reuse takes place. Since not all the intermediate tensors are required till the end, we can flush some of them, and reuse the memory that they release.

This first edition of memory optimization for intermediate tensors uses a simple mechanism, where the total memory required is first calculated along with the count of operators who need a particular intermediate tensor as their input.

While adding a operator into the RModel during the Parsing stage, its input operators are taken into account into an unordered map (added as a new key-value pair if not already present, otherwise frequency is incremented). During the initialize phase, we calculate the total memory required by the tensors. We keep two separate containers - total memory (for calculating the total memory required by all the tensors) and available memory (for accounting all the reusable memory - memory that was flushed and can be reused). During the generate phase, we Initialize the intermediate tensors when a particular operator needs it as an output, and flush them when no operator no longer needs it as an input.

github-actions · 2025-01-27T19:27:48Z

Test Results

18 files 18 suites 4d 2h 36m 12s ⏱️
2 722 tests 2 721 ✅ 0 💤 1 ❌
47 298 runs 47 292 ✅ 0 💤 6 ❌

For more details on these failures, see this check.

Results for commit 310e38b.

♻️ This comment has been updated with latest results.

…valuation method

…iners

… some operators

…RModel fInputTensorNames as std::vectors instead of std::unordered_set

…constant operator case

sanjibansg changed the title ~~[TMVA][SOFIE] Memory Optimization for Intermediate tensors - v1~~ [WIP] [TMVA][SOFIE] Memory Optimization for Intermediate tensors - v1 Jan 27, 2025

devajithvs assigned lmoneta Jan 27, 2025

sanjibansg changed the title ~~[WIP] [TMVA][SOFIE] Memory Optimization for Intermediate tensors - v1~~ [TMVA][SOFIE] Memory Optimization for Intermediate tensors - v1 Feb 3, 2025

sanjibansg marked this pull request as ready for review February 3, 2025 12:02

sanjibansg requested a review from lmoneta as a code owner February 3, 2025 12:02

sanjibansg added 20 commits February 6, 2025 12:34

feat: initial commit for v1 of intermediate memory optimization

fe94353

feat: EvaluateIntermediateMemory method

4bcd0e9

feat: function definitions for allocating and flushing memory

5d06da6

feat: Session memory pool definition

5ee2e51

feat: CheckAndAllocateIntermediateMemory

e7f1fb0

feat: CheckAndFlushIntermediateMemory()

1262835

fix: declaration of method for fetching input/output operators

fcdbfbc

fix: compilation errors due to type mismatches

fa34033

feat: simplify frequency lookup for intermediate tensors and memory e…

8eafb95

…valuation method

fix: CheckAndFlushIntermediateMemory method with the new memory conta…

e27bb72

…iners

fix: type issues and compilation errors

175430b

fix: remove local state declarations for input and output tensors for…

3228dc3

… some operators

fix: index error for total memory chunks

ba0b9c5

fix: using available memory for initializing output tensors

b5914cf

fix: total size of memory pool

1590bca

fix: modify data types for output tensor names

5e5f877

fix: RModel output tensor data type

ed5c971

feat: input and output tensor names for other operators and put back …

9f5961e

…RModel fInputTensorNames as std::vectors instead of std::unordered_set

feat: add output and input tensor names for remaining operators

02df45f

fix: operator fixes for memory optimization

e2eda75

sanjibansg force-pushed the sofie/mem_optim_v1 branch from d57c063 to e2eda75 Compare February 6, 2025 11:57

sanjibansg added 3 commits February 6, 2025 15:52

fix: allocating multiple output intermediate tensors

b65e945

fix: operator input and output tensor names

9b57c26

fix: broadcasting for expand operator with floating point tensor

36ed6cb

sanjibansg added 13 commits February 7, 2025 12:30

feat: initialize intermediate tensor pointers in the session and fix …

330eed3

…constant operator case

fix: referencing from temporary variables into a string view

c041e77

fix: avoid dynamic tensors in memory optimization v1

7a2a749

fix: all sofie model tests except for BatchNorm

6c24277

fix: flushing of occupied tensors from available memory

59f492b

fix: custom op output tensor names

a09b0d5

fix: optimization for custom operator

7666aea

feat: avoid pre-evaluation->evaluation in one go

f87fa7d

feat: coalesce adjacent memory while flushing

9e3337e

fix: custom opertor input should be const span

1f3bfb6

fix: intermediate tensor memory should be a transient variable

e5310db

fix: add map header file in sofie common

0ec7e39

fix: span in the allowedstdlib set

310e38b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[TMVA][SOFIE] Memory Optimization for Intermediate tensors - v1 #17532

[TMVA][SOFIE] Memory Optimization for Intermediate tensors - v1 #17532

sanjibansg commented Jan 27, 2025

github-actions bot commented Jan 27, 2025 •

edited

Loading

[TMVA][SOFIE] Memory Optimization for Intermediate tensors - v1 #17532

Are you sure you want to change the base?

[TMVA][SOFIE] Memory Optimization for Intermediate tensors - v1 #17532

Conversation

sanjibansg commented Jan 27, 2025

github-actions bot commented Jan 27, 2025 • edited Loading

Test Results

github-actions bot commented Jan 27, 2025 •

edited

Loading