[WebNN] Support Cast fusion specific for int64 data type #23256
base: main
Conversation
Some WebNN backends do not support the int64 data type, but this limitation can be addressed by converting the model's int64 inputs, outputs, and initializers to int32. However, certain ONNX nodes, such as ArgMax, ArgMin, and ScatterND, require the int64 data type for specific inputs or outputs. To handle such cases, we can add Cast nodes before or after these nodes in the model and fuse them during WebNN EP optimization. The fusion strategy is as follows:

1. Verify whether the Cast node can be fused with either the preceding node or the successive node.
2. Check whether the node requiring the int64 data type can be supported solely by addressing the int64 data type limitation, i.e. ensure that the node is unsupported only due to the int64 restriction.
3. Use an is_fusable flag to record paired nodes as <Cast node index, fusable node index> that can be fused together.
4. Mark the fusable nodes as supported after identifying them.
5. During WebNN graph compilation, skip the Cast node and fuse it into its paired fusable node.
/azp run ONNX Runtime Web CI Pipeline,Windows GPU CI Pipeline,Linux Android Emulator QNN CI Pipeline
/azp run Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,Windows ARM64 QNN CI Pipeline,Windows CPU CI Pipeline
/azp run Windows GPU TensorRT CI Pipeline,onnxruntime-binary-size-checks-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,Windows x64 QNN CI Pipeline,Big Models
Azure Pipelines successfully started running 2 pipeline(s).
/azp run Windows GPU CUDA CI Pipeline,Windows GPU DML CI Pipeline,Windows GPU Doc Gen CI Pipeline
Azure Pipelines successfully started running 4 pipeline(s).
Azure Pipelines successfully started running 3 pipeline(s).
Azure Pipelines successfully started running 9 pipeline(s).