Add test for deepseek_vl #1136
base: main
Conversation
Don't we have HuggingFace versions for which we don't need to copy model implementations?
return framework_model, vl_gpt, tokenizer, inputs_embeds

def generation(max_new_tokens, model, inputs_embeds, tokenizer, vl_gpt):
Great work @meenakshiramanathan1. Can we reuse this script for other generation models like t5, llama, and more? BTW, we can expect a conflict related to vl_gpt used here.
cc: @nvukobratTT May I know your thoughts on this?
The model doesn't have a direct HuggingFace version, hence the model implementation is taken from the corresponding GitHub repository.
…ps config (#1231)

The [generate model ops test pipeline](https://github.com/tenstorrent/tt-forge-fe/actions/runs/13328380954/job/37226649520) is currently freezing during the unique ops configuration extraction phase.

Error: `Failed on "DecomposeEinsum" TVM callback`

This error is encountered in the test case `forge/test/models/pytorch/vision/detr/test_detr.py::test_detr_segmentation[facebook/detr-resnet-50-panoptic]`.

To prevent the extraction process from hanging indefinitely, a timeout of 1200 seconds (20 minutes) has been added. This ensures that if the unique ops configuration extraction takes too long, the test will be terminated.
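The timeout pattern described in the commit message above can be sketched as follows. This is a hypothetical helper, not the actual pipeline change: it runs a function in a child process and terminates it if it outlives the deadline, which is one common way to bound a step that may hang inside native code (such as a TVM callback).

```python
import multiprocessing

def run_with_timeout(fn, args=(), timeout_seconds=1200):
    """Run `fn` in a child process; kill it if it exceeds the timeout.

    Hypothetical sketch of the timeout pattern from the commit message;
    the real change lives in the tt-forge-fe test pipeline.
    """
    proc = multiprocessing.Process(target=fn, args=args)
    proc.start()
    proc.join(timeout_seconds)
    if proc.is_alive():
        # The extraction step is still running past the deadline:
        # terminate it and surface a clear error instead of hanging.
        proc.terminate()
        proc.join()
        raise TimeoutError(f"step exceeded {timeout_seconds}s")
```

A process (rather than a thread) is used here because a hung native call cannot be interrupted from Python threads, while a child process can always be terminated.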
As this is a lot of new code that is mostly copied from GitHub, let me see if we can get a new repository for this purpose, and then link it to ours.
E.g. have a repo that contains all third-party GitHub repositories as submodules, from which we can load this and similar models and run compile against them. That way, in FFE we'll only store model tests, plus wrappers if some modification is needed.
What do you think @meenakshiramanathan1?
Sure @nvukobratTT, I will keep this in draft for now.
That sounds good @nvukobratTT, we can do that. Since we are going to focus on P1 models and on models supported in tt-torch that are not yet supported in tt-forge-fe, what will be the priority of moving all the GitHub-copied code to a new repo? As you know, some other models also fall into this case. Could you please clarify this?
Synced offline
The model doesn't have a direct HuggingFace version, hence the model implementation comes from the corresponding GitHub repository.
The model requires timm version >= 0.9.16, which conflicted with the segmentation_models_pytorch package, so both package versions have been upgraded to resolve the conflict.
The custom generate function produces text by iteratively predicting tokens while maintaining a fixed sequence length using padding. The model input shape is kept static by preallocating padded_inputs_embeds, which is updated at each step with the embedding of next_token_id until the sequence is complete.
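The static-shape generation loop described above can be sketched as below. This is a simplified, hypothetical version: `model`, `embed_fn`, and the argument names are stand-ins, not the PR's actual wrappers, and the loop does greedy decoding only.

```python
import torch

def generate_static(model, inputs_embeds, embed_fn, max_new_tokens, pad_len, eos_id):
    """Greedy generation with a fixed (padded) sequence length.

    Hypothetical sketch: `model` maps embeddings -> logits and
    `embed_fn` maps token ids -> embeddings; names are assumptions.
    """
    batch, seq_len, dim = inputs_embeds.shape
    # Preallocate a padded buffer so the model's input shape never changes.
    padded_inputs_embeds = torch.zeros(batch, pad_len, dim)
    padded_inputs_embeds[:, :seq_len] = inputs_embeds

    generated = []
    for step in range(max_new_tokens):
        logits = model(padded_inputs_embeds)  # (batch, pad_len, vocab)
        # Read the prediction at the last filled position.
        next_token_id = logits[:, seq_len + step - 1].argmax(dim=-1)
        generated.append(next_token_id.item())
        if next_token_id.item() == eos_id:
            break
        # Write the new token's embedding into the preallocated buffer,
        # keeping the overall input shape static.
        padded_inputs_embeds[:, seq_len + step] = embed_fn(next_token_id)
    return generated
```

The key point is that the buffer is allocated once at `pad_len` and only its contents change per step, so the compiled graph sees the same input shape on every iteration.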
The model is currently failing in verify with a PCC drop (0.96).