Inherit tags from nodes for modular pipelines #1878

jitu5 · 2024-04-25T13:37:16Z

Description

Related to #1822

When expandModularPipelines = false and the filtered nodes are inside a modular pipeline, the flowchart should show the modular pipeline node (collapsed view), but that node currently appears to be missing on refresh. This PR aims to resolve this issue.
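
A minimal sketch of the intended behaviour, for illustration only. The `Node` and `ModularPipeline` classes and the `visible_in_collapsed_view` helper are hypothetical stand-ins, not the kedro-viz models. The assumption is that, in the collapsed view, only the modular pipeline node itself is checked against the tag filter, so it has to carry the union of its children's tags to stay visible.

```python
from dataclasses import dataclass, field
from typing import List, Set


@dataclass
class Node:
    """Hypothetical stand-in for a task node; not the kedro-viz model."""
    id: str
    tags: Set[str] = field(default_factory=set)


@dataclass
class ModularPipeline:
    """Hypothetical stand-in for a collapsed modular pipeline node."""
    id: str
    children: List[Node] = field(default_factory=list)
    tags: Set[str] = field(default_factory=set)

    def inherit_tags(self) -> None:
        # Take the union of every child's tags so the collapsed node
        # matches any tag filter that one of its children matches.
        for child in self.children:
            self.tags.update(child.tags)


def visible_in_collapsed_view(pipeline: ModularPipeline, selected_tag: str) -> bool:
    # Assumption: with modular pipelines collapsed, only the pipeline
    # node itself is checked against the selected tag.
    return selected_tag in pipeline.tags


pipeline = ModularPipeline(
    id="data_science",
    children=[Node("train_model", {"training"}), Node("evaluate", {"reporting"})],
)
shown_before = visible_in_collapsed_view(pipeline, "training")
pipeline.inherit_tags()
shown_after = visible_in_collapsed_view(pipeline, "training")
```

Before `inherit_tags()` runs, the collapsed node has no tags and the filter drops it. Afterwards it matches any tag used by one of its children.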

Development notes

Screen.Recording.2024-04-26.at.10.27.21.a.m.mov

QA notes

Added tests to cover changes.

Checklist

Read the contributing guidelines
Opened this PR as a 'Draft Pull Request' if it is work-in-progress
Updated the documentation to reflect the code changes
Added new entries to the RELEASE.md file
Added tests to cover my changes

Signed-off-by: Jitendra Gundaniya <[email protected]>

Signed-off-by: <>

package/kedro_viz/models/flowchart.py

Signed-off-by: Jitendra Gundaniya <[email protected]>

rashidakanchwala · 2024-04-30T21:39:59Z

I took a slightly different approach here: main...inherit-tags. Let me know what you think.

ravi-kumar-pilla · 2024-04-30T22:08:18Z

I took a slightly different approach here: main...inherit-tags. Let me know what you think.

I think this updates the tags at the same time as we add inputs/outputs to the immediate parent. We are updating the grandparents here anyway: expanded_tree[parent_id].tags.update(modular_pipeline_node.tags). Looks good to me. Thank you.
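
The parent and grandparent update described above can be sketched roughly as follows. This is an illustration under assumptions, not the code on the branch: the tree here maps modular pipeline ids straight to tag sets, ids are assumed to be dotted namespaces, and `ancestor_ids` and `propagate_tags` are hypothetical helper names.

```python
from collections import defaultdict
from typing import Dict, Iterator, Set


def ancestor_ids(modular_pipeline_id: str) -> Iterator[str]:
    """Yield parent, grandparent, ... ids for a dotted namespace id."""
    parts = modular_pipeline_id.split(".")
    # "a.b.c" yields "a.b", then "a"
    for end in range(len(parts) - 1, 0, -1):
        yield ".".join(parts[:end])


def propagate_tags(tree: Dict[str, Set[str]], modular_pipeline_id: str) -> None:
    """Copy a modular pipeline's tags onto each of its ancestors."""
    for parent_id in ancestor_ids(modular_pipeline_id):
        tree[parent_id].update(tree[modular_pipeline_id])


# Hypothetical tree: modular pipeline id -> set of tags
tree: Dict[str, Set[str]] = defaultdict(set)
tree["ingestion.cleaning.dedupe"].update({"preprocessing"})
propagate_tags(tree, "ingestion.cleaning.dedupe")
```

After the call, both `ingestion.cleaning` and `ingestion` carry the `preprocessing` tag, so a collapsed ancestor at any depth still matches the filter.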

Signed-off-by: Jitendra Gundaniya <[email protected]>

package/kedro_viz/data_access/managers.py

rashidakanchwala

LGTM. thanks!!

Signed-off-by: Jitendra Gundaniya <[email protected]>

ravi-kumar-pilla · 2024-05-03T04:56:56Z

package/kedro_viz/data_access/repositories/modular_pipelines.py

@@ -161,6 +162,27 @@ def add_output(self, modular_pipeline_id: str, output_node: GraphNode):
        else:
            self.tree[modular_pipeline_id].external_outputs.add(output_node.id)

+    def add_tags(self, modular_pipeline_id: str, node_tags: set):


I was going through the modular pipelines code and found that we update the modular pipeline's pipelines set with node.pipelines. Maybe we can update the tags at that point too.

For example, at line 239 in this file, inside the extract_from_node function, we could have something like modular_pipeline.tags.update(node.tags). In that case we would not need this function, nor the call at line 203 in managers.py. What do you think, @rashidakanchwala?

That's how I believe @jitu5 approached it previously.
He had some code in the extract_from_node function, which I found very confusing because the function was originally defined to extract namespaces, but now it's getting tags.
I think it's important to have clean functions that handle one task each, rather than one function doing too many things.

I'm also personally confused by the extract_from_node function being called three times here - link to GitHub code.
This is exactly the kind of code refactoring/simplification that Ivan mentioned to us the other day.

I agree with having clean functions. If you look at our add_pipeline function, it does all of the tasks below:

Populates the RegisteredPipelinesRepository

Populates the ModularPipelinesRepository

Populates the GraphNodesRepository

Apart from the above tasks, it also creates the TaskNode and DataNode and assigns inputs/outputs to the created nodes.

The add_pipeline function is already complex. If we also update the tags of the created modular pipeline there, we add a step that belongs only to the modular pipeline.

Yes, extract_from_node also does more than just extract the namespace. It performs the tasks below, but all of them belong to the modular pipeline:

Creates the modularPipeline

Updates the pipelines set with the node's pipelines

Adds the node as a child of the created modular pipeline

We should probably rename this to something like add_modular_pipeline. I feel the tags of the modular pipeline should be updated here because this is where we create the modular pipeline from the node namespace and update the required attributes/properties. It is fine to have add_tags as a separate function that updates the tags set (though not recursively, as @jitu5 did initially) and to call it from extract_from_node.
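
A rough sketch of that arrangement, for illustration only. The add_tags signature follows the diff above, but the class bodies are simplified assumptions: `ModularPipelineSketch`, `NodeSketch` and `ModularPipelinesRepositorySketch` are hypothetical names, not the kedro-viz classes, and the real extract_from_node does more than is shown here.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional, Set


@dataclass
class ModularPipelineSketch:
    """Hypothetical, simplified modular pipeline record."""
    id: str
    pipelines: Set[str] = field(default_factory=set)
    tags: Set[str] = field(default_factory=set)
    children: Set[str] = field(default_factory=set)


@dataclass
class NodeSketch:
    """Hypothetical, simplified node; not the kedro-viz GraphNode."""
    id: str
    namespace: Optional[str]
    pipelines: Set[str] = field(default_factory=set)
    tags: Set[str] = field(default_factory=set)


class ModularPipelinesRepositorySketch:
    def __init__(self) -> None:
        self.tree: Dict[str, ModularPipelineSketch] = {}

    def add_tags(self, modular_pipeline_id: str, node_tags: set) -> None:
        # Non-recursive: only the given modular pipeline is updated.
        self.tree[modular_pipeline_id].tags.update(node_tags)

    def extract_from_node(self, node: NodeSketch) -> Optional[str]:
        if node.namespace is None:
            return None
        # Create the modular pipeline on first sight of its namespace.
        modular_pipeline = self.tree.setdefault(
            node.namespace, ModularPipelineSketch(id=node.namespace)
        )
        # Everything below concerns the modular pipeline only.
        modular_pipeline.pipelines.update(node.pipelines)
        modular_pipeline.children.add(node.id)
        self.add_tags(node.namespace, node.tags)
        return node.namespace


repo = ModularPipelinesRepositorySketch()
repo.extract_from_node(
    NodeSketch("split_data", "data_science", {"__default__"}, {"training"})
)
repo.extract_from_node(
    NodeSketch("train_model", "data_science", {"__default__"}, {"modelling"})
)
no_namespace = repo.extract_from_node(NodeSketch("load_raw", None))
```

Keeping add_tags as its own small method leaves the tag update testable in isolation, while the call site stays next to the other modular pipeline bookkeeping.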

Also, I feel that having clean functions is about grouping tasks that belong to a particular area (in this case, modular pipelines). We did this when adding resolve_dataset_factory_patterns, which resolves dataset patterns in the catalog. Since this belongs to the catalog, we added it inside add_catalog. Again, add_catalog does more than just add the catalog; it also adds tracking datasets (another function to refactor 🗡️).

Thank you

Signed-off-by: Jitendra Gundaniya <[email protected]>

ravi-kumar-pilla

LGTM 👍 !!

Inherit tags from nodes for modular pipelines

26b8900

Signed-off-by: Jitendra Gundaniya <[email protected]>

jitu5 requested a review from SajidAlamQB April 25, 2024 13:37

jitu5 and others added 4 commits April 25, 2024 14:37

Merge branch 'main' into feature/modular_pipeline_tag

15db91b

Lint fix

2e15a47

Signed-off-by: <>

Unit test fix

6e8dadc

Signed-off-by: <>

Unit test fix

218cfc8

Signed-off-by: <>

jitu5 self-assigned this Apr 25, 2024

jitu5 and others added 3 commits April 25, 2024 19:11

Unit test fix

1d8486c

Signed-off-by: <>

unit test fix

aab6d36

Signed-off-by: <>

Merge branch 'main' into feature/modular_pipeline_tag

2288268

ravi-kumar-pilla reviewed Apr 26, 2024

package/kedro_viz/models/flowchart.py (outdated)

jitu5 marked this pull request as ready for review April 26, 2024 15:15

jitu5 requested a review from rashidakanchwala as a code owner April 26, 2024 15:15

jitu5 and others added 3 commits April 26, 2024 16:47

tags removed

70e18e0

Signed-off-by: Jitendra Gundaniya <[email protected]>

done

0e9d51f

pipeline tree

42d466c

jitu5 added 6 commits May 1, 2024 16:09

Merge branch 'inherit-tags' into feature/modular_pipeline_tag

7a205b0

new approach for tags

c8f4983

Signed-off-by: Jitendra Gundaniya <[email protected]>

lint fix

b510696

Signed-off-by: Jitendra Gundaniya <[email protected]>

Lint fix

398929d

Signed-off-by: Jitendra Gundaniya <[email protected]>

lint fix

89d467a

Signed-off-by: Jitendra Gundaniya <[email protected]>

None check for pipeline_id added

4c78489

Signed-off-by: Jitendra Gundaniya <[email protected]>

rashidakanchwala reviewed May 2, 2024

package/kedro_viz/data_access/managers.py (outdated)

rashidakanchwala approved these changes May 2, 2024

Comment removed

f032b47

Signed-off-by: Jitendra Gundaniya <[email protected]>

ravi-kumar-pilla reviewed May 3, 2024

jitu5 and others added 2 commits May 3, 2024 17:52

New approach for modular pipeline tag

79e5c52

Merge branch 'main' into feature/modular_pipeline_tag

76f9ea6

Update test_modular_pipelines.py

d575a3f

Signed-off-by: Jitendra Gundaniya <[email protected]>

ravi-kumar-pilla approved these changes May 3, 2024

jitu5 merged commit 9e661f6 into main May 7, 2024
44 checks passed

jitu5 deleted the feature/modular_pipeline_tag branch May 7, 2024 09:19

rashidakanchwala mentioned this pull request May 29, 2024

Release/9.1.0 #1919

Merged

5 tasks
