Support for Cyclic Dependencies in Kedro Pipelines for Reinforcement Learning Scenarios #3817
Sino-Huang
started this conversation in
Idea
Replies: 1 comment 1 reply
-
Hi @Sino-Huang , thanks for opening this! I converted it to a discussion. Typically, what other frameworks exist for reinforcement learning? How do you define your convergence criteria so that the pipeline eventually stops? |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Description
I'm currently facing challenges with the Kedro pipeline structure, specifically its limitation to Directed Acyclic Graphs (DAGs). In reinforcement learning applications, the ability to create cyclic loops within the pipeline is crucial. For instance, a learning policy generates data that is then used to further train and refine the same policy. The current DAG structure does not support these types of cyclic dependencies, which is limiting for projects that involve iterative data generation and processing loops.
Context
The addition of support for cyclic dependencies is important because it would allow for more flexible pipeline configurations, especially beneficial in the context of AI and machine learning projects where iterative feedback loops are common. This feature would not only benefit my projects but also broaden Kedro's applicability in advanced machine learning scenarios, promoting its adoption and enhancing its utility.
Possible Implementation
One way to implement this could be by allowing users to define nodes or sub-pipelines that can conditionally loop back to earlier stages based on runtime data or conditions. Or let's have a counter to count the loop.
Beta Was this translation helpful? Give feedback.
All reactions