Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support mlflow.log_table #634

Open
qwe313 opened this issue Feb 16, 2025 · 0 comments
Open

Support mlflow.log_table #634

qwe313 opened this issue Feb 16, 2025 · 0 comments
Labels
enhancement New feature or request

Comments

@qwe313
Copy link

qwe313 commented Feb 16, 2025

Description

To manually evaluate some outputs of a run in the Evaluation tab in the MLflow UI, you need to log data using the mlflow.log_table method, which is not currently supported by any of the Dataset implementations in kedro-mlflow.

Context

I wanted to be able to view the table I created in one of the nodes in the Evaluation tab to compare it with other runs.
I've tried logging JSON as an artifact using MlflowArtifactDataset and pandas.JSONDataset with orient: "split" (since this is the same format MLflow uses when logging artifacts with mlflow.log_table).

As a workaround, I manually call mlflow.log_table from inside the node at the end of execution, but I am not sure if this is reliable and will always log data to the correct run, as I was unable to retrieve the run_id of the current run from inside the node.

Possible Implementation

Probably additional Dataset that will use mlflow.log_table instead of mlflow.log_artifact

Possible Alternatives

Manual logging but this is probably not very reliable

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
Status: 🔖 Ready
Development

No branches or pull requests

2 participants