Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[data lineage] Showing the user what tables & corresponding columns got changed while loading into the database. #132

Open
redicane opened this issue Feb 16, 2023 · 1 comment
Labels
enhancement New feature or request

Comments

@redicane
Copy link
Contributor

DLT will change any column and table name to lower case and will also replace spaces with '_'. As to not confuse the user of the pipeline, it might be helpful to keep track and show these changes after the pipeline has finished loading data into the database. This could be done in load info (showing just how many name changes occurred), logging info every time there is a name change and this information should also be in the schema after the pipeline has finished loading.
Additionally if a column only has NULL values for all rows, it will be dropped. I think this should also be shown to a user. Basically any change of this sort where a user expects thing A to be loaded and thing B (or nothing) gets loaded should be shown to the user.

@rudolfix
Copy link
Collaborator

@redicane

Additionally if a column only has NULL values for all rows, it will be dropped
then it is a bug... I will create a ticket
this information should also be in the schema after the pipeline has finished loading
I'm planning to add lineage information to the schema. I'll attach this ticket to it

@rudolfix rudolfix added the enhancement New feature or request label Mar 8, 2023
@rudolfix rudolfix changed the title Showing the user what tables & corresponding columns got changed while loading into the database. [data lineage] Showing the user what tables & corresponding columns got changed while loading into the database. Mar 8, 2023
@rudolfix rudolfix moved this from Todo to Planned in dlt core library Aug 18, 2023
@rudolfix rudolfix moved this from Planned to Todo in dlt core library Aug 18, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
Status: Todo
Development

No branches or pull requests

2 participants