
feat: Add created_at and updated_at to all tables #3225

Open
MartinquaXD opened this issue Jan 8, 2025 · 4 comments

Comments

@MartinquaXD
Contributor

Problem

This was sparked by a discussion regarding data analytics efforts. Mirroring the contents of a DB is easier if these fields are available. It's also generally nice to have timestamps for all sorts of data.

Suggested solution

Implement a DB migration that adds triggers to all tables which set created_at once when a new row gets created and updated_at whenever the row gets modified.
That means whenever a new row gets created, created_at and updated_at would be initialized to the same value.
Doing it in the DB means we don't have to adjust any Rust code, and it's probably less error-prone as well.
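
A minimal sketch of what such a migration could look like, assuming Postgres; the table name `orders` is just an illustration, and the real migration would apply the same pattern to every table:

```sql
-- Sketch only: `orders` stands in for any table; names are illustrative.
-- Adding the columns with a DEFAULT also backfills all existing rows
-- with the timestamp at which the migration runs.
ALTER TABLE orders
    ADD COLUMN created_at timestamptz NOT NULL DEFAULT now(),
    ADD COLUMN updated_at timestamptz NOT NULL DEFAULT now();

-- Shared trigger function that bumps updated_at on every modification.
CREATE OR REPLACE FUNCTION set_updated_at() RETURNS trigger AS $$
BEGIN
    NEW.updated_at = now();
    RETURN NEW;
END;
$$ LANGUAGE plpgsql;

CREATE TRIGGER orders_set_updated_at
    BEFORE UPDATE ON orders
    FOR EACH ROW
    EXECUTE FUNCTION set_updated_at();
```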

Acceptance criteria

All tables receive created_at and updated_at columns storing timestamps of the respective event.
All historic data gets backfilled with the timestamp at which the migration was applied.

@sunce86
Contributor

sunce86 commented Jan 8, 2025

Can you give us more details on why exactly this is needed?

I'd like us to challenge every decision to add data to the backend database that is not directly used by backend services (unless it's really necessary, which does seem to be the case here). Otherwise, we keep adding dependencies for other teams that use our database directly, and it gets increasingly harder to make backend changes without breaking other things.

@MartinquaXD
Contributor Author

Overall, I personally feel like created_at and updated_at (the latter less so) are generally things that make sense to have in any DB. Besides that, this suggestion came up to aid the effort of creating a unified and reliable data pipeline for any analytics needs. These fields can be used to optimize synchronizing the backend and analytics DBs after we apply a migration to the backend DB.

@fleupold
Contributor

fleupold commented Jan 8, 2025

This has come up multiple times in the past (e.g. when discussing data bucketing by month, which requires a timestamp for the solver rewards accounting script to know into which bucket a certain settlement falls, and recently when adjusting the Tenderly web3 action).

I believe the solver team has now built a workaround to fetch the timestamp for trades, and in the web3 action we have to add a ton of network-specific logic in order to express something like "check if the trade was less than 24h ago".

I agree with your worry that there is risk in adding domain-specific columns which other teams depend on and that make it harder for us to refactor later. However, the timestamps of when a DB row was created (and updated) are not domain-specific and don't contribute to the risk you mentioned (there are DBs which store this metadata by default; unfortunately, Postgres is not one of them).

@ferrau10

ferrau10 commented Jan 9, 2025

For the data pipeline this would be really useful: we copy data from the backend DB to the analytics DB. We will do a one-time full load of all the tables we use in the pipeline and then automate incremental loads using dune sync v2. We could of course look at the index of the source table and only copy data when the index does not exist in the target table, but this can be quite heavy for large datasets, and we need to be sure each table has an index. And in case a row is updated but the index does not change, we would not catch the change. We could also look at tables that have columns that increase, like block number, and only load the rows where the block number is higher than the max block number we have in the analytics DB. But if there is ever a block that somehow gets added to the backend DB later on, we would not catch that. And, as far as I know, not all tables have a column that increases linearly.
Generally, loading the rows from a source table into a target table where the row's updated_at is higher than the max(updated_at) of the target table is very easy and reliable, and it also simplifies the process as it's the same for all tables.
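
For illustration, a minimal sketch of such an incremental load, assuming both tables are reachable from one session (e.g. via a foreign data wrapper); the schema and table names are hypothetical:

```sql
-- Sketch only: backend.orders and analytics.orders are hypothetical names.
-- Copies every row that changed since the last sync into the analytics DB.
INSERT INTO analytics.orders
SELECT src.*
FROM backend.orders AS src
WHERE src.updated_at > (
    SELECT coalesce(max(updated_at), '-infinity') FROM analytics.orders
);
```

A real pipeline would upsert on each table's primary key so that updated rows replace their earlier copies instead of being inserted twice.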
