Improve Postgres subsetting performance #51

evoxmusic · 2022-04-24T11:37:53Z

I've tried to subset a Postgres DB of 2GB of data and RepliByte took 38 minutes to complete. I suspect the function subset.postgres.filter_insert_into_rows(..) to be the bottleneck since it is called multiple times and scan the entire file (even if there is a small index).

Something that can be done to drastically reduce the time would be to split the dump into multiple table files. Then scan will be limited to the table.

The text was updated successfully, but these errors were encountered:

evoxmusic · 2022-04-24T11:38:34Z

Note: performances depend on how deep is the graph. My testing database contains 41 tables.

evoxmusic added the enhancement New feature or request label Apr 24, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve Postgres subsetting performance #51

Improve Postgres subsetting performance #51

evoxmusic commented Apr 24, 2022

evoxmusic commented Apr 24, 2022

Improve Postgres subsetting performance #51

Improve Postgres subsetting performance #51

Comments

evoxmusic commented Apr 24, 2022

evoxmusic commented Apr 24, 2022