Looking for a PostgreSQL and MongoDB dump between 1TB and 30TB of data? #35

Open
evoxmusic opened this issue Mar 30, 2022 · 5 comments
Labels
help wanted (extra attention is needed), question (further information is requested)

Comments

@evoxmusic
Contributor

Hi, I am looking for a PostgreSQL and a MongoDB data dump, each between 1 TB and 30 TB. The idea is to build a pipeline test for RepliByte with a very large dump file, which would help a lot in optimizing RepliByte. Can anyone share one?

@evoxmusic added the "help wanted" and "question" labels on Mar 30, 2022
@Vagelis-Prokopiou

I don't know if this suggestion will help, but if no one can provide a dump, an alternative would be to create your own.
A simple script could populate the database with however much data you want.
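
A minimal sketch of what such a script might look like for PostgreSQL, assuming a made-up table called `filler` with roughly 1 KB of text per row. The function name, table name, and column names are all hypothetical, so adjust them to whatever schema you actually want to test:

```shell
# Print SQL that creates a filler table (if missing) and inserts rows
# with ids from $1 to $2. Table and column names are hypothetical.
filler_sql() {
  first=$1
  last=$2
  cat <<SQL
CREATE TABLE IF NOT EXISTS filler (id bigint PRIMARY KEY, payload text);
INSERT INTO filler (id, payload)
SELECT g, repeat(md5(g::text), 32)  -- about 1 KB of text per row
FROM generate_series($first, $last) AS g;
SQL
}

# Usage (not run here; "mydb" is a placeholder database name):
# filler_sql 1 1000000 | psql mydb
```

Calling it in a loop with consecutive id ranges keeps each insert bounded, and you can stop once the database reaches the size you are after.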

@evoxmusic
Contributor Author

That's a good idea, and probably the easier route today. Happy to have your contribution if it's doable for you :)

@Vagelis-Prokopiou

I'd be happy to help, but creating such a dump isn't feasible for me because I don't have enough hard drive space :-)

@markrechler
Contributor

For PostgreSQL, you could generate the data with pgbench, something along the lines of:

psql -c 'CREATE DATABASE pg_test'
pgbench -i -s 75000 pg_test

The scaling factor is based on an older wiki entry:
https://wiki.postgresql.org/wiki/Pgbenchtesting
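
As a rough sanity check on the scale factor, here is a small sizing sketch. pgbench creates 100,000 `pgbench_accounts` rows per scale unit; the figure of about 15 MB on disk per scale unit is only an assumed ballpark, and the helper names are made up for illustration, so measure the real size on your own setup before relying on it:

```shell
# Rough sizing helpers for "pgbench -i -s <scale>".
# ASSUMPTION: about 15 MB of on-disk data per scale unit. This is a ballpark,
# not a guarantee; check with pg_database_size() after a small test run.
MB_PER_SCALE=15

# Print the scale factor needed to reach roughly the given size in GB
# (1 GB is treated as 1000 MB; the result is rounded up).
scale_for_gb() {
  target_gb=$1
  echo $(( (target_gb * 1000 + MB_PER_SCALE - 1) / MB_PER_SCALE ))
}

# Print the number of pgbench_accounts rows for a scale factor
# (pgbench creates 100,000 account rows per scale unit).
rows_for_scale() {
  echo $(( $1 * 100000 ))
}
```

Under that assumption, `scale_for_gb 1000` gives 66667, so a scale factor of 75000 should land a little above 1 TB, and `rows_for_scale 75000` gives 7,500,000,000 rows.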

@evoxmusic
Contributor Author

Thank you, that's super useful.


3 participants