You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We are working with a lot of different sources of data, without any source of truth or centralized repository of clean datasets.
That is an issue because there are some proprietary data sources that cannot be shared publicly (e.g uploaded to the public repo) and also people working with their own datasets locally makes reproduction of their work very difficult.
It would be useful to create some structure so that specific analyses and their input and output data would be tied together (pipelines) and also have some kind of data store that can act as source of truth.
We can look at what the Zetkin infra looks like, and if there is already some cloud database that we can piggyback off of.
The text was updated successfully, but these errors were encountered:
We are working with a lot of different sources of data, without any source of truth or centralized repository of clean datasets.
That is an issue because there are some proprietary data sources that cannot be shared publicly (e.g uploaded to the public repo) and also people working with their own datasets locally makes reproduction of their work very difficult.
It would be useful to create some structure so that specific analyses and their input and output data would be tied together (pipelines) and also have some kind of data store that can act as source of truth.
We can look at what the Zetkin infra looks like, and if there is already some cloud database that we can piggyback off of.
The text was updated successfully, but these errors were encountered: