Cryo Data Ingest

The idea:

Discover data collections using a metadata provider (CMR in particular; we can make it pluggable in the future if other suitable providers exist)
Associate each collection with a cryo-data datalad repository
Add a data URL for each granule to the associated cryo-data repository
Routine updates: If a collection is still "active", i.e. producing new data, update the associated cryo-data repository to reflect all current granules in the dataset

Collection: A data product; a collection of datasets.

Granule: A dataset in a collection corresponding to a single measurement, e.g. a swath of data, a day of data, or 8 hours of data, depending on the resolution of the collection.

Usage

In early development. The following instructions are temporary.

Set up conda environment (conda env create)
Activate conda environment (conda activate cryo-data-ingest)

Run the "main script" from the root of this repo:

PYTHONPATH=. python cryo_data_ingest/util/cmr.py

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
cryo_data_ingest		cryo_data_ingest
.gitignore		.gitignore
README.md		README.md
environment-lock.yml		environment-lock.yml
environment.yml		environment.yml
json2datalad.sh		json2datalad.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Cryo Data Ingest

Usage

About

Releases

Packages

Languages

cryo-data/cryo-data-ingest

Folders and files

Latest commit

History

Repository files navigation

Cryo Data Ingest

Usage

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages