This is a basic Calkit example project, which includes data collection, figure generation, and LaTeX article compilation, all part of a single reproducible DVC pipeline.
You can also view this project on calkit.io.
This project can be reproduced with:
calkit run
- Create a new project on calkit.io.
calkit new question "Can we make reproducibility simple?"
.- Create a Conda environment with
calkit new conda-env -n calkit-example-basic python matplotlib numpy pandas
. - Create the data collection stage with:
dvc stage add -n collect-data \ -o data/raw/data.csv -d \ scripts/collect-data.py \ "calkit runenv -n calkit-example-basic -- python scripts/collect-data.py"
calkit run && git add . && calkit save -am "Run pipeline"
- Add a script and DVC stage to create a figure, similar to the two steps above.
- Add the figure and dataset to
calkit.yaml
by editing thefigures
anddatasets
section manually. - Create a new LaTeX publication and Docker environment using the
latex/article
template. See the docs.