Contains records of the official GNB data at each stage of data processing, along with the scripts used at each step. The stages of the data pipeline are as follows:
- data_raw: The raw data, provided directly by the Government of New Brunswick.
- data_xlsx: The data in
.xlsx
format, converted by raw_to_xlsx.py. - data_clean: The cleaned data, processed by xlsx_to_clean.py (and additional helper scripts in helper_scripts/xlsx_to_clean).
- data_final: The final data used for analysis, produced by clean_to_final.py (and additional helper scripts in helper_scripts/clean_to_final).
- data_analysis: [Add description later]