Skip to content

Django app that collects data from parquet files, provides the data in json format, and then consumes and returns the data.

Notifications You must be signed in to change notification settings

Utlak88/NarrativeWave-App

Repository files navigation

NarrativeWave App

This application has two primary features:

  1. Utilizes a REST API service to collect data from parquet files and provides the data in json 'record' format.
  2. Consumes the REST API service and returns the json-formatted data in the browser.

Other features of the app include:

  • Asset and column models with a one-asset-to-many-column relation
  • Page that shows all possible parquet queries

A function has also been developed in the 'read_csv_and_store_parquet.py' that uses PySpark to convert csv data into wide-formatted parquet files partitioned by asset, year, and month.

Pertaining to Django testing, Form, Model, and View tests have been developed and provide 100% coverage.

Additional File

A Google Colaboratory notebook explaining the steps to process long-formatted csv files into wide-formatted partitioned parquet files has also been included in this repo.

Credits

Developed by Stephen Utlak.

License

This software is available under the MIT license.

About

Django app that collects data from parquet files, provides the data in json format, and then consumes and returns the data.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published