Bachelor thesis about the integration of new technologies and approaches, such as Airflow and MongoDB, into the DAFOS data warehouse

jevhen-ponomarenko/DAFOS-DWH-extension-thesis

Bachelor thesis

Hi, this is my bachelor thesis about extending the DAFOS data warehouse. The thesis covers the following points:

  • Description of DAFOS in the context of data warehousing
  • Automation of ETL using Apache Airflow
  • Incremental data updates to optimize ETL
  • Implementation of a staging area with query capabilities using MongoDB
  • Centralized log storage
  • Data lineage
  • Integration of a new data source: the European Patent Office
  • Development of a basic testing strategy for code that uses the newly added technologies

How to build

pdfcsplain main.tex; pdfcsplain main.tex; open main.pdf

You need to compile the TeX file twice: the first run only records the references (for example, to images), and the second run resolves them and places them into the document.
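
If you prefer to script the two passes, the following is a minimal sketch and not part of the repository. The function names `build_commands` and `build` and the injectable `run` parameter are made up for illustration; only `pdfcsplain` and `main.tex` come from the command above. Unlike the `;`-separated one-liner, this version stops as soon as a pass fails.

```python
import subprocess
from typing import Callable, List


def build_commands(tex_file: str = "main.tex",
                   engine: str = "pdfcsplain",
                   passes: int = 2) -> List[List[str]]:
    """Return one compiler invocation per pass (hypothetical helper)."""
    if passes < 1:
        raise ValueError("passes must be at least 1")
    return [[engine, tex_file] for _ in range(passes)]


def build(tex_file: str = "main.tex",
          engine: str = "pdfcsplain",
          passes: int = 2,
          run: Callable = subprocess.run) -> int:
    """Run the passes in order and return how many completed.

    With the default runner, check=True makes subprocess.run raise
    CalledProcessError on a non-zero exit code, so a failed first
    pass prevents the second pass from running.
    """
    completed = 0
    for command in build_commands(tex_file, engine, passes):
        run(command, check=True)
        completed += 1
    return completed
```

Calling `build()` from the directory that contains `main.tex` runs both passes, assuming `pdfcsplain` is on your `PATH`; opening `main.pdf` afterwards is left to you, since `open` is specific to macOS.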
