- Author: Ernesto Saavedra, February 2022
In this project, we create, deploy, and monitor a risk assessment ML model that estimates the attrition risk of a company's clients. We set up regular monitoring to ensure that the model remains accurate and up to date, along with processes and scripts to re-train, re-deploy, monitor, and report on the model, so that the company gets risk assessments that are as accurate as possible and can minimize client attrition.
You'll complete the project by proceeding through 5 steps; an illustrative sketch of each step follows this list:
- Data ingestion. Automatically check a database for new data that can be used for model training. Compile all training data to a training dataset and save it to persistent storage. Write metrics related to the completed data ingestion tasks to persistent storage.
- Training, scoring, and deploying. Write scripts that train an ML model that predicts attrition risk, and score the model. Write the model and the scoring metrics to persistent storage.
- Diagnostics. Determine and save summary statistics related to a dataset. Time the performance of model training and scoring scripts. Check for dependency changes and package updates.
- Reporting. Automatically generate plots and documents that report on model metrics. Provide an API endpoint that can return model predictions and metrics.
- Process Automation. Create a script and cron job that automatically run all previous steps at regular intervals.
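A minimal sketch of the data ingestion step, assuming the new data arrives as CSV files in an input folder and that the output names (finaldata.csv, ingestedfiles.txt) follow common conventions rather than the exact files produced by this repository:

```python
import json
import os
from datetime import datetime

import pandas as pd


def ingest_data(input_folder: str, output_folder: str) -> None:
    """Merge every CSV in input_folder into one deduplicated training dataset."""
    frames, ingested = [], []
    for name in sorted(os.listdir(input_folder)):
        if name.endswith(".csv"):
            frames.append(pd.read_csv(os.path.join(input_folder, name)))
            ingested.append(name)
    if not frames:
        return

    merged = pd.concat(frames, ignore_index=True).drop_duplicates()
    os.makedirs(output_folder, exist_ok=True)
    merged.to_csv(os.path.join(output_folder, "finaldata.csv"), index=False)

    # Persist ingestion metrics so later runs can detect new source files.
    record = {"timestamp": datetime.now().isoformat(),
              "files": ingested, "rows": len(merged)}
    with open(os.path.join(output_folder, "ingestedfiles.txt"), "w") as fh:
        json.dump(record, fh)
```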
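Training and scoring could look roughly like the sketch below; the logistic-regression classifier, the `exited` target column, and F1 as the score are assumptions, not necessarily the choices made in this repository:

```python
import pickle

import pandas as pd
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import f1_score


def train_model(train_csv: str, model_path: str) -> None:
    """Fit a classifier on the ingested data and persist it with pickle."""
    data = pd.read_csv(train_csv)
    X = data.drop(columns=["exited"]).select_dtypes("number")
    y = data["exited"]
    model = LogisticRegression(max_iter=1000).fit(X, y)
    with open(model_path, "wb") as fh:
        pickle.dump(model, fh)


def score_model(test_csv: str, model_path: str, score_path: str) -> float:
    """Score the persisted model on test data and write the F1 score to disk."""
    data = pd.read_csv(test_csv)
    X = data.drop(columns=["exited"]).select_dtypes("number")
    y = data["exited"]
    with open(model_path, "rb") as fh:
        model = pickle.load(fh)
    score = f1_score(y, model.predict(X))
    with open(score_path, "w") as fh:
        fh.write(str(score))  # kept as the 'latest score' for re-deployment checks
    return score
```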
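For the diagnostics step, a sketch of summary statistics, script timing, and a dependency check; the exact checks performed in this repository may differ:

```python
import subprocess
import timeit

import pandas as pd


def dataframe_summary(csv_path: str) -> dict:
    """Means, medians, standard deviations, and NA percentage per numeric column."""
    df = pd.read_csv(csv_path).select_dtypes("number")
    return {
        "mean": df.mean().to_dict(),
        "median": df.median().to_dict(),
        "std": df.std().to_dict(),
        "na_pct": (df.isna().mean() * 100).to_dict(),
    }


def execution_time(script: str) -> float:
    """Seconds needed to run one of the pipeline scripts end to end."""
    return timeit.timeit(
        lambda: subprocess.run(["python", script], check=True), number=1)


def outdated_packages() -> str:
    """List installed packages whose versions lag behind the newest releases."""
    return subprocess.run(["pip", "list", "--outdated"],
                          capture_output=True, text=True).stdout
```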
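The reporting API could be served by a small Flask app along the lines of the sketch below; the route names, the `exited` column, and the file paths are assumptions rather than the repository's actual API:

```python
import pickle

import pandas as pd
from flask import Flask, jsonify, request
from sklearn.metrics import f1_score

app = Flask(__name__)

# Assumed location of the deployed model; adjust to the repository's layout.
with open("production_deployment/trainedmodel.pkl", "rb") as fh:
    MODEL = pickle.load(fh)


@app.route("/prediction", methods=["POST"])
def prediction():
    """Return model predictions for the dataset named in the JSON request body."""
    data = pd.read_csv(request.json["dataset_path"])
    X = data.drop(columns=["exited"]).select_dtypes("number")
    return jsonify(predictions=MODEL.predict(X).tolist())


@app.route("/scoring", methods=["GET"])
def scoring():
    """Return the F1 score of the deployed model on the held-out test data."""
    data = pd.read_csv("testdata/testdata.csv")  # assumed path
    X = data.drop(columns=["exited"]).select_dtypes("number")
    return jsonify(f1=float(f1_score(data["exited"], MODEL.predict(X))))


if __name__ == "__main__":
    app.run(host="0.0.0.0", port=8000)
```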
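Process automation then reduces to a wrapper script that cron calls at a fixed interval; the script names and schedule below are illustrative only:

```python
# Illustrative wrapper in the spirit of fullprocess.py: run the pipeline scripts
# in order. A crontab entry that launches it every 10 minutes might look like:
#   */10 * * * * cd /path/to/project && python fullprocess.py
import subprocess

PIPELINE = ["ingestion.py", "training.py", "scoring.py",
            "deployment.py", "diagnostics.py", "reporting.py"]  # assumed names


def run_pipeline() -> None:
    for script in PIPELINE:
        subprocess.run(["python", script], check=True)


if __name__ == "__main__":
    run_pipeline()
```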
The scripts in this folder implement all requested steps defined in the Project Steps Overview.
Some explanations regarding the implementation:
- To run all the scripts in 'practicemodels' or 'models' mode, simply change the corresponding values in config.json and run the script fullprocess.py (a sketch of how the scripts read config.json follows this list)
- The files confusionmatrix.png and apireturns.json are generated automatically according to the settings in config.json:
- confusionmatrix.png -> practicemodels/confusionmatrix_practicemodel.png or models/confusionmatrix_models.png
- apireturns.json -> practicemodels/apireturns_practicemodel.json or models/apireturns_models.json
- The file fullprocess.log contains time-stamped logs of fullprocess.py
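A sketch of how the scripts could read the mode switch from config.json; the key names below are assumptions, so check config.json in this repository for the actual ones:

```python
import json
import os

# Key names here ("output_folder_path", "output_model_path") are assumptions.
with open("config.json") as fh:
    config = json.load(fh)

output_folder = config["output_folder_path"]
model_folder = config["output_model_path"]  # e.g. "practicemodels" or "models"

# Every script derives its input/output locations from these values, so switching
# between the 'practicemodels' and 'models' runs only requires editing config.json
# before launching fullprocess.py.
dataset_csv = os.path.join(output_folder, "finaldata.csv")
model_pkl = os.path.join(model_folder, "trainedmodel.pkl")
```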
Re-deployment
In the 'Model Deployment' part of the project instructions, it is stated that re-deployment should simply copy existing files. However, a new model that does not perform well on test data could overwrite a previous model with better performance. Therefore, before copying the files we first compare the latest score of the currently deployed model with the score of the new model on the test data. Only if the new score is higher are the new files and model copied to the deployment folder.
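A sketch of that comparison, assuming the deployed score lives in a latestscore.txt file inside the deployment folder and that a higher score is better; the file names and folder layout are assumptions:

```python
import os
import shutil


def redeploy_if_better(new_score: float, model_folder: str,
                       deployment_folder: str) -> bool:
    """Copy the new model into the deployment folder only if it beats the old score."""
    score_file = os.path.join(deployment_folder, "latestscore.txt")
    deployed_score = float("-inf")
    if os.path.exists(score_file):
        with open(score_file) as fh:
            deployed_score = float(fh.read().strip())

    if new_score <= deployed_score:
        return False  # keep the better-performing model that is already deployed

    os.makedirs(deployment_folder, exist_ok=True)
    for name in ("trainedmodel.pkl", "latestscore.txt", "ingestedfiles.txt"):
        src = os.path.join(model_folder, name)
        if os.path.exists(src):
            shutil.copy(src, deployment_folder)
    return True
```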