Two different scripts for modeling the Corporación Favorita dataset:
The sales of three products that are among the most sold are modeled using linear regression, ridge regression and XGBoost, and monthly predictions are made and plotted.
Variable importances are also plotted for the three products using these three different models.
Interestingly the linear and XGBoost models use lagged sales as their most important predictors, but the ridge models do not.
Sales forecasting done on each product-store pair using lasso regression
Includes a function for plotting predictions and actuals. Results of the models are below.