Final project for CMU 10701 Introduction to Machine Learning for PhDs.
The notebook implements a encoder-decoder model with scheduled sampling and beam search. The model is trained on the Microsoft Common Objects in Context (MSCOCO) data. For implementation details, please check out the notebook and the report.