This repo contains the code that accompanies the paper "Critical Analysis of Deconfounded Pretraining to Improve Visio-Linguistic Models". The code has not been cleaned up yet. If I notice that a lot of people want to build on it or reproduce the results, I will update the repo with cleaned-up code and more detailed instructions on how to reproduce the results in the paper.