transformers_from_scratch My own implementation of the transformer (for educational purposes). Includes visualizations of the attention heads in action. Usage Run encode_data.py to encode a folder of text data Run minature.py to train Run vizualize.py to create visualizations