A PyTorch Lightning reimplementation of ChromBPNet developed during fall BMI rotation with the Corces Lab. ChromBPNet is a deep learning model for predicting chromatin accessibility from DNA sequence.
base_modules.py
: Contains the fundamental building blocks of the neural network architecture_module.py
: Arranges the base modules into the complete model architecture_model.py
: Contains the PyTorch Lightning trainer class implementation_data.py
: Implements data loading and preprocessing functionality
notebooks/
: Jupyter notebooks used for model development, testing, and comparisons with the original ChromBPNet implementationmicroglia_train.sh
andmicroglia_train_new.sh
: Training scripts for microglia-specific experiments
adjust_bed.py
: Utilities for handling BED file format adjustmentsattention_utils.py
: Helper functions for attention mechanismsdata_utils.py
: General data processing utilitieslosses.py
: Loss function implementations (Note: NLL implementation currently in progress)one_hot.py
: Functions for one-hot encoding of DNA sequencesshape_utils.py
: Utilities for handling tensor shapes and transformations
- Complete NLL loss implementation
- Personalized Genome integration (paired WGS, scATAC-Seq)