This is a list of resources (courses, tutorials, etc.) in the field of Computational and Structural Biology. This is for those with no background in biology but who want to work in this field.
I highly expect you to contribute to this project if you have any ideas on related topics. One person can go fast, but a group of people can go further.
The first step is to learn the very fundamental concepts in biology, like what is an amino acid? What are proteins made of? You don't have to memorize every detail, but it is essential to be familiarized with those conceptual words. After this stage, you should understand the basic structure of proteins, e.g., the four levels of structure.
-
Workshop: Machine Learning in Structural Biology (neurips.cc)
-
Course 81855: Workshop in Computational Structural Biology (huji.ac.il)
-
"生物学" 免费教程 - Structural Biology 101: A Beginner' 's Guide | Udemy
-
Introduction to proteins: structure, function, and motion TAU online course | Mysite (bentalab.com)
-
This article provides useful resources about Protein Structural Biology. How to Introduce Students to Protein Structural Biology (routledge.com)
-
rsgturkey/Computational_Structural_Biology_Workshop (github.com)
-
Structural Biology 228 | Computational Structural Biology (stanford.edu)
-
CSCI4969-6969 Machine Learning in Bioinformatics | Zaki Home Page (rpi.edu)
-
Biological Macromolecules computational medicine - Biology (google.com)
-
- DNA part 1 and part 2 (Khan Academy)
- RNA (Khan Academy)
- Protein Structure (Gerry Bergstrom) and Function (Khan Academy)
- Carbohydrates and Lipids (Craig Savage)
- Khan Academy Macromolecules
- Khan Academy Biomolecules
- Khan Academy Medicine Playlist on Proteins
- Khan Academy Medicine Playlist on DNA and RNA
- Khan Academy Medicine Playlist on Lipids, and Carbohydrates
- DNA Learning Center Playlist on DNA Structure and Function
-
"生物信息学" 免费教程 - Conducting Introductory Computational Biology Research | Udemy
-
- This is a great course, but it focuses more on the genome
-
Syllabus | Foundations of Computational and Systems Biology | Biology | MIT OpenCourseWare
'Morden' computational methods for biology are dominated mainly by neural networks, the most famous is Alphafold2.
Alphafold2 achieves an atomic level of accuracy in the protein structure prediction task. It gives a solution to a problem that has not been answered for 50 years. The model capacity is gigantic, and the design is sophisticated. To understand the paper, I list several tutorials that describe the mechanisms of Alphafold2. I think that many operations in Alphafold2 are no biologically or physically meaningful; instead, they just try to find a method to scale up the model capacity effectively and thus achieve better performance. But it does not mean I don't like Alphafold2. The uses of the structure module and the representation of the atom's 3D coordinate are as brilliant as hard to understand.
- AlphaFold: a solution to a 50-year-old grand challenge in biology (deepmind.com)
- Highly accurate protein structure prediction with AlphaFold | Nature
- Video tutorial
- Article
- Survey paper: Machine learning in protein structure prediction (sciencedirectassets.com)
- Code Implementation
-
- lucidrains/alphafold2: To eventually become an unofficial Pytorch implementation / replication of Alphafold2, as details of the architecture get released (github.com) This code is not 100% pure implementation of Alphafold2, where many modifications and improvement are devloped for better performance.
- aqlaboratory/openfold: Trainable, memory-efficient, and GPU-friendly PyTorch reproduction of AlphaFold 2 (github.com)
"What I cannot create, I do not understand." -Richard Feynman.
As I have stated, most current methods focus on developing neural networks for bio problems. If you have a fair understanding of deep learning, you can find a paper to read and start your project. If you are a beginner in deep learning (DL), there is a long way to go. Many good materials on deep learning are publicly available; please look them up. Either way, I recommend you reproduce a SOTA method in the field of computational biology. If you don't know what paper to reproduce, I think Alphafold2 is a great starting point! That will ground you up for your project.
I also list several beginner-level excises below.
- sinadadmand/CHBI522: Computational Structural Biology (github.com)
- pjmartel/compbio: Materials for the computational structural biology course. (github.com)
-
awesome lists