This repository is a central hub for bioinformatics tutorials, project guides, tools, and articles I've created. It houses a growing collection of resources I've developed to make bioinformatics more approachable and accessible. Whether you’re a beginner learning the basics or a researcher exploring more complex analyses, these resources are designed to help you navigate the world of bioinformatics.
Programming is at the heart of bioinformatics. This section provides tutorials, and articles, on essential skills to help biologists build strong programming foundations and effectively apply computational methods to solve biological problems.
- Python Fundamentals For Biologists: This extended guide offers a hands-on approach to learning Python, focusing on topics such as control flow, functions, data handling, and data visualization. It's tailored for biologists who want to use Python for data analysis and visualization.
- Bash Fundamentals For Bioinformatics: Learn how to navigate the command line, automate repetitive tasks, and execute bioinformatics workflows with this interactive introduction to Bash.
- Math For Computational Biology: This repository provides tutorials that cover mathematical concepts foundational to bioinformatics and computational biology (including calculus, ordinary differential equations, and linear algebra), offering a solid preparation for graduate level coursework.
- Catching Silent Errors in Bioinformatics Code with Assert Statements: Discover how to use assert statements to debug and validate your code, preventing silent errors that can lead to incorrect biological interpretations.
- Looping Through Repetitive Tasks: Automating workflows is a critical skill in bioinformatics. This beginner's guide introduces command-line loops to streamline repetitive bioinformatics tasks.
This section contains detailed guides and DIY tutorials for performing a range of bioinformatics analyses, from functional enrichment analysis to PPI network analysis, enabling researchers to derive meaningful insights from biological data.
- Introduction to Functional Enrichment Analysis: A beginner-friendly guide to gene ontology, pathway, and gene set enrichment analysis.
- From Genes to Networks: Uncovering Biological Insights through Protein-Protein Interactions: This tutorial walks you through mapping DEGs to PPI networks, identifying hubs, and simulating genetic circuits to uncover biological insights.
- Unraveling Gene Co-Expression Networks: Insights into Transcriptional Regulation: Learn how to construct and interpret gene co-expression networks to investigate transcriptional regulation in this practical guide.
- BLAST - The Biological Search Engine: This resource covers BLAST tools for sequence alignment, offering practical examples for identifying homologous sequences.
- Identifying and Visualizing Marker Genes with ScanPy: A step-by-step guide to analyzing single-cell RNA-seq data and identifying marker genes using ScanPy.
- Connecting the Dots: A Guide to Multi-Omics Data: Understanding the interrelationships between genomic, epigenomic, transcriptomic, proteomic, metabolomic, phenomic, and stimulus data
- Understanding Mutual Information in Bioinformatics: A Deep Dive into Entropy, Mutual Information, and Their Application in Gene Expression Analysis
This section showcases in-depth project walkthroughs, demonstrating how bioinformatics techniques can be applied to real-world research questions.
- Decoding The Viral Proteome: Analyze the herpes virus proteome to uncover patterns in amino acid frequencies (beginner friendly).
- Amiloride in Multiple Myeloma: A Deep Dive into RNA-Seq Data Analysis: Explore RNA-seq data to investigate Amiloride's mechanism of action and its impact on gene expression in multiple myeloma treatment.
- Investigating Transcriptional Changes in Alzheimer's Disease Using a 3xTg-AD Mouse Model: Uncover transcriptional changes and molecular mechanisms driving Alzheimer’s disease progression in the insular cortex of 3xTG-AD mice using RNA-seq data.
- Exploring mutual features between the 5xFAD, 3xTG-AD, and PS3O1S Alzheimer's Disease Mouse Models: Explore the similarities, differences, and overlapping features—including differentially expressed genes (DEGs), Gene Ontology (GO) terms, biological pathways, and protein-protein interaction (PPI) networks—between three mutant mouse models of Alzheimer’s disease (AD).
Systems biology focuses on understanding complex biological networks. This section provides tools and tutorials to model dynamic systems and simulate biological processes.
- Numerical Methods & Systems Biology Toolkit: A collection of numerical methods and tools for solving common systems biology problems, from ordinary differential equations to parameter optimization.
- An Introduction To Dynamical Systems & Markov Models: A primer on modeling biological systems using dynamical systems and Markov models, with practical examples.
- Understanding the SIR Model: A Tool for Modeling Infectious Diseases: Learn how to use the SIR model to simulate and understand the spread of infectious diseases.
- The Biology Network: Learn how static and dynamic network models can be used simulate and analyze biological systems, from metabolic pathways to regulatory circuits.
Machine learning offers powerful tools for analyzing and interpreting biological data. This section provides resources to evaluate models, optimize performance, and compare algorithms for bioinformatics applications.
- Evaluating Machine Learning Models: An Introduction to Resampling Methods: Learn how to use resampling methods like cross-validation to assess model performance.
- Evaluation Metrics For Classification Problems: A comprehensive guide to metrics for evaluating classification models.
- Evaluation Metrics For Regression Problems: A comprehensive guide to metrics for evaluating regression models.
- Comparing Machine Learning Algorithms: Spot Checking For Regression and Classificaiton Problems: Learn how to spot-check regression and classification models to identify the best-performing algorithm for your data.
- Introduction To Hyperparameter Optimization: Master hyperparameter tuning to optimize model performance and achieve better results.