betteromics
- CA
- https://gitlab.com/lynxoid
Stars
The simplest, fastest repository for training/finetuning medium-sized GPTs.
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
SQLite3 extension for read/write storage compression with Zstandard
PRML algorithms implemented in Python
samblaster: a tool to mark duplicates and extract discordant and split reads from SAM files.
Preprocessing paired-end reads produced with experiment-specific protocols
Statistical Rethinking course and book package
Microassembly based somatic variant caller for NGS data
A General-Purpose Counting Filter: Counting Quotient Filter
Travis CI and deployment service to build a PDF from a LaTeX document.
The BTL C/C++ common Bloom filters for bioinformatics projects, as well as any APIs created for other programming languages.
A code-searching tool similar to ack, but faster.
Automatically exported from code.google.com/p/smhasher
🌈Scaffold genome sequence assemblies using linked or long read sequencing data
Minimal and clean examples of machine learning algorithm implementations
Ultrafast consensus module for raw de novo genome assembly of long uncorrected reads. http://genome.cshlp.org/content/early/2017/01/18/gr.214270.116 Note: This was the original repository which wil…
RTG Core: Software for alignment and analysis of next-gen sequencing data.
Shell script to delete orphaned Docker volumes
A single molecule sequence assembler for genomes large and small.
Library of different Bloom filters in Java with optional Redis-backing, counting and many hashing options.