Skip to content

naobservatory/basecall-workflow

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Nanopore Basecalling Workflow

This Nextflow pipeline is designed to process Oxford Nanopore raw signal data (POD5 files) through basecalling and optional demultiplexing steps. It supports both simplex and duplex basecalling modes using Dorado.

Pipeline Description

Overview

The pipeline consists of a single workflow that processes Nanopore POD5 files through several phases:

  1. A batching phase where POD5 files are grouped into batches to allow parallelized basecalling
  2. A basecalling phase using Dorado in either simplex or duplex mode
  3. An optional demultiplexing phase for barcoded samples
  4. A final conversion phase to generate FASTQ files from BAM output

Pipeline Outputs

The workflow produces the following key outputs:

  1. raw/: Directory containing the final FASTQ files
  2. summary/: Directory containing basecalling summary statistics

Using the Workflow

Installation & Setup

  1. Install Nextflow (23.04.0+)
  2. Install Docker
  3. Set up AWS BATCH
  4. Clone this repository

Running the Pipeline

Basic usage:

Create a new directory, name it after the delivery, copy in basecall.config as nextflow.config, and set the parameters. Params:

  • duplex
    • Duplex basecalling or no? You can't combine duplex and demux
  • demux
    • Demultiplex basecalling output?
  • batch_size
    • Sizing for batching pod5 files for parallelized base-calling
  • nanopore_run N
    • Name of run/delivery
  • kit
    • Name of ONT kit, needed for demux'ing
  • pod_5_dir
    • path to dir containing pod5 files
  • base_dir
    • path to where output will be saved to

Once that is done, you can switch into the directory and run

nextflow run .. -resume

About

Nextflow workflow to run basecalling on Nanopore data

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published