Skip to content

SumeetTiwari07/gmh-metafun

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

58 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

gmh-metafun (beta)

Nextflow pipeline to perform functional profiling (genefamilies and pathways) of metagenomic short reads.

  1. Read cleaning (fastp)
  2. Summary of Species relative abundance (metaphlan3)
  3. Summary of Genefamilies and pathways abundance (humann3)
  4. Visulaziation of top N species, Genefamiles and pathways.

Rationale

This is a pipeline for functional as well species abundance profiling of Shotgun Metagenomic reads. This facilitates the estimation of Genefamiles and Pathways' abundance within the samples.

Dependencies

The pipeline is written in Nextflow and its dependencies are available in singularity container.

The YAML file with conda enviromment with required tools is provided in envs/env.yaml.

Databases

Usage

nextflow run humann.nf --reads 'data/*_R{1,2}.fastq.gz' \
    --outdir "nf-humann" -profile nbi

Notable options:

  • --uniref [path]: Path to the uniref database
  • --chocophlan [path] : Path to the chocophlan database
  • --metaphlandb [path] : Path to metaphlandb database

Profiles:

  • -profile nbi: use the default location in the NBI cluster and SLURM scheduler.
  • -profile vmqib: use the default location of the database in the NBI cluster and local scheduler.

Note: The default location of these database on NBI cluster can be found in the nextflow.config

About

Functional analysis of metagenome datasets

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 70.1%
  • R 17.9%
  • Nextflow 12.0%