Skip to content

lsayaved/Metagenome-assembly-and-binning

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 

Repository files navigation

Metagenome-assembly-and-binning

Current workflows in the symbiosis department

Step 1 -- I have a bunch of reads, now what?

Metagenome sequencing reads

  • Check your reads with FastQC
  • Check what organisms are there: PhyloFlash for 16S rRNA and COI
  • Quality trim your reads. We often use a quality of 2 since the reads get a higher quality after error correction (Bbduk). Remove PhiX, TruSeq adaptors.
  • If your paired-end reads overlap, try merging the reads (Bbmerge).
  • HGV tip: Check the K-mer spectrum.
  • Assemble your reads (e.g. Megahit, MetaSPades, IDBA_UD)
    • Megahit: Fast, but produces highly fragmented assemblies
    • MetaSpades: Crashes often with more than one library. Slow
    • IDBA_UD: Produces good N50, and has a good running time. The output does not include an assembly graph.

Step 2 -- Assembly

Before proceeding think about the following:

  • Does it make sense to pool the reads of different libraries together?

If yes, do you have access to a computer that can deal with all the data?

  • If you only have the sequencing of one sample, then assemble the one library. If your assembly is completely crappy, consider sequencing more samples.
  • Kmer 127: for reads >150 bp and coverage >5X
  • Brandon's script combines scaffold with graph

Step 3 -- Binning

Check quality control of the assembled genomes (See step 5)

Step 4 -- Reassembly

  • Often improves assembly metrics, but you should try it yourself.
  • More work and not scalable to many data sets.

Step 5 -- Quality control of assembly

Optional:

Other resources

About this repository

This repository contains results from metagenome binning roundtable discussion moderated by Liz S on 26 Sep 2016. Sharing it as a Git repository has three aims - (i) make it possible to edit it collaboratively, (ii) for newbies, have some practice using Git and Github for a simple project, and (iii) take advantage of the Markdown formatting to start building a simple webpage that can be used as a reference on metagenome binning.

Formatted in Markdown - learn more about the syntax here.

Learn more about Git and Github from the Github help pages, and Symbiosis wiki (internal link)

Suggested workflow:

  • Get a Github account if you don't have one already.
  • Fork this repository and clone it to your local machine.
  • Edit the text, create new pages, insert links, images, etc.
  • Commit those changes. (Tip: Commit small "chunks" of changes at a time, so that they can be easily undone if necessary, and document your commits with short but descriptive commit messages.)
  • Push changes back to Github.
  • Submit a pull request for Liz to merge your changes with the master version of the repository.

About

Current workflows in the symbiosis department

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •  

Languages