Skip to content

zhaoxvwahaha/BacAssembly

Repository files navigation

BacAssembly

A hybrid assembly pipeline for bacteria

Installation

bacassembly works with Python 3.6+.

[WIP]
$ conda install -c bioconda bacassembly

Or via pip:

[WIP]
$ pip install bacassembly

Run

help

$ bacassembly --help

  ______             ___                         _     _
  | ___ \           / _ \                       | |   | |
  | |_/ / __ _  ___/ /_\ \___ ___  ___ _ __ ___ | |__ | |_   _
  | ___ \/ _` |/ __|  _  / __/ __|/ _ \ '_ ` _ \| '_ \| | | | |
  | |_/ / (_| | (__| | | \__ \__ \  __/ | | | | | |_) | | |_| |
  \____/ \__,_|\___\_| |_/___/___/\___|_| |_| |_|_.__/|_|\__, |
                                                        __/ |
                                                       |___/

             Omics for All, Open Source for All

     Hybrid assembly and polish using short and long reads

  optional arguments:
  -h, --help     show this help message and exit
  -v, --version  print software version and exit

  available subcommands:

  init         init project
  denovo_wf    denovo_wf pipeline

init

$ bacassembly init --help
  usage: bacassembly init [-h] [-d WORKDIR] [-s SAMPLES]
                          [-b {trimming,rmhost,assembly}]

  optional arguments:
  -h, --help            show this help message and exit
  -d WORKDIR, --workdir WORKDIR
                        project workdir, default: ./
  -s SAMPLES, --samples SAMPLES
                        desired input: samples list, tsv format required.
                        header: ["id", "long_reads", "short_reads_1", "short_reads_2"]
  -b {trimming,rmhost,assembly}, --begin {trimming,rmhost,assembly}
                        pipeline starting point

denovo_wf

$ bacassembly denovo_wf --help
  usage: bacassembly denovo_wf [-h] [-d WORKDIR] [--config CONFIG]
                               [--cores CORES] [--jobs JOBS] [--list] [--run]
                               [--debug] [--dry_run] [--qsub] [--wait WAIT]
                               [--snake [SNAKEMAKEARGS]]
                               [TASK]

  positional arguments:
  TASK                  pipeline end point. Allowed values are trimming_all,
                        all

  optional arguments:
  -h, --help            show this help message and exit
  -d WORKDIR, --workdir WORKDIR
                        project workdir, default: ./
  --config CONFIG       config.yaml, default: ./config.yaml
  --cores CORES         CPU cores, default: 8
  --jobs JOBS           qsub job numbers, default: 80
  --list                list pipeline rules
  --run                 run pipeline
  --debug               debug pipeline
  --dry_run             dry run pipeline
  --qsub                qsub pipeline
  --wait WAIT           wait given seconds, default: 60
  --snake [SNAKEMAKEARGS]
                        other snakemake command options(sankemake -h), if want
                        --touch, just --snake touch

Example

# init project
$ bacassembly init -d . -s samples.tsv -b trimming

# run trimming
$ metapi denovo_wf trimming_all --run

# run all
$ metapi denovo_wf --run

input requirements

The input samples file: samples.tsv format:

Note: If id col contain same id, then the reads of each sample will be merged. Note: The fastq need gzip compress.

id long_reads short_reads_1 short_reads_2
s1 aa.long.fq.gz aa.1.fq.gz aa.2.fq.gz
s2 bb.long.fq.gz bb.1.fq.gz bb.2.fq.gz
s2 cc.long.fq.gz cc.1.fq.gz cc.2.fq.gz
s3 dd.long.fq.gz dd.1.fq.gz dd.2.fq.gz

Getting help

If you want to report a bug or issue, or have problems with installing or running the software, please create a new issue.

License

This module is licensed under the terms of the GPLv3 license.

About

hybrid assembly and polish using short and long reads

Resources

Stars

Watchers

Forks

Packages

No packages published