Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Genome -> Reads #2

Closed
7 tasks done
HadrienG opened this issue Nov 17, 2016 · 1 comment
Closed
7 tasks done

Genome -> Reads #2

HadrienG opened this issue Nov 17, 2016 · 1 comment
Assignees

Comments

@HadrienG
Copy link
Owner

HadrienG commented Nov 17, 2016

Issue to track the progress on the Roadmap item "From a genome, output perfect reads in fastq format"

Test file: genome.fasta
Pattern: ATCG
Length: 2000bp

  • add Biopython to requirements.txt (aka let's not re-write a fasta parser)
  • add Numpy to requirements.txt (required by Biopython)
  • generate 100 (pseudo)random subsequences in fasta format
  • generate x subsequences covering the input genome 40x (or an arbitrary coverage level)
  • add quality information (fastq output)
  • get out of app.py, import module and write ArgumentParser
  • setup travis and write tests

Note: Biopython discards everything after a blank space (' ') in the sequence headers

@HadrienG HadrienG self-assigned this Nov 17, 2016
@HadrienG HadrienG mentioned this issue Nov 17, 2016
20 tasks
@HadrienG
Copy link
Owner Author

Closed with f064c98

HadrienG pushed a commit that referenced this issue Sep 13, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant