AlignmentTools.jar pairwise-knn output #1
Comments
Hi, sheikki,
The columns definition is in the header of the output file:
#seqname k orientation score ident query_start query_end query_length
Is "@650A9:00200:00424" a sequence of length 34? If so, this assignment
Benli Chai
RDP Staff
On Wed, May 11, 2016 at 5:48 AM, sheikki [email protected] wrote:
Thank you for the reply. Oddly, in my alignment file the ref_start value is always zero. A few examples:
I'm classifying representative sequences of quality-controlled and clustered 16S reads with the command:
java -jar AlignmentTools.jar pairwise-knn query.fq db.fa
The db file is an unaligned prokaryotic subset of RDP 11.4, clustered at 99% (with some sequence length thresholds).
Is this a sensible way to assign taxonomy to my representative sequences?
In output, I see lines like:
@650A9:00200:00424 1 + 155 1.000 0 34 34 0 83 S004055894 Listeria monocytogenes; CA5 Lineage=Root;rootrank;Bacteria;domain;Firmicutes;phylum;Bacilli;class;Bacillales;order;Listeriaceae;family;Listeria;genus
As far as I can tell it's QID KNEIGHBOURS STRAND SCORE %ID QSTART QEND QEND QSTART SSTART SID. Is this the correct interpretation? Why is it that the QSTART and QEND values are displayed twice?
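For anyone who wants to check the column layout on their own output, here is a minimal Python sketch that splits one result line into the first eight columns named in the output file header (seqname, k, orientation, score, ident, query_start, query_end, query_length). It assumes the fields are whitespace-separated and that the sequence name contains no spaces; neither is confirmed in this thread. The names `KnnHit` and `parse_knn_line` are made up for illustration and are not part of AlignmentTools. Everything after the eighth column is kept as an unparsed string, since those columns are not named here.

```python
from typing import NamedTuple


class KnnHit(NamedTuple):
    """First eight columns of a pairwise-knn output line (hypothetical helper type)."""
    seqname: str
    k: int
    orientation: str
    score: float
    ident: float
    query_start: int
    query_end: int
    query_length: int
    rest: str  # remaining columns, left unparsed because their names are not given here


def parse_knn_line(line: str) -> KnnHit:
    """Split one output line, assuming whitespace-separated fields (an assumption)."""
    # Split at most 8 times so the reference description stays in one piece.
    parts = line.rstrip("\n").split(None, 8)
    if len(parts) < 8:
        raise ValueError(f"expected at least 8 fields, got {len(parts)}")
    rest = parts[8] if len(parts) > 8 else ""
    return KnnHit(
        seqname=parts[0],
        k=int(parts[1]),
        orientation=parts[2],
        score=float(parts[3]),
        ident=float(parts[4]),
        query_start=int(parts[5]),
        query_end=int(parts[6]),
        query_length=int(parts[7]),
        rest=rest,
    )


if __name__ == "__main__":
    sample = (
        "@650A9:00200:00424 1 + 155 1.000 0 34 34 0 83 S004055894 "
        "Listeria monocytogenes; CA5"
    )
    hit = parse_knn_line(sample)
    print(hit.query_start, hit.query_end, hit.query_length)
    print(hit.rest)
```

Read this way, the second 34 in the example line is query_length rather than a repeated QEND, which matches the header quoted in the reply.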