Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Problems with importing multiple files #98

Open
NicolasLouw opened this issue Apr 19, 2022 · 0 comments
Open

Problems with importing multiple files #98

NicolasLouw opened this issue Apr 19, 2022 · 0 comments

Comments

@NicolasLouw
Copy link

Good day,

Sorry for already posting another issue, but I went through all the issues and I could not find another posted issue that is similar enough to mine.
Right now, I am trying to combine multiple separate multiple sequence alignment files and organise it according to my one large combined multiple sequence alignment file to create a character set that I want to use an input for a maximum likelihood based tree in IQ-tree. I used MAFFT as an extension in Orthofinder to obtain my multiple sequence alignments between 8 different species. As a result, I have 10210 separate multiple sequence alignment files. Within each of those files, the headers in my fasta files between the species were problematic, because it had unique headers for the different protein names. I standardised the names of the headers in all the fasta files using sed. So now I only have 8 unique headers, representing my 8 different species in the multiple sequence alignment fasta files. Using those files, I am able to upload them into SequenceMatrix, but I do have one issue, when I drag and drop all the files, I get a warning message: "Some sequences in the taxonset OG0000003 weren't added. These are: Penicillium-brevicompactum: Multiple sequences with the same name found, only the largest one is being used"

If I click okay it successfully uploads some of the sequences. However, this warning message comes up for all of my separate files and there are over ten thousand. Is there a way that I can import my files by bypassing this warning message?

Thank you so much in advance!

Best,
Nicolas

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant