-
Notifications
You must be signed in to change notification settings - Fork 32
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Classifier and CPU consumption #9
Comments
The CPU usage is likely proportional to the number of terminal taxa in the training set. There are about 2000 genera (terminal taxa) in the default 16S taxonomy. You can request less than 1G memory. Do you know how many terminal taxa in your training set? |
My taxonomy contains 82561 terminal taxa. |
We haven't used Classifier on large number of terminal taxa. The largest on On Thu, Apr 9, 2015 at 3:23 AM, fescudie [email protected] wrote:
Qiong |
With 100GB I have the same problem. |
This is interesting. I am wondering if you would like to share your Qiong On Mon, Apr 13, 2015 at 6:59 AM, fescudie [email protected] wrote:
Qiong |
You can get the training files at this URL: http://genoweb.toulouse.inra.fr/~fescudie/.
Consumption:
|
Hi,
When I use RDP classifier with my own databank (a very large 16S databank) the CPU usage of RDP is unacceptable : up to 2360% (see below).
This phenomena doesn't appear with the default databank and is more reduced with the databank provided in example of RDP train classifier.
How can I reduce the CPU consumption/nb threads of RDP classifier ?
Command with my databank:
Consumption:
Consumption with threads:
Command with RDP default databank:
Consumption:
Command with 'Example command to train classifier':
Consumption:
Thanks in advance.
The text was updated successfully, but these errors were encountered: