DECIPHER - IDTAXA Classify Functions FAQ

IDTAXA Classify Functions - Frequently Asked Questions:

Where is IDTAXA described? We are presently working on publishing the IDTAXA algorithm for functional classification of amino acid sequences.
Can I use IDTAXA on my computer?
Yes, please install DECIPHER and then look at the code page.
Where do training sets come from?
Training sets for functional classification were derived from the KEGG database. The complete KEGG training set is subsampled by lineage to have up to 100 represenatives per KEGG Orthology group, while the lineage-specific KEGG subsets include represenatives of all sequences clustered at ≥ 90% sequence identity. The lineage specific subsets will offer higher resolution classifications. The subsampled KEGG training set includes lineage information so it can be used to identify genome contamination (e.g., prokaryotic genes in a eukaryotic genome).