Cargando…

The PhyloFacts FAT-CAT web server: ortholog identification and function prediction using fast approximate tree classification

The PhyloFacts ‘Fast Approximate Tree Classification’ (FAT-CAT) web server provides a novel approach to ortholog identification using subtree hidden Markov model-based placement of protein sequences to phylogenomic orthology groups in the PhyloFacts database. Results on a data set of microbial, plan...

Descripción completa

Detalles Bibliográficos
Autores principales: Afrasiabi, Cyrus, Samad, Bushra, Dineen, David, Meacham, Christopher, Sjölander, Kimmen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3692063/
https://www.ncbi.nlm.nih.gov/pubmed/23685612
http://dx.doi.org/10.1093/nar/gkt399
_version_ 1782274560949747712
author Afrasiabi, Cyrus
Samad, Bushra
Dineen, David
Meacham, Christopher
Sjölander, Kimmen
author_facet Afrasiabi, Cyrus
Samad, Bushra
Dineen, David
Meacham, Christopher
Sjölander, Kimmen
author_sort Afrasiabi, Cyrus
collection PubMed
description The PhyloFacts ‘Fast Approximate Tree Classification’ (FAT-CAT) web server provides a novel approach to ortholog identification using subtree hidden Markov model-based placement of protein sequences to phylogenomic orthology groups in the PhyloFacts database. Results on a data set of microbial, plant and animal proteins demonstrate FAT-CAT’s high precision at separating orthologs and paralogs and robustness to promiscuous domains. We also present results documenting the precision of ortholog identification based on subtree hidden Markov model scoring. The FAT-CAT phylogenetic placement is used to derive a functional annotation for the query, including confidence scores and drill-down capabilities. PhyloFacts’ broad taxonomic and functional coverage, with >7.3 M proteins from across the Tree of Life, enables FAT-CAT to predict orthologs and assign function for most sequence inputs. Four pipeline parameter presets are provided to handle different sequence types, including partial sequences and proteins containing promiscuous domains; users can also modify individual parameters. PhyloFacts trees matching the query can be viewed interactively online using the PhyloScope Javascript tree viewer and are hyperlinked to various external databases. The FAT-CAT web server is available at http://phylogenomics.berkeley.edu/phylofacts/fatcat/.
format Online
Article
Text
id pubmed-3692063
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-36920632013-06-25 The PhyloFacts FAT-CAT web server: ortholog identification and function prediction using fast approximate tree classification Afrasiabi, Cyrus Samad, Bushra Dineen, David Meacham, Christopher Sjölander, Kimmen Nucleic Acids Res Articles The PhyloFacts ‘Fast Approximate Tree Classification’ (FAT-CAT) web server provides a novel approach to ortholog identification using subtree hidden Markov model-based placement of protein sequences to phylogenomic orthology groups in the PhyloFacts database. Results on a data set of microbial, plant and animal proteins demonstrate FAT-CAT’s high precision at separating orthologs and paralogs and robustness to promiscuous domains. We also present results documenting the precision of ortholog identification based on subtree hidden Markov model scoring. The FAT-CAT phylogenetic placement is used to derive a functional annotation for the query, including confidence scores and drill-down capabilities. PhyloFacts’ broad taxonomic and functional coverage, with >7.3 M proteins from across the Tree of Life, enables FAT-CAT to predict orthologs and assign function for most sequence inputs. Four pipeline parameter presets are provided to handle different sequence types, including partial sequences and proteins containing promiscuous domains; users can also modify individual parameters. PhyloFacts trees matching the query can be viewed interactively online using the PhyloScope Javascript tree viewer and are hyperlinked to various external databases. The FAT-CAT web server is available at http://phylogenomics.berkeley.edu/phylofacts/fatcat/. Oxford University Press 2013-07 2013-05-18 /pmc/articles/PMC3692063/ /pubmed/23685612 http://dx.doi.org/10.1093/nar/gkt399 Text en © The Author(s) 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Articles
Afrasiabi, Cyrus
Samad, Bushra
Dineen, David
Meacham, Christopher
Sjölander, Kimmen
The PhyloFacts FAT-CAT web server: ortholog identification and function prediction using fast approximate tree classification
title The PhyloFacts FAT-CAT web server: ortholog identification and function prediction using fast approximate tree classification
title_full The PhyloFacts FAT-CAT web server: ortholog identification and function prediction using fast approximate tree classification
title_fullStr The PhyloFacts FAT-CAT web server: ortholog identification and function prediction using fast approximate tree classification
title_full_unstemmed The PhyloFacts FAT-CAT web server: ortholog identification and function prediction using fast approximate tree classification
title_short The PhyloFacts FAT-CAT web server: ortholog identification and function prediction using fast approximate tree classification
title_sort phylofacts fat-cat web server: ortholog identification and function prediction using fast approximate tree classification
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3692063/
https://www.ncbi.nlm.nih.gov/pubmed/23685612
http://dx.doi.org/10.1093/nar/gkt399
work_keys_str_mv AT afrasiabicyrus thephylofactsfatcatwebserverorthologidentificationandfunctionpredictionusingfastapproximatetreeclassification
AT samadbushra thephylofactsfatcatwebserverorthologidentificationandfunctionpredictionusingfastapproximatetreeclassification
AT dineendavid thephylofactsfatcatwebserverorthologidentificationandfunctionpredictionusingfastapproximatetreeclassification
AT meachamchristopher thephylofactsfatcatwebserverorthologidentificationandfunctionpredictionusingfastapproximatetreeclassification
AT sjolanderkimmen thephylofactsfatcatwebserverorthologidentificationandfunctionpredictionusingfastapproximatetreeclassification
AT afrasiabicyrus phylofactsfatcatwebserverorthologidentificationandfunctionpredictionusingfastapproximatetreeclassification
AT samadbushra phylofactsfatcatwebserverorthologidentificationandfunctionpredictionusingfastapproximatetreeclassification
AT dineendavid phylofactsfatcatwebserverorthologidentificationandfunctionpredictionusingfastapproximatetreeclassification
AT meachamchristopher phylofactsfatcatwebserverorthologidentificationandfunctionpredictionusingfastapproximatetreeclassification
AT sjolanderkimmen phylofactsfatcatwebserverorthologidentificationandfunctionpredictionusingfastapproximatetreeclassification