Cargando…

Benchmarking ortholog identification methods using functional genomics data

BACKGROUND: The transfer of functional annotations from model organism proteins to human proteins is one of the main applications of comparative genomics. Various methods are used to analyze cross-species orthologous relationships according to an operational definition of orthology. Often the defini...

Descripción completa

Detalles Bibliográficos
Autores principales:	Hulsen, Tim, Huynen, Martijn A, de Vlieg, Jacob, Groenen, Peter MA
Formato:	Texto
Lenguaje:	English
Publicado:	BioMed Central 2006
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1557999/ https://www.ncbi.nlm.nih.gov/pubmed/16613613 http://dx.doi.org/10.1186/gb-2006-7-4-r31

_version_	1782129429798977536
author	Hulsen, Tim Huynen, Martijn A de Vlieg, Jacob Groenen, Peter MA
author_facet	Hulsen, Tim Huynen, Martijn A de Vlieg, Jacob Groenen, Peter MA
author_sort	Hulsen, Tim
collection	PubMed
description	BACKGROUND: The transfer of functional annotations from model organism proteins to human proteins is one of the main applications of comparative genomics. Various methods are used to analyze cross-species orthologous relationships according to an operational definition of orthology. Often the definition of orthology is incorrectly interpreted as a prediction of proteins that are functionally equivalent across species, while in fact it only defines the existence of a common ancestor for a gene in different species. However, it has been demonstrated that orthologs often reveal significant functional similarity. Therefore, the quality of the orthology prediction is an important factor in the transfer of functional annotations (and other related information). To identify protein pairs with the highest possible functional similarity, it is important to qualify ortholog identification methods. RESULTS: To measure the similarity in function of proteins from different species we used functional genomics data, such as expression data and protein interaction data. We tested several of the most popular ortholog identification methods. In general, we observed a sensitivity/selectivity trade-off: the functional similarity scores per orthologous pair of sequences become higher when the number of proteins included in the ortholog groups decreases. CONCLUSION: By combining the sensitivity and the selectivity into an overall score, we show that the InParanoid program is the best ortholog identification method in terms of identifying functionally equivalent proteins.
format	Text
id	pubmed-1557999
institution	National Center for Biotechnology Information
language	English
publishDate	2006
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-15579992006-09-02 Benchmarking ortholog identification methods using functional genomics data Hulsen, Tim Huynen, Martijn A de Vlieg, Jacob Groenen, Peter MA Genome Biol Research BACKGROUND: The transfer of functional annotations from model organism proteins to human proteins is one of the main applications of comparative genomics. Various methods are used to analyze cross-species orthologous relationships according to an operational definition of orthology. Often the definition of orthology is incorrectly interpreted as a prediction of proteins that are functionally equivalent across species, while in fact it only defines the existence of a common ancestor for a gene in different species. However, it has been demonstrated that orthologs often reveal significant functional similarity. Therefore, the quality of the orthology prediction is an important factor in the transfer of functional annotations (and other related information). To identify protein pairs with the highest possible functional similarity, it is important to qualify ortholog identification methods. RESULTS: To measure the similarity in function of proteins from different species we used functional genomics data, such as expression data and protein interaction data. We tested several of the most popular ortholog identification methods. In general, we observed a sensitivity/selectivity trade-off: the functional similarity scores per orthologous pair of sequences become higher when the number of proteins included in the ortholog groups decreases. CONCLUSION: By combining the sensitivity and the selectivity into an overall score, we show that the InParanoid program is the best ortholog identification method in terms of identifying functionally equivalent proteins. BioMed Central 2006 2006-04-13 /pmc/articles/PMC1557999/ /pubmed/16613613 http://dx.doi.org/10.1186/gb-2006-7-4-r31 Text en Copyright © 2006 Hulsen et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Hulsen, Tim Huynen, Martijn A de Vlieg, Jacob Groenen, Peter MA Benchmarking ortholog identification methods using functional genomics data
title	Benchmarking ortholog identification methods using functional genomics data
title_full	Benchmarking ortholog identification methods using functional genomics data
title_fullStr	Benchmarking ortholog identification methods using functional genomics data
title_full_unstemmed	Benchmarking ortholog identification methods using functional genomics data
title_short	Benchmarking ortholog identification methods using functional genomics data
title_sort	benchmarking ortholog identification methods using functional genomics data
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1557999/ https://www.ncbi.nlm.nih.gov/pubmed/16613613 http://dx.doi.org/10.1186/gb-2006-7-4-r31
work_keys_str_mv	AT hulsentim benchmarkingorthologidentificationmethodsusingfunctionalgenomicsdata AT huynenmartijna benchmarkingorthologidentificationmethodsusingfunctionalgenomicsdata AT devliegjacob benchmarkingorthologidentificationmethodsusingfunctionalgenomicsdata AT groenenpeterma benchmarkingorthologidentificationmethodsusingfunctionalgenomicsdata

Benchmarking ortholog identification methods using functional genomics data

Ejemplares similares