Cargando…

WORMHOLE: Novel Least Diverged Ortholog Prediction through Machine Learning

The rapid advancement of technology in genomics and targeted genetic manipulation has made comparative biology an increasingly prominent strategy to model human disease processes. Predicting orthology relationships between species is a vital component of comparative biology. Dozens of strategies for...

Descripción completa

Detalles Bibliográficos
Autores principales: Sutphin, George L., Mahoney, J. Matthew, Sheppard, Keith, Walton, David O., Korstanje, Ron
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5094675/
https://www.ncbi.nlm.nih.gov/pubmed/27812085
http://dx.doi.org/10.1371/journal.pcbi.1005182
_version_ 1782465149867655168
author Sutphin, George L.
Mahoney, J. Matthew
Sheppard, Keith
Walton, David O.
Korstanje, Ron
author_facet Sutphin, George L.
Mahoney, J. Matthew
Sheppard, Keith
Walton, David O.
Korstanje, Ron
author_sort Sutphin, George L.
collection PubMed
description The rapid advancement of technology in genomics and targeted genetic manipulation has made comparative biology an increasingly prominent strategy to model human disease processes. Predicting orthology relationships between species is a vital component of comparative biology. Dozens of strategies for predicting orthologs have been developed using combinations of gene and protein sequence, phylogenetic history, and functional interaction with progressively increasing accuracy. A relatively new class of orthology prediction strategies combines aspects of multiple methods into meta-tools, resulting in improved prediction performance. Here we present WORMHOLE, a novel ortholog prediction meta-tool that applies machine learning to integrate 17 distinct ortholog prediction algorithms to identify novel least diverged orthologs (LDOs) between 6 eukaryotic species—humans, mice, zebrafish, fruit flies, nematodes, and budding yeast. Machine learning allows WORMHOLE to intelligently incorporate predictions from a wide-spectrum of strategies in order to form aggregate predictions of LDOs with high confidence. In this study we demonstrate the performance of WORMHOLE across each combination of query and target species. We show that WORMHOLE is particularly adept at improving LDO prediction performance between distantly related species, expanding the pool of LDOs while maintaining low evolutionary distance and a high level of functional relatedness between genes in LDO pairs. We present extensive validation, including cross-validated prediction of PANTHER LDOs and evaluation of evolutionary divergence and functional similarity, and discuss future applications of machine learning in ortholog prediction. A WORMHOLE web tool has been developed and is available at http://wormhole.jax.org/.
format Online
Article
Text
id pubmed-5094675
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-50946752016-11-18 WORMHOLE: Novel Least Diverged Ortholog Prediction through Machine Learning Sutphin, George L. Mahoney, J. Matthew Sheppard, Keith Walton, David O. Korstanje, Ron PLoS Comput Biol Research Article The rapid advancement of technology in genomics and targeted genetic manipulation has made comparative biology an increasingly prominent strategy to model human disease processes. Predicting orthology relationships between species is a vital component of comparative biology. Dozens of strategies for predicting orthologs have been developed using combinations of gene and protein sequence, phylogenetic history, and functional interaction with progressively increasing accuracy. A relatively new class of orthology prediction strategies combines aspects of multiple methods into meta-tools, resulting in improved prediction performance. Here we present WORMHOLE, a novel ortholog prediction meta-tool that applies machine learning to integrate 17 distinct ortholog prediction algorithms to identify novel least diverged orthologs (LDOs) between 6 eukaryotic species—humans, mice, zebrafish, fruit flies, nematodes, and budding yeast. Machine learning allows WORMHOLE to intelligently incorporate predictions from a wide-spectrum of strategies in order to form aggregate predictions of LDOs with high confidence. In this study we demonstrate the performance of WORMHOLE across each combination of query and target species. We show that WORMHOLE is particularly adept at improving LDO prediction performance between distantly related species, expanding the pool of LDOs while maintaining low evolutionary distance and a high level of functional relatedness between genes in LDO pairs. We present extensive validation, including cross-validated prediction of PANTHER LDOs and evaluation of evolutionary divergence and functional similarity, and discuss future applications of machine learning in ortholog prediction. A WORMHOLE web tool has been developed and is available at http://wormhole.jax.org/. Public Library of Science 2016-11-03 /pmc/articles/PMC5094675/ /pubmed/27812085 http://dx.doi.org/10.1371/journal.pcbi.1005182 Text en © 2016 Sutphin et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Sutphin, George L.
Mahoney, J. Matthew
Sheppard, Keith
Walton, David O.
Korstanje, Ron
WORMHOLE: Novel Least Diverged Ortholog Prediction through Machine Learning
title WORMHOLE: Novel Least Diverged Ortholog Prediction through Machine Learning
title_full WORMHOLE: Novel Least Diverged Ortholog Prediction through Machine Learning
title_fullStr WORMHOLE: Novel Least Diverged Ortholog Prediction through Machine Learning
title_full_unstemmed WORMHOLE: Novel Least Diverged Ortholog Prediction through Machine Learning
title_short WORMHOLE: Novel Least Diverged Ortholog Prediction through Machine Learning
title_sort wormhole: novel least diverged ortholog prediction through machine learning
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5094675/
https://www.ncbi.nlm.nih.gov/pubmed/27812085
http://dx.doi.org/10.1371/journal.pcbi.1005182
work_keys_str_mv AT sutphingeorgel wormholenovelleastdivergedorthologpredictionthroughmachinelearning
AT mahoneyjmatthew wormholenovelleastdivergedorthologpredictionthroughmachinelearning
AT sheppardkeith wormholenovelleastdivergedorthologpredictionthroughmachinelearning
AT waltondavido wormholenovelleastdivergedorthologpredictionthroughmachinelearning
AT korstanjeron wormholenovelleastdivergedorthologpredictionthroughmachinelearning