Cargando…

Prediction of gene–phenotype associations in humans, mice, and plants using phenologs

BACKGROUND: Phenotypes and diseases may be related to seemingly dissimilar phenotypes in other species by means of the orthology of underlying genes. Such “orthologous phenotypes,” or “phenologs,” are examples of deep homology, and may be used to predict additional candidate disease genes. RESULTS:...

Descripción completa

Detalles Bibliográficos
Autores principales: Woods, John O, Singh-Blom, Ulf Martin, Laurent, Jon M, McGary, Kriston L, Marcotte, Edward M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3704650/
https://www.ncbi.nlm.nih.gov/pubmed/23800157
http://dx.doi.org/10.1186/1471-2105-14-203
_version_ 1782276059265236992
author Woods, John O
Singh-Blom, Ulf Martin
Laurent, Jon M
McGary, Kriston L
Marcotte, Edward M
author_facet Woods, John O
Singh-Blom, Ulf Martin
Laurent, Jon M
McGary, Kriston L
Marcotte, Edward M
author_sort Woods, John O
collection PubMed
description BACKGROUND: Phenotypes and diseases may be related to seemingly dissimilar phenotypes in other species by means of the orthology of underlying genes. Such “orthologous phenotypes,” or “phenologs,” are examples of deep homology, and may be used to predict additional candidate disease genes. RESULTS: In this work, we develop an unsupervised algorithm for ranking phenolog-based candidate disease genes through the integration of predictions from the k nearest neighbor phenologs, comparing classifiers and weighting functions by cross-validation. We also improve upon the original method by extending the theory to paralogous phenotypes. Our algorithm makes use of additional phenotype data — from chicken, zebrafish, and E. coli, as well as new datasets for C. elegans — establishing that several types of annotations may be treated as phenotypes. We demonstrate the use of our algorithm to predict novel candidate genes for human atrial fibrillation (such as HRH2, ATP4A, ATP4B, and HOPX) and epilepsy (e.g., PAX6 and NKX2-1). We suggest gene candidates for pharmacologically-induced seizures in mouse, solely based on orthologous phenotypes from E. coli. We also explore the prediction of plant gene–phenotype associations, as for the Arabidopsis response to vernalization phenotype. CONCLUSIONS: We are able to rank gene predictions for a significant portion of the diseases in the Online Mendelian Inheritance in Man database. Additionally, our method suggests candidate genes for mammalian seizures based only on bacterial phenotypes and gene orthology. We demonstrate that phenotype information may come from diverse sources, including drug sensitivities, gene ontology biological processes, and in situ hybridization annotations. Finally, we offer testable candidates for a variety of human diseases, plant traits, and other classes of phenotypes across a wide array of species.
format Online
Article
Text
id pubmed-3704650
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-37046502013-07-12 Prediction of gene–phenotype associations in humans, mice, and plants using phenologs Woods, John O Singh-Blom, Ulf Martin Laurent, Jon M McGary, Kriston L Marcotte, Edward M BMC Bioinformatics Research Article BACKGROUND: Phenotypes and diseases may be related to seemingly dissimilar phenotypes in other species by means of the orthology of underlying genes. Such “orthologous phenotypes,” or “phenologs,” are examples of deep homology, and may be used to predict additional candidate disease genes. RESULTS: In this work, we develop an unsupervised algorithm for ranking phenolog-based candidate disease genes through the integration of predictions from the k nearest neighbor phenologs, comparing classifiers and weighting functions by cross-validation. We also improve upon the original method by extending the theory to paralogous phenotypes. Our algorithm makes use of additional phenotype data — from chicken, zebrafish, and E. coli, as well as new datasets for C. elegans — establishing that several types of annotations may be treated as phenotypes. We demonstrate the use of our algorithm to predict novel candidate genes for human atrial fibrillation (such as HRH2, ATP4A, ATP4B, and HOPX) and epilepsy (e.g., PAX6 and NKX2-1). We suggest gene candidates for pharmacologically-induced seizures in mouse, solely based on orthologous phenotypes from E. coli. We also explore the prediction of plant gene–phenotype associations, as for the Arabidopsis response to vernalization phenotype. CONCLUSIONS: We are able to rank gene predictions for a significant portion of the diseases in the Online Mendelian Inheritance in Man database. Additionally, our method suggests candidate genes for mammalian seizures based only on bacterial phenotypes and gene orthology. We demonstrate that phenotype information may come from diverse sources, including drug sensitivities, gene ontology biological processes, and in situ hybridization annotations. Finally, we offer testable candidates for a variety of human diseases, plant traits, and other classes of phenotypes across a wide array of species. BioMed Central 2013-06-21 /pmc/articles/PMC3704650/ /pubmed/23800157 http://dx.doi.org/10.1186/1471-2105-14-203 Text en Copyright © 2013 Woods et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Woods, John O
Singh-Blom, Ulf Martin
Laurent, Jon M
McGary, Kriston L
Marcotte, Edward M
Prediction of gene–phenotype associations in humans, mice, and plants using phenologs
title Prediction of gene–phenotype associations in humans, mice, and plants using phenologs
title_full Prediction of gene–phenotype associations in humans, mice, and plants using phenologs
title_fullStr Prediction of gene–phenotype associations in humans, mice, and plants using phenologs
title_full_unstemmed Prediction of gene–phenotype associations in humans, mice, and plants using phenologs
title_short Prediction of gene–phenotype associations in humans, mice, and plants using phenologs
title_sort prediction of gene–phenotype associations in humans, mice, and plants using phenologs
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3704650/
https://www.ncbi.nlm.nih.gov/pubmed/23800157
http://dx.doi.org/10.1186/1471-2105-14-203
work_keys_str_mv AT woodsjohno predictionofgenephenotypeassociationsinhumansmiceandplantsusingphenologs
AT singhblomulfmartin predictionofgenephenotypeassociationsinhumansmiceandplantsusingphenologs
AT laurentjonm predictionofgenephenotypeassociationsinhumansmiceandplantsusingphenologs
AT mcgarykristonl predictionofgenephenotypeassociationsinhumansmiceandplantsusingphenologs
AT marcotteedwardm predictionofgenephenotypeassociationsinhumansmiceandplantsusingphenologs