Cargando…

Contribution of model organism phenotypes to the computational identification of human disease genes

Computing phenotypic similarity helps identify new disease genes and diagnose rare diseases. Genotype–phenotype data from orthologous genes in model organisms can compensate for lack of human data and increase genome coverage. In the past decade, cross-species phenotype comparisons have proven valub...

Descripción completa

Detalles Bibliográficos
Autores principales: Alghamdi, Sarah M., Schofield, Paul N., Hoehndorf, Robert
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Company of Biologists Ltd 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9366895/
https://www.ncbi.nlm.nih.gov/pubmed/35758016
http://dx.doi.org/10.1242/dmm.049441
_version_ 1784765670903250944
author Alghamdi, Sarah M.
Schofield, Paul N.
Hoehndorf, Robert
author_facet Alghamdi, Sarah M.
Schofield, Paul N.
Hoehndorf, Robert
author_sort Alghamdi, Sarah M.
collection PubMed
description Computing phenotypic similarity helps identify new disease genes and diagnose rare diseases. Genotype–phenotype data from orthologous genes in model organisms can compensate for lack of human data and increase genome coverage. In the past decade, cross-species phenotype comparisons have proven valuble, and several ontologies have been developed for this purpose. The relative contribution of different model organisms to computational identification of disease-associated genes is not fully explored. We used phenotype ontologies to semantically relate phenotypes resulting from loss-of-function mutations in model organisms to disease-associated phenotypes in humans. Semantic machine learning methods were used to measure the contribution of different model organisms to the identification of known human gene–disease associations. We found that mouse genotype–phenotype data provided the most important dataset in the identification of human disease genes by semantic similarity and machine learning over phenotype ontologies. Other model organisms' data did not improve identification over that obtained using the mouse alone, and therefore did not contribute significantly to this task. Our work impacts on the development of integrated phenotype ontologies, as well as for the use of model organism phenotypes in human genetic variant interpretation. This article has an associated First Person interview with the first author of the paper.
format Online
Article
Text
id pubmed-9366895
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher The Company of Biologists Ltd
record_format MEDLINE/PubMed
spelling pubmed-93668952022-08-11 Contribution of model organism phenotypes to the computational identification of human disease genes Alghamdi, Sarah M. Schofield, Paul N. Hoehndorf, Robert Dis Model Mech Research Article Computing phenotypic similarity helps identify new disease genes and diagnose rare diseases. Genotype–phenotype data from orthologous genes in model organisms can compensate for lack of human data and increase genome coverage. In the past decade, cross-species phenotype comparisons have proven valuble, and several ontologies have been developed for this purpose. The relative contribution of different model organisms to computational identification of disease-associated genes is not fully explored. We used phenotype ontologies to semantically relate phenotypes resulting from loss-of-function mutations in model organisms to disease-associated phenotypes in humans. Semantic machine learning methods were used to measure the contribution of different model organisms to the identification of known human gene–disease associations. We found that mouse genotype–phenotype data provided the most important dataset in the identification of human disease genes by semantic similarity and machine learning over phenotype ontologies. Other model organisms' data did not improve identification over that obtained using the mouse alone, and therefore did not contribute significantly to this task. Our work impacts on the development of integrated phenotype ontologies, as well as for the use of model organism phenotypes in human genetic variant interpretation. This article has an associated First Person interview with the first author of the paper. The Company of Biologists Ltd 2022-08-03 /pmc/articles/PMC9366895/ /pubmed/35758016 http://dx.doi.org/10.1242/dmm.049441 Text en © 2022. Published by The Company of Biologists Ltd https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution and reproduction in any medium provided that the original work is properly attributed.
spellingShingle Research Article
Alghamdi, Sarah M.
Schofield, Paul N.
Hoehndorf, Robert
Contribution of model organism phenotypes to the computational identification of human disease genes
title Contribution of model organism phenotypes to the computational identification of human disease genes
title_full Contribution of model organism phenotypes to the computational identification of human disease genes
title_fullStr Contribution of model organism phenotypes to the computational identification of human disease genes
title_full_unstemmed Contribution of model organism phenotypes to the computational identification of human disease genes
title_short Contribution of model organism phenotypes to the computational identification of human disease genes
title_sort contribution of model organism phenotypes to the computational identification of human disease genes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9366895/
https://www.ncbi.nlm.nih.gov/pubmed/35758016
http://dx.doi.org/10.1242/dmm.049441
work_keys_str_mv AT alghamdisarahm contributionofmodelorganismphenotypestothecomputationalidentificationofhumandiseasegenes
AT schofieldpauln contributionofmodelorganismphenotypestothecomputationalidentificationofhumandiseasegenes
AT hoehndorfrobert contributionofmodelorganismphenotypestothecomputationalidentificationofhumandiseasegenes