Cargando…

Incremental data integration for tracking genotype-disease associations

Functional annotation of genes remains a challenge in fundamental biology and is a limiting factor for translational medicine. Computational approaches have been developed to process heterogeneous data into meaningful metrics, but often do not address how findings might be updated when new evidence...

Descripción completa

Detalles Bibliográficos
Autores principales: Konopka, Tomasz, Smedley, Damian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7004389/
https://www.ncbi.nlm.nih.gov/pubmed/31986132
http://dx.doi.org/10.1371/journal.pcbi.1007586
_version_ 1783494713948504064
author Konopka, Tomasz
Smedley, Damian
author_facet Konopka, Tomasz
Smedley, Damian
author_sort Konopka, Tomasz
collection PubMed
description Functional annotation of genes remains a challenge in fundamental biology and is a limiting factor for translational medicine. Computational approaches have been developed to process heterogeneous data into meaningful metrics, but often do not address how findings might be updated when new evidence comes to light. To address this challenge, we describe requirements for a framework for incremental data integration and propose an implementation based on phenotype ontologies and Bayesian probability updates. We apply the framework to quantify similarities between gene annotations and disease profiles. Within this scope, we categorize human diseases according to how well they can be recapitulated by animal models and quantify similarities between human diseases and mouse models produced by the International Mouse Phenotyping Consortium. The flexibility of the approach allows us to incorporate negative phenotypic data to better prioritize candidate genes, and to stratify disease mapping using sex-dependent phenotypes. All our association scores can be updated and we exploit this feature to showcase integration with curated annotations from high-precision assays. Incremental integration is thus a suitable framework for tracking functional annotations and linking to complex human pathology.
format Online
Article
Text
id pubmed-7004389
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-70043892020-02-19 Incremental data integration for tracking genotype-disease associations Konopka, Tomasz Smedley, Damian PLoS Comput Biol Research Article Functional annotation of genes remains a challenge in fundamental biology and is a limiting factor for translational medicine. Computational approaches have been developed to process heterogeneous data into meaningful metrics, but often do not address how findings might be updated when new evidence comes to light. To address this challenge, we describe requirements for a framework for incremental data integration and propose an implementation based on phenotype ontologies and Bayesian probability updates. We apply the framework to quantify similarities between gene annotations and disease profiles. Within this scope, we categorize human diseases according to how well they can be recapitulated by animal models and quantify similarities between human diseases and mouse models produced by the International Mouse Phenotyping Consortium. The flexibility of the approach allows us to incorporate negative phenotypic data to better prioritize candidate genes, and to stratify disease mapping using sex-dependent phenotypes. All our association scores can be updated and we exploit this feature to showcase integration with curated annotations from high-precision assays. Incremental integration is thus a suitable framework for tracking functional annotations and linking to complex human pathology. Public Library of Science 2020-01-27 /pmc/articles/PMC7004389/ /pubmed/31986132 http://dx.doi.org/10.1371/journal.pcbi.1007586 Text en © 2020 Konopka, Smedley http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Konopka, Tomasz
Smedley, Damian
Incremental data integration for tracking genotype-disease associations
title Incremental data integration for tracking genotype-disease associations
title_full Incremental data integration for tracking genotype-disease associations
title_fullStr Incremental data integration for tracking genotype-disease associations
title_full_unstemmed Incremental data integration for tracking genotype-disease associations
title_short Incremental data integration for tracking genotype-disease associations
title_sort incremental data integration for tracking genotype-disease associations
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7004389/
https://www.ncbi.nlm.nih.gov/pubmed/31986132
http://dx.doi.org/10.1371/journal.pcbi.1007586
work_keys_str_mv AT konopkatomasz incrementaldataintegrationfortrackinggenotypediseaseassociations
AT smedleydamian incrementaldataintegrationfortrackinggenotypediseaseassociations