Cargando…
Incremental data integration for tracking genotype-disease associations
Functional annotation of genes remains a challenge in fundamental biology and is a limiting factor for translational medicine. Computational approaches have been developed to process heterogeneous data into meaningful metrics, but often do not address how findings might be updated when new evidence...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7004389/ https://www.ncbi.nlm.nih.gov/pubmed/31986132 http://dx.doi.org/10.1371/journal.pcbi.1007586 |
_version_ | 1783494713948504064 |
---|---|
author | Konopka, Tomasz Smedley, Damian |
author_facet | Konopka, Tomasz Smedley, Damian |
author_sort | Konopka, Tomasz |
collection | PubMed |
description | Functional annotation of genes remains a challenge in fundamental biology and is a limiting factor for translational medicine. Computational approaches have been developed to process heterogeneous data into meaningful metrics, but often do not address how findings might be updated when new evidence comes to light. To address this challenge, we describe requirements for a framework for incremental data integration and propose an implementation based on phenotype ontologies and Bayesian probability updates. We apply the framework to quantify similarities between gene annotations and disease profiles. Within this scope, we categorize human diseases according to how well they can be recapitulated by animal models and quantify similarities between human diseases and mouse models produced by the International Mouse Phenotyping Consortium. The flexibility of the approach allows us to incorporate negative phenotypic data to better prioritize candidate genes, and to stratify disease mapping using sex-dependent phenotypes. All our association scores can be updated and we exploit this feature to showcase integration with curated annotations from high-precision assays. Incremental integration is thus a suitable framework for tracking functional annotations and linking to complex human pathology. |
format | Online Article Text |
id | pubmed-7004389 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-70043892020-02-19 Incremental data integration for tracking genotype-disease associations Konopka, Tomasz Smedley, Damian PLoS Comput Biol Research Article Functional annotation of genes remains a challenge in fundamental biology and is a limiting factor for translational medicine. Computational approaches have been developed to process heterogeneous data into meaningful metrics, but often do not address how findings might be updated when new evidence comes to light. To address this challenge, we describe requirements for a framework for incremental data integration and propose an implementation based on phenotype ontologies and Bayesian probability updates. We apply the framework to quantify similarities between gene annotations and disease profiles. Within this scope, we categorize human diseases according to how well they can be recapitulated by animal models and quantify similarities between human diseases and mouse models produced by the International Mouse Phenotyping Consortium. The flexibility of the approach allows us to incorporate negative phenotypic data to better prioritize candidate genes, and to stratify disease mapping using sex-dependent phenotypes. All our association scores can be updated and we exploit this feature to showcase integration with curated annotations from high-precision assays. Incremental integration is thus a suitable framework for tracking functional annotations and linking to complex human pathology. Public Library of Science 2020-01-27 /pmc/articles/PMC7004389/ /pubmed/31986132 http://dx.doi.org/10.1371/journal.pcbi.1007586 Text en © 2020 Konopka, Smedley http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Konopka, Tomasz Smedley, Damian Incremental data integration for tracking genotype-disease associations |
title | Incremental data integration for tracking genotype-disease associations |
title_full | Incremental data integration for tracking genotype-disease associations |
title_fullStr | Incremental data integration for tracking genotype-disease associations |
title_full_unstemmed | Incremental data integration for tracking genotype-disease associations |
title_short | Incremental data integration for tracking genotype-disease associations |
title_sort | incremental data integration for tracking genotype-disease associations |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7004389/ https://www.ncbi.nlm.nih.gov/pubmed/31986132 http://dx.doi.org/10.1371/journal.pcbi.1007586 |
work_keys_str_mv | AT konopkatomasz incrementaldataintegrationfortrackinggenotypediseaseassociations AT smedleydamian incrementaldataintegrationfortrackinggenotypediseaseassociations |