Cargando…
Linking rare and common disease: mapping clinical disease-phenotypes to ontologies in therapeutic target validation
BACKGROUND: The Centre for Therapeutic Target Validation (CTTV - https://www.targetvalidation.org/) was established to generate therapeutic target evidence from genome-scale experiments and analyses. CTTV aims to support the validity of therapeutic targets by integrating existing and newly-generated...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4804633/ https://www.ncbi.nlm.nih.gov/pubmed/27011785 http://dx.doi.org/10.1186/s13326-016-0051-7 |
_version_ | 1782423058887213056 |
---|---|
author | Sarntivijai, Sirarat Vasant, Drashtti Jupp, Simon Saunders, Gary Bento, A. Patrícia Gonzalez, Daniel Betts, Joanna Hasan, Samiul Koscielny, Gautier Dunham, Ian Parkinson, Helen Malone, James |
author_facet | Sarntivijai, Sirarat Vasant, Drashtti Jupp, Simon Saunders, Gary Bento, A. Patrícia Gonzalez, Daniel Betts, Joanna Hasan, Samiul Koscielny, Gautier Dunham, Ian Parkinson, Helen Malone, James |
author_sort | Sarntivijai, Sirarat |
collection | PubMed |
description | BACKGROUND: The Centre for Therapeutic Target Validation (CTTV - https://www.targetvalidation.org/) was established to generate therapeutic target evidence from genome-scale experiments and analyses. CTTV aims to support the validity of therapeutic targets by integrating existing and newly-generated data. Data integration has been achieved in some resources by mapping metadata such as disease and phenotypes to the Experimental Factor Ontology (EFO). Additionally, the relationship between ontology descriptions of rare and common diseases and their phenotypes can offer insights into shared biological mechanisms and potential drug targets. Ontologies are not ideal for representing the sometimes associated type relationship required. This work addresses two challenges; annotation of diverse big data, and representation of complex, sometimes associated relationships between concepts. METHODS: Semantic mapping uses a combination of custom scripting, our annotation tool ‘Zooma’, and expert curation. Disease-phenotype associations were generated using literature mining on Europe PubMed Central abstracts, which were manually verified by experts for validity. Representation of the disease-phenotype association was achieved by the Ontology of Biomedical AssociatioN (OBAN), a generic association representation model. OBAN represents associations between a subject and object i.e., disease and its associated phenotypes and the source of evidence for that association. The indirect disease-to-disease associations are exposed through shared phenotypes. This was applied to the use case of linking rare to common diseases at the CTTV. RESULTS: EFO yields an average of over 80 % of mapping coverage in all data sources. A 42 % precision is obtained from the manual verification of the text-mined disease-phenotype associations. This results in 1452 and 2810 disease-phenotype pairs for IBD and autoimmune disease and contributes towards 11,338 rare diseases associations (merged with existing published work [Am J Hum Genet 97:111-24, 2015]). An OBAN result file is downloadable at http://sourceforge.net/p/efo/code/HEAD/tree/trunk/src/efoassociations/. Twenty common diseases are linked to 85 rare diseases by shared phenotypes. A generalizable OBAN model for association representation is presented in this study. CONCLUSIONS: Here we present solutions to large-scale annotation-ontology mapping in the CTTV knowledge base, a process for disease-phenotype mining, and propose a generic association model, ‘OBAN’, as a means to integrate disease using shared phenotypes. AVAILABILITY: EFO is released monthly and available for download at http://www.ebi.ac.uk/efo/. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13326-016-0051-7) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4804633 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-48046332016-03-24 Linking rare and common disease: mapping clinical disease-phenotypes to ontologies in therapeutic target validation Sarntivijai, Sirarat Vasant, Drashtti Jupp, Simon Saunders, Gary Bento, A. Patrícia Gonzalez, Daniel Betts, Joanna Hasan, Samiul Koscielny, Gautier Dunham, Ian Parkinson, Helen Malone, James J Biomed Semantics Research BACKGROUND: The Centre for Therapeutic Target Validation (CTTV - https://www.targetvalidation.org/) was established to generate therapeutic target evidence from genome-scale experiments and analyses. CTTV aims to support the validity of therapeutic targets by integrating existing and newly-generated data. Data integration has been achieved in some resources by mapping metadata such as disease and phenotypes to the Experimental Factor Ontology (EFO). Additionally, the relationship between ontology descriptions of rare and common diseases and their phenotypes can offer insights into shared biological mechanisms and potential drug targets. Ontologies are not ideal for representing the sometimes associated type relationship required. This work addresses two challenges; annotation of diverse big data, and representation of complex, sometimes associated relationships between concepts. METHODS: Semantic mapping uses a combination of custom scripting, our annotation tool ‘Zooma’, and expert curation. Disease-phenotype associations were generated using literature mining on Europe PubMed Central abstracts, which were manually verified by experts for validity. Representation of the disease-phenotype association was achieved by the Ontology of Biomedical AssociatioN (OBAN), a generic association representation model. OBAN represents associations between a subject and object i.e., disease and its associated phenotypes and the source of evidence for that association. The indirect disease-to-disease associations are exposed through shared phenotypes. This was applied to the use case of linking rare to common diseases at the CTTV. RESULTS: EFO yields an average of over 80 % of mapping coverage in all data sources. A 42 % precision is obtained from the manual verification of the text-mined disease-phenotype associations. This results in 1452 and 2810 disease-phenotype pairs for IBD and autoimmune disease and contributes towards 11,338 rare diseases associations (merged with existing published work [Am J Hum Genet 97:111-24, 2015]). An OBAN result file is downloadable at http://sourceforge.net/p/efo/code/HEAD/tree/trunk/src/efoassociations/. Twenty common diseases are linked to 85 rare diseases by shared phenotypes. A generalizable OBAN model for association representation is presented in this study. CONCLUSIONS: Here we present solutions to large-scale annotation-ontology mapping in the CTTV knowledge base, a process for disease-phenotype mining, and propose a generic association model, ‘OBAN’, as a means to integrate disease using shared phenotypes. AVAILABILITY: EFO is released monthly and available for download at http://www.ebi.ac.uk/efo/. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13326-016-0051-7) contains supplementary material, which is available to authorized users. BioMed Central 2016-03-23 /pmc/articles/PMC4804633/ /pubmed/27011785 http://dx.doi.org/10.1186/s13326-016-0051-7 Text en © Sarntivijai et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Sarntivijai, Sirarat Vasant, Drashtti Jupp, Simon Saunders, Gary Bento, A. Patrícia Gonzalez, Daniel Betts, Joanna Hasan, Samiul Koscielny, Gautier Dunham, Ian Parkinson, Helen Malone, James Linking rare and common disease: mapping clinical disease-phenotypes to ontologies in therapeutic target validation |
title | Linking rare and common disease: mapping clinical disease-phenotypes to ontologies in therapeutic target validation |
title_full | Linking rare and common disease: mapping clinical disease-phenotypes to ontologies in therapeutic target validation |
title_fullStr | Linking rare and common disease: mapping clinical disease-phenotypes to ontologies in therapeutic target validation |
title_full_unstemmed | Linking rare and common disease: mapping clinical disease-phenotypes to ontologies in therapeutic target validation |
title_short | Linking rare and common disease: mapping clinical disease-phenotypes to ontologies in therapeutic target validation |
title_sort | linking rare and common disease: mapping clinical disease-phenotypes to ontologies in therapeutic target validation |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4804633/ https://www.ncbi.nlm.nih.gov/pubmed/27011785 http://dx.doi.org/10.1186/s13326-016-0051-7 |
work_keys_str_mv | AT sarntivijaisirarat linkingrareandcommondiseasemappingclinicaldiseasephenotypestoontologiesintherapeutictargetvalidation AT vasantdrashtti linkingrareandcommondiseasemappingclinicaldiseasephenotypestoontologiesintherapeutictargetvalidation AT juppsimon linkingrareandcommondiseasemappingclinicaldiseasephenotypestoontologiesintherapeutictargetvalidation AT saundersgary linkingrareandcommondiseasemappingclinicaldiseasephenotypestoontologiesintherapeutictargetvalidation AT bentoapatricia linkingrareandcommondiseasemappingclinicaldiseasephenotypestoontologiesintherapeutictargetvalidation AT gonzalezdaniel linkingrareandcommondiseasemappingclinicaldiseasephenotypestoontologiesintherapeutictargetvalidation AT bettsjoanna linkingrareandcommondiseasemappingclinicaldiseasephenotypestoontologiesintherapeutictargetvalidation AT hasansamiul linkingrareandcommondiseasemappingclinicaldiseasephenotypestoontologiesintherapeutictargetvalidation AT koscielnygautier linkingrareandcommondiseasemappingclinicaldiseasephenotypestoontologiesintherapeutictargetvalidation AT dunhamian linkingrareandcommondiseasemappingclinicaldiseasephenotypestoontologiesintherapeutictargetvalidation AT parkinsonhelen linkingrareandcommondiseasemappingclinicaldiseasephenotypestoontologiesintherapeutictargetvalidation AT malonejames linkingrareandcommondiseasemappingclinicaldiseasephenotypestoontologiesintherapeutictargetvalidation |