Cargando…

Linking rare and common disease: mapping clinical disease-phenotypes to ontologies in therapeutic target validation

BACKGROUND: The Centre for Therapeutic Target Validation (CTTV - https://www.targetvalidation.org/) was established to generate therapeutic target evidence from genome-scale experiments and analyses. CTTV aims to support the validity of therapeutic targets by integrating existing and newly-generated...

Descripción completa

Detalles Bibliográficos
Autores principales: Sarntivijai, Sirarat, Vasant, Drashtti, Jupp, Simon, Saunders, Gary, Bento, A. Patrícia, Gonzalez, Daniel, Betts, Joanna, Hasan, Samiul, Koscielny, Gautier, Dunham, Ian, Parkinson, Helen, Malone, James
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4804633/
https://www.ncbi.nlm.nih.gov/pubmed/27011785
http://dx.doi.org/10.1186/s13326-016-0051-7
_version_ 1782423058887213056
author Sarntivijai, Sirarat
Vasant, Drashtti
Jupp, Simon
Saunders, Gary
Bento, A. Patrícia
Gonzalez, Daniel
Betts, Joanna
Hasan, Samiul
Koscielny, Gautier
Dunham, Ian
Parkinson, Helen
Malone, James
author_facet Sarntivijai, Sirarat
Vasant, Drashtti
Jupp, Simon
Saunders, Gary
Bento, A. Patrícia
Gonzalez, Daniel
Betts, Joanna
Hasan, Samiul
Koscielny, Gautier
Dunham, Ian
Parkinson, Helen
Malone, James
author_sort Sarntivijai, Sirarat
collection PubMed
description BACKGROUND: The Centre for Therapeutic Target Validation (CTTV - https://www.targetvalidation.org/) was established to generate therapeutic target evidence from genome-scale experiments and analyses. CTTV aims to support the validity of therapeutic targets by integrating existing and newly-generated data. Data integration has been achieved in some resources by mapping metadata such as disease and phenotypes to the Experimental Factor Ontology (EFO). Additionally, the relationship between ontology descriptions of rare and common diseases and their phenotypes can offer insights into shared biological mechanisms and potential drug targets. Ontologies are not ideal for representing the sometimes associated type relationship required. This work addresses two challenges; annotation of diverse big data, and representation of complex, sometimes associated relationships between concepts. METHODS: Semantic mapping uses a combination of custom scripting, our annotation tool ‘Zooma’, and expert curation. Disease-phenotype associations were generated using literature mining on Europe PubMed Central abstracts, which were manually verified by experts for validity. Representation of the disease-phenotype association was achieved by the Ontology of Biomedical AssociatioN (OBAN), a generic association representation model. OBAN represents associations between a subject and object i.e., disease and its associated phenotypes and the source of evidence for that association. The indirect disease-to-disease associations are exposed through shared phenotypes. This was applied to the use case of linking rare to common diseases at the CTTV. RESULTS: EFO yields an average of over 80 % of mapping coverage in all data sources. A 42 % precision is obtained from the manual verification of the text-mined disease-phenotype associations. This results in 1452 and 2810 disease-phenotype pairs for IBD and autoimmune disease and contributes towards 11,338 rare diseases associations (merged with existing published work [Am J Hum Genet 97:111-24, 2015]). An OBAN result file is downloadable at http://sourceforge.net/p/efo/code/HEAD/tree/trunk/src/efoassociations/. Twenty common diseases are linked to 85 rare diseases by shared phenotypes. A generalizable OBAN model for association representation is presented in this study. CONCLUSIONS: Here we present solutions to large-scale annotation-ontology mapping in the CTTV knowledge base, a process for disease-phenotype mining, and propose a generic association model, ‘OBAN’, as a means to integrate disease using shared phenotypes. AVAILABILITY: EFO is released monthly and available for download at http://www.ebi.ac.uk/efo/. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13326-016-0051-7) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4804633
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-48046332016-03-24 Linking rare and common disease: mapping clinical disease-phenotypes to ontologies in therapeutic target validation Sarntivijai, Sirarat Vasant, Drashtti Jupp, Simon Saunders, Gary Bento, A. Patrícia Gonzalez, Daniel Betts, Joanna Hasan, Samiul Koscielny, Gautier Dunham, Ian Parkinson, Helen Malone, James J Biomed Semantics Research BACKGROUND: The Centre for Therapeutic Target Validation (CTTV - https://www.targetvalidation.org/) was established to generate therapeutic target evidence from genome-scale experiments and analyses. CTTV aims to support the validity of therapeutic targets by integrating existing and newly-generated data. Data integration has been achieved in some resources by mapping metadata such as disease and phenotypes to the Experimental Factor Ontology (EFO). Additionally, the relationship between ontology descriptions of rare and common diseases and their phenotypes can offer insights into shared biological mechanisms and potential drug targets. Ontologies are not ideal for representing the sometimes associated type relationship required. This work addresses two challenges; annotation of diverse big data, and representation of complex, sometimes associated relationships between concepts. METHODS: Semantic mapping uses a combination of custom scripting, our annotation tool ‘Zooma’, and expert curation. Disease-phenotype associations were generated using literature mining on Europe PubMed Central abstracts, which were manually verified by experts for validity. Representation of the disease-phenotype association was achieved by the Ontology of Biomedical AssociatioN (OBAN), a generic association representation model. OBAN represents associations between a subject and object i.e., disease and its associated phenotypes and the source of evidence for that association. The indirect disease-to-disease associations are exposed through shared phenotypes. This was applied to the use case of linking rare to common diseases at the CTTV. RESULTS: EFO yields an average of over 80 % of mapping coverage in all data sources. A 42 % precision is obtained from the manual verification of the text-mined disease-phenotype associations. This results in 1452 and 2810 disease-phenotype pairs for IBD and autoimmune disease and contributes towards 11,338 rare diseases associations (merged with existing published work [Am J Hum Genet 97:111-24, 2015]). An OBAN result file is downloadable at http://sourceforge.net/p/efo/code/HEAD/tree/trunk/src/efoassociations/. Twenty common diseases are linked to 85 rare diseases by shared phenotypes. A generalizable OBAN model for association representation is presented in this study. CONCLUSIONS: Here we present solutions to large-scale annotation-ontology mapping in the CTTV knowledge base, a process for disease-phenotype mining, and propose a generic association model, ‘OBAN’, as a means to integrate disease using shared phenotypes. AVAILABILITY: EFO is released monthly and available for download at http://www.ebi.ac.uk/efo/. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13326-016-0051-7) contains supplementary material, which is available to authorized users. BioMed Central 2016-03-23 /pmc/articles/PMC4804633/ /pubmed/27011785 http://dx.doi.org/10.1186/s13326-016-0051-7 Text en © Sarntivijai et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Sarntivijai, Sirarat
Vasant, Drashtti
Jupp, Simon
Saunders, Gary
Bento, A. Patrícia
Gonzalez, Daniel
Betts, Joanna
Hasan, Samiul
Koscielny, Gautier
Dunham, Ian
Parkinson, Helen
Malone, James
Linking rare and common disease: mapping clinical disease-phenotypes to ontologies in therapeutic target validation
title Linking rare and common disease: mapping clinical disease-phenotypes to ontologies in therapeutic target validation
title_full Linking rare and common disease: mapping clinical disease-phenotypes to ontologies in therapeutic target validation
title_fullStr Linking rare and common disease: mapping clinical disease-phenotypes to ontologies in therapeutic target validation
title_full_unstemmed Linking rare and common disease: mapping clinical disease-phenotypes to ontologies in therapeutic target validation
title_short Linking rare and common disease: mapping clinical disease-phenotypes to ontologies in therapeutic target validation
title_sort linking rare and common disease: mapping clinical disease-phenotypes to ontologies in therapeutic target validation
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4804633/
https://www.ncbi.nlm.nih.gov/pubmed/27011785
http://dx.doi.org/10.1186/s13326-016-0051-7
work_keys_str_mv AT sarntivijaisirarat linkingrareandcommondiseasemappingclinicaldiseasephenotypestoontologiesintherapeutictargetvalidation
AT vasantdrashtti linkingrareandcommondiseasemappingclinicaldiseasephenotypestoontologiesintherapeutictargetvalidation
AT juppsimon linkingrareandcommondiseasemappingclinicaldiseasephenotypestoontologiesintherapeutictargetvalidation
AT saundersgary linkingrareandcommondiseasemappingclinicaldiseasephenotypestoontologiesintherapeutictargetvalidation
AT bentoapatricia linkingrareandcommondiseasemappingclinicaldiseasephenotypestoontologiesintherapeutictargetvalidation
AT gonzalezdaniel linkingrareandcommondiseasemappingclinicaldiseasephenotypestoontologiesintherapeutictargetvalidation
AT bettsjoanna linkingrareandcommondiseasemappingclinicaldiseasephenotypestoontologiesintherapeutictargetvalidation
AT hasansamiul linkingrareandcommondiseasemappingclinicaldiseasephenotypestoontologiesintherapeutictargetvalidation
AT koscielnygautier linkingrareandcommondiseasemappingclinicaldiseasephenotypestoontologiesintherapeutictargetvalidation
AT dunhamian linkingrareandcommondiseasemappingclinicaldiseasephenotypestoontologiesintherapeutictargetvalidation
AT parkinsonhelen linkingrareandcommondiseasemappingclinicaldiseasephenotypestoontologiesintherapeutictargetvalidation
AT malonejames linkingrareandcommondiseasemappingclinicaldiseasephenotypestoontologiesintherapeutictargetvalidation