Cargando…

Ontology based mining of pathogen–disease associations from literature

BACKGROUND: Infectious diseases claim millions of lives especially in the developing countries each year. Identification of causative pathogens accurately and rapidly plays a key role in the success of treatment. To support infectious disease research and mechanisms of infection, there is a need for...

Descripción completa

Detalles Bibliográficos
Autores principales: Kafkas, Şenay, Hoehndorf, Robert
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6751637/
https://www.ncbi.nlm.nih.gov/pubmed/31533864
http://dx.doi.org/10.1186/s13326-019-0208-2
_version_ 1783452649137373184
author Kafkas, Şenay
Hoehndorf, Robert
author_facet Kafkas, Şenay
Hoehndorf, Robert
author_sort Kafkas, Şenay
collection PubMed
description BACKGROUND: Infectious diseases claim millions of lives especially in the developing countries each year. Identification of causative pathogens accurately and rapidly plays a key role in the success of treatment. To support infectious disease research and mechanisms of infection, there is a need for an open resource on pathogen–disease associations that can be utilized in computational studies. A large number of pathogen–disease associations is available from the literature in unstructured form and we need automated methods to extract the data. RESULTS: We developed a text mining system designed for extracting pathogen–disease relations from literature. Our approach utilizes background knowledge from an ontology and statistical methods for extracting associations between pathogens and diseases. In total, we extracted a total of 3420 pathogen–disease associations from literature. We integrated our literature-derived associations into a database which links pathogens to their phenotypes for supporting infectious disease research. CONCLUSIONS: To the best of our knowledge, we present the first study focusing on extracting pathogen–disease associations from publications. We believe the text mined data can be utilized as a valuable resource for infectious disease research. All the data is publicly available from https://github.com/bio-ontology-research-group/padimi and through a public SPARQL endpoint from http://patho.phenomebrowser.net/.
format Online
Article
Text
id pubmed-6751637
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-67516372019-09-23 Ontology based mining of pathogen–disease associations from literature Kafkas, Şenay Hoehndorf, Robert J Biomed Semantics Research BACKGROUND: Infectious diseases claim millions of lives especially in the developing countries each year. Identification of causative pathogens accurately and rapidly plays a key role in the success of treatment. To support infectious disease research and mechanisms of infection, there is a need for an open resource on pathogen–disease associations that can be utilized in computational studies. A large number of pathogen–disease associations is available from the literature in unstructured form and we need automated methods to extract the data. RESULTS: We developed a text mining system designed for extracting pathogen–disease relations from literature. Our approach utilizes background knowledge from an ontology and statistical methods for extracting associations between pathogens and diseases. In total, we extracted a total of 3420 pathogen–disease associations from literature. We integrated our literature-derived associations into a database which links pathogens to their phenotypes for supporting infectious disease research. CONCLUSIONS: To the best of our knowledge, we present the first study focusing on extracting pathogen–disease associations from publications. We believe the text mined data can be utilized as a valuable resource for infectious disease research. All the data is publicly available from https://github.com/bio-ontology-research-group/padimi and through a public SPARQL endpoint from http://patho.phenomebrowser.net/. BioMed Central 2019-09-18 /pmc/articles/PMC6751637/ /pubmed/31533864 http://dx.doi.org/10.1186/s13326-019-0208-2 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Kafkas, Şenay
Hoehndorf, Robert
Ontology based mining of pathogen–disease associations from literature
title Ontology based mining of pathogen–disease associations from literature
title_full Ontology based mining of pathogen–disease associations from literature
title_fullStr Ontology based mining of pathogen–disease associations from literature
title_full_unstemmed Ontology based mining of pathogen–disease associations from literature
title_short Ontology based mining of pathogen–disease associations from literature
title_sort ontology based mining of pathogen–disease associations from literature
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6751637/
https://www.ncbi.nlm.nih.gov/pubmed/31533864
http://dx.doi.org/10.1186/s13326-019-0208-2
work_keys_str_mv AT kafkassenay ontologybasedminingofpathogendiseaseassociationsfromliterature
AT hoehndorfrobert ontologybasedminingofpathogendiseaseassociationsfromliterature