Cargando…

Detecting miRNA Mentions and Relations in Biomedical Literature

Introduction: MicroRNAs (miRNAs) have demonstrated their potential as post-transcriptional gene expression regulators, participating in a wide spectrum of regulatory events such as apoptosis, differentiation, and stress response. Apart from the role of miRNAs in normal physiology, their dysregulatio...

Descripción completa

Detalles Bibliográficos
Autores principales: Bagewadi, Shweta, Bobić, Tamara, Hofmann-Apitius, Martin, Fluck, Juliane, Klinger, Roman
Formato: Online Artículo Texto
Lenguaje:English
Publicado: F1000Research 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4602280/
https://www.ncbi.nlm.nih.gov/pubmed/26535109
http://dx.doi.org/10.12688/f1000research.4591.3
_version_ 1782394689597472768
author Bagewadi, Shweta
Bobić, Tamara
Hofmann-Apitius, Martin
Fluck, Juliane
Klinger, Roman
author_facet Bagewadi, Shweta
Bobić, Tamara
Hofmann-Apitius, Martin
Fluck, Juliane
Klinger, Roman
author_sort Bagewadi, Shweta
collection PubMed
description Introduction: MicroRNAs (miRNAs) have demonstrated their potential as post-transcriptional gene expression regulators, participating in a wide spectrum of regulatory events such as apoptosis, differentiation, and stress response. Apart from the role of miRNAs in normal physiology, their dysregulation is implicated in a vast array of diseases. Dissection of miRNA-related associations are valuable for contemplating their mechanism in diseases, leading to the discovery of novel miRNAs for disease prognosis, diagnosis, and therapy. Motivation: Apart from databases and prediction tools, miRNA-related information is largely available as unstructured text. Manual retrieval of these associations can be labor-intensive due to steadily growing number of publications. Additionally, most of the published miRNA entity recognition methods are keyword based, further subjected to manual inspection for retrieval of relations. Despite the fact that several databases host miRNA-associations derived from text, lower sensitivity and lack of published details for miRNA entity recognition and associated relations identification has motivated the need for developing comprehensive methods that are freely available for the scientific community. Additionally, the lack of a standard corpus for miRNA-relations has caused difficulty in evaluating the available systems. We propose methods to automatically extract mentions of miRNAs, species, genes/proteins, disease, and relations from scientific literature. Our generated corpora, along with dictionaries, and miRNA regular expression are freely available for academic purposes. To our knowledge, these resources are the most comprehensive developed so far. Results: The identification of specific miRNA mentions reaches a recall of 0.94 and precision of 0.93.  Extraction of miRNA-disease and miRNA-gene relations lead to an F (1) score of up to 0.76. A comparison of the information extracted by our approach to the databases miR2Disease and miRSel for the extraction of Alzheimer's disease related relations shows the capability of our proposed methods in identifying correct relations with improved sensitivity. The published resources and described methods can help the researchers for maximal retrieval of miRNA-relations and generation of miRNA-regulatory networks. Availability: The training and test corpora, annotation guidelines, developed dictionaries, and supplementary files are available at http://www.scai.fraunhofer.de/mirna-corpora.html
format Online
Article
Text
id pubmed-4602280
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher F1000Research
record_format MEDLINE/PubMed
spelling pubmed-46022802015-11-10 Detecting miRNA Mentions and Relations in Biomedical Literature Bagewadi, Shweta Bobić, Tamara Hofmann-Apitius, Martin Fluck, Juliane Klinger, Roman F1000Res Method Article Introduction: MicroRNAs (miRNAs) have demonstrated their potential as post-transcriptional gene expression regulators, participating in a wide spectrum of regulatory events such as apoptosis, differentiation, and stress response. Apart from the role of miRNAs in normal physiology, their dysregulation is implicated in a vast array of diseases. Dissection of miRNA-related associations are valuable for contemplating their mechanism in diseases, leading to the discovery of novel miRNAs for disease prognosis, diagnosis, and therapy. Motivation: Apart from databases and prediction tools, miRNA-related information is largely available as unstructured text. Manual retrieval of these associations can be labor-intensive due to steadily growing number of publications. Additionally, most of the published miRNA entity recognition methods are keyword based, further subjected to manual inspection for retrieval of relations. Despite the fact that several databases host miRNA-associations derived from text, lower sensitivity and lack of published details for miRNA entity recognition and associated relations identification has motivated the need for developing comprehensive methods that are freely available for the scientific community. Additionally, the lack of a standard corpus for miRNA-relations has caused difficulty in evaluating the available systems. We propose methods to automatically extract mentions of miRNAs, species, genes/proteins, disease, and relations from scientific literature. Our generated corpora, along with dictionaries, and miRNA regular expression are freely available for academic purposes. To our knowledge, these resources are the most comprehensive developed so far. Results: The identification of specific miRNA mentions reaches a recall of 0.94 and precision of 0.93.  Extraction of miRNA-disease and miRNA-gene relations lead to an F (1) score of up to 0.76. A comparison of the information extracted by our approach to the databases miR2Disease and miRSel for the extraction of Alzheimer's disease related relations shows the capability of our proposed methods in identifying correct relations with improved sensitivity. The published resources and described methods can help the researchers for maximal retrieval of miRNA-relations and generation of miRNA-regulatory networks. Availability: The training and test corpora, annotation guidelines, developed dictionaries, and supplementary files are available at http://www.scai.fraunhofer.de/mirna-corpora.html F1000Research 2015-10-01 /pmc/articles/PMC4602280/ /pubmed/26535109 http://dx.doi.org/10.12688/f1000research.4591.3 Text en Copyright: © 2015 Bagewadi S et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Method Article
Bagewadi, Shweta
Bobić, Tamara
Hofmann-Apitius, Martin
Fluck, Juliane
Klinger, Roman
Detecting miRNA Mentions and Relations in Biomedical Literature
title Detecting miRNA Mentions and Relations in Biomedical Literature
title_full Detecting miRNA Mentions and Relations in Biomedical Literature
title_fullStr Detecting miRNA Mentions and Relations in Biomedical Literature
title_full_unstemmed Detecting miRNA Mentions and Relations in Biomedical Literature
title_short Detecting miRNA Mentions and Relations in Biomedical Literature
title_sort detecting mirna mentions and relations in biomedical literature
topic Method Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4602280/
https://www.ncbi.nlm.nih.gov/pubmed/26535109
http://dx.doi.org/10.12688/f1000research.4591.3
work_keys_str_mv AT bagewadishweta detectingmirnamentionsandrelationsinbiomedicalliterature
AT bobictamara detectingmirnamentionsandrelationsinbiomedicalliterature
AT hofmannapitiusmartin detectingmirnamentionsandrelationsinbiomedicalliterature
AT fluckjuliane detectingmirnamentionsandrelationsinbiomedicalliterature
AT klingerroman detectingmirnamentionsandrelationsinbiomedicalliterature