SicknessMiner: a deep-learning-driven text-mining tool to abridge disease-disease associations

Bibliographic Details
Main authors: Rosário-Ferreira, Nícia; Guimarães, Victor; Costa, Vítor S.; Moreira, Irina S.
Format: Online Article Text
Language: English
Published: BioMed Central 2021
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8491382/
https://www.ncbi.nlm.nih.gov/pubmed/34607568
http://dx.doi.org/10.1186/s12859-021-04397-w
Description
Summary:
BACKGROUND: Blood cancers (BCs) are responsible for over 720,000 deaths worldwide each year. Their prevalence and mortality rate underscore the relevance of BC research. Although several resources establish Disease-Disease Associations (DDAs), this knowledge is scattered and not readily accessible to the scientific community. Here, we propose SicknessMiner, a biomedical Text-Mining (TM) approach towards the centralization of DDAs. Our methodology encompasses Named Entity Recognition (NER) and Named Entity Normalization (NEN) steps, and the retrieved DDAs were compared qualitatively and quantitatively against the DisGeNET resource.
RESULTS: We obtained DDAs via co-mention using SicknessMiner, or via gene- or variant-disease similarity in DisGeNET. SicknessMiner retrieved around 92% of the DisGeNET results, and nearly 15% of the SicknessMiner results were specific to our pipeline.
CONCLUSIONS: SicknessMiner is a valuable tool for extracting disease-disease relationships from a raw input corpus.
SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04397-w.
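
The co-mention idea and the set comparison described in the summary can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes the NER and NEN steps have already reduced each document to a list of normalized disease identifiers, and the function names, the identifiers (`D1`, `D2`, ...), and the toy reference set standing in for DisGeNET are all hypothetical.

```python
from collections import Counter
from itertools import combinations


def co_mention_pairs(docs):
    """Count unordered disease pairs that appear together in a document.

    `docs` is an iterable of lists of normalized disease identifiers,
    i.e. the assumed output of upstream NER + NEN steps.
    """
    counts = Counter()
    for disease_ids in docs:
        # Deduplicate and sort so each pair has one canonical ordering.
        unique = sorted(set(disease_ids))
        for a, b in combinations(unique, 2):
            counts[(a, b)] += 1
    return counts


def overlap_stats(mined, reference):
    """Compare a mined pair set against a reference pair set."""
    shared = mined & reference
    return {
        # Fraction of reference pairs that the mining step also found.
        "reference_recovered": len(shared) / len(reference) if reference else 0.0,
        # Fraction of mined pairs that are absent from the reference.
        "mined_specific": len(mined - reference) / len(mined) if mined else 0.0,
    }


# Hypothetical toy input: three documents with made-up disease identifiers.
docs = [
    ["D1", "D2", "D3"],
    ["D1", "D2"],
    ["D2", "D4", "D2"],
]
pairs = co_mention_pairs(docs)

# Hypothetical reference associations (a stand-in for a curated resource).
reference = {("D1", "D2"), ("D2", "D3"), ("D1", "D4")}
stats = overlap_stats(set(pairs), reference)
print(pairs)
print(stats)
```

The two ratios in `overlap_stats` correspond in spirit to the two percentages reported in the summary (share of reference results retrieved, and share of mined results specific to the pipeline), though how the paper computes them exactly is not stated in this record.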