Cargando…

Using transfer learning-based causality extraction to mine latent factors for Sjögren's syndrome from biomedical literature

Understanding causality is a longstanding goal across many different domains. Different articles, such as those published in medical journals, disseminate newly discovered knowledge that is often causal. In this paper, we use this intuition to build a model that leverages causal relations to unearth...

Descripción completa

Detalles Bibliográficos
Autores principales: VanSchaik, Jack T., Jain, Palak, Rajapuri, Anushri, Cheriyan, Biju, Thyvalikakath, Thankam P., Chakraborty, Sunandan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10558331/
https://www.ncbi.nlm.nih.gov/pubmed/37809371
http://dx.doi.org/10.1016/j.heliyon.2023.e19265
_version_ 1785117251547955200
author VanSchaik, Jack T.
Jain, Palak
Rajapuri, Anushri
Cheriyan, Biju
Thyvalikakath, Thankam P.
Chakraborty, Sunandan
author_facet VanSchaik, Jack T.
Jain, Palak
Rajapuri, Anushri
Cheriyan, Biju
Thyvalikakath, Thankam P.
Chakraborty, Sunandan
author_sort VanSchaik, Jack T.
collection PubMed
description Understanding causality is a longstanding goal across many different domains. Different articles, such as those published in medical journals, disseminate newly discovered knowledge that is often causal. In this paper, we use this intuition to build a model that leverages causal relations to unearth factors related to Sjögren's syndrome from biomedical literature. Sjögren's syndrome is an autoimmune disease affecting up to 3.1 million Americans. Due to the uncommon nature of the illness, symptoms across different specialties coupled with common symptoms of other autoimmune conditions such as rheumatoid arthritis, it is difficult for clinicians to diagnose the disease timely. Due to the lack of a dedicated dataset for causal relationships built from biomedical literature, we propose a transfer learning-based approach, where the relationship extraction model is trained on a wide variety of datasets. We conduct an empirical analysis of numerous neural network architectures and data transfer strategies for causal relation extraction. By conducting experiments with various contextual embedding layers and architectural components, we show that an ELECTRA-based sentence-level relation extraction model generalizes better than other architectures across varying web-based sources and annotation strategies. We use this empirical observation to create a pipeline for identifying causal sentences from literature text, extracting the causal relationships from causal sentences, and building a causal network consisting of latent factors related to Sjögren's syndrome. We show that our approach can retrieve such factors with high precision and recall values. Comparative experiments show that this approach leads to 25% improvement in retrieval F1-score compared to several state-of-the-art biomedical models, including BioBERT and Gram-CNN. We apply this model to a corpus of research articles related to Sjögren's syndrome collected from PubMed to create a causal network for Sjögren's syndrome. The proposed causal network for Sjögren's syndrome will potentially help clinicians with a holistic knowledge base for faster diagnosis.
format Online
Article
Text
id pubmed-10558331
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-105583312023-10-08 Using transfer learning-based causality extraction to mine latent factors for Sjögren's syndrome from biomedical literature VanSchaik, Jack T. Jain, Palak Rajapuri, Anushri Cheriyan, Biju Thyvalikakath, Thankam P. Chakraborty, Sunandan Heliyon Research Article Understanding causality is a longstanding goal across many different domains. Different articles, such as those published in medical journals, disseminate newly discovered knowledge that is often causal. In this paper, we use this intuition to build a model that leverages causal relations to unearth factors related to Sjögren's syndrome from biomedical literature. Sjögren's syndrome is an autoimmune disease affecting up to 3.1 million Americans. Due to the uncommon nature of the illness, symptoms across different specialties coupled with common symptoms of other autoimmune conditions such as rheumatoid arthritis, it is difficult for clinicians to diagnose the disease timely. Due to the lack of a dedicated dataset for causal relationships built from biomedical literature, we propose a transfer learning-based approach, where the relationship extraction model is trained on a wide variety of datasets. We conduct an empirical analysis of numerous neural network architectures and data transfer strategies for causal relation extraction. By conducting experiments with various contextual embedding layers and architectural components, we show that an ELECTRA-based sentence-level relation extraction model generalizes better than other architectures across varying web-based sources and annotation strategies. We use this empirical observation to create a pipeline for identifying causal sentences from literature text, extracting the causal relationships from causal sentences, and building a causal network consisting of latent factors related to Sjögren's syndrome. We show that our approach can retrieve such factors with high precision and recall values. Comparative experiments show that this approach leads to 25% improvement in retrieval F1-score compared to several state-of-the-art biomedical models, including BioBERT and Gram-CNN. We apply this model to a corpus of research articles related to Sjögren's syndrome collected from PubMed to create a causal network for Sjögren's syndrome. The proposed causal network for Sjögren's syndrome will potentially help clinicians with a holistic knowledge base for faster diagnosis. Elsevier 2023-08-22 /pmc/articles/PMC10558331/ /pubmed/37809371 http://dx.doi.org/10.1016/j.heliyon.2023.e19265 Text en © 2023 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Research Article
VanSchaik, Jack T.
Jain, Palak
Rajapuri, Anushri
Cheriyan, Biju
Thyvalikakath, Thankam P.
Chakraborty, Sunandan
Using transfer learning-based causality extraction to mine latent factors for Sjögren's syndrome from biomedical literature
title Using transfer learning-based causality extraction to mine latent factors for Sjögren's syndrome from biomedical literature
title_full Using transfer learning-based causality extraction to mine latent factors for Sjögren's syndrome from biomedical literature
title_fullStr Using transfer learning-based causality extraction to mine latent factors for Sjögren's syndrome from biomedical literature
title_full_unstemmed Using transfer learning-based causality extraction to mine latent factors for Sjögren's syndrome from biomedical literature
title_short Using transfer learning-based causality extraction to mine latent factors for Sjögren's syndrome from biomedical literature
title_sort using transfer learning-based causality extraction to mine latent factors for sjögren's syndrome from biomedical literature
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10558331/
https://www.ncbi.nlm.nih.gov/pubmed/37809371
http://dx.doi.org/10.1016/j.heliyon.2023.e19265
work_keys_str_mv AT vanschaikjackt usingtransferlearningbasedcausalityextractiontominelatentfactorsforsjogrenssyndromefrombiomedicalliterature
AT jainpalak usingtransferlearningbasedcausalityextractiontominelatentfactorsforsjogrenssyndromefrombiomedicalliterature
AT rajapurianushri usingtransferlearningbasedcausalityextractiontominelatentfactorsforsjogrenssyndromefrombiomedicalliterature
AT cheriyanbiju usingtransferlearningbasedcausalityextractiontominelatentfactorsforsjogrenssyndromefrombiomedicalliterature
AT thyvalikakaththankamp usingtransferlearningbasedcausalityextractiontominelatentfactorsforsjogrenssyndromefrombiomedicalliterature
AT chakrabortysunandan usingtransferlearningbasedcausalityextractiontominelatentfactorsforsjogrenssyndromefrombiomedicalliterature