Cargando…
causalizeR: a text mining algorithm to identify causal relationships in scientific literature
Complex interactions among multiple abiotic and biotic drivers result in rapid changes in ecosystems worldwide. Predicting how specific interactions can cause ripple effects potentially resulting in abrupt shifts in ecosystems is of high relevance to policymakers, but difficult to quantify using dat...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8300496/ https://www.ncbi.nlm.nih.gov/pubmed/34322328 http://dx.doi.org/10.7717/peerj.11850 |
_version_ | 1783726474487922688 |
---|---|
author | Ancin-Murguzur, Francisco J. Hausner, Vera H. |
author_facet | Ancin-Murguzur, Francisco J. Hausner, Vera H. |
author_sort | Ancin-Murguzur, Francisco J. |
collection | PubMed |
description | Complex interactions among multiple abiotic and biotic drivers result in rapid changes in ecosystems worldwide. Predicting how specific interactions can cause ripple effects potentially resulting in abrupt shifts in ecosystems is of high relevance to policymakers, but difficult to quantify using data from singular cases. We present causalizeR (https://github.com/fjmurguzur/causalizeR), a text-processing algorithm that extracts causal relations from literature based on simple grammatical rules that can be used to synthesize evidence in unstructured texts in a structured manner. The algorithm extracts causal links using the relative position of nouns relative to the keyword of choice to extract the cause and effects of interest. The resulting database can be combined with network analysis tools to estimate the direct and indirect effects of multiple drivers at the network level, which is useful for synthesizing available knowledge and for hypothesis creation and testing. We illustrate the use of the algorithm by detecting causal relationships in scientific literature relating to the tundra ecosystem. |
format | Online Article Text |
id | pubmed-8300496 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-83004962021-07-27 causalizeR: a text mining algorithm to identify causal relationships in scientific literature Ancin-Murguzur, Francisco J. Hausner, Vera H. PeerJ Bioinformatics Complex interactions among multiple abiotic and biotic drivers result in rapid changes in ecosystems worldwide. Predicting how specific interactions can cause ripple effects potentially resulting in abrupt shifts in ecosystems is of high relevance to policymakers, but difficult to quantify using data from singular cases. We present causalizeR (https://github.com/fjmurguzur/causalizeR), a text-processing algorithm that extracts causal relations from literature based on simple grammatical rules that can be used to synthesize evidence in unstructured texts in a structured manner. The algorithm extracts causal links using the relative position of nouns relative to the keyword of choice to extract the cause and effects of interest. The resulting database can be combined with network analysis tools to estimate the direct and indirect effects of multiple drivers at the network level, which is useful for synthesizing available knowledge and for hypothesis creation and testing. We illustrate the use of the algorithm by detecting causal relationships in scientific literature relating to the tundra ecosystem. PeerJ Inc. 2021-07-20 /pmc/articles/PMC8300496/ /pubmed/34322328 http://dx.doi.org/10.7717/peerj.11850 Text en ©2021 Ancin-Murguzur and Hausner https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited. |
spellingShingle | Bioinformatics Ancin-Murguzur, Francisco J. Hausner, Vera H. causalizeR: a text mining algorithm to identify causal relationships in scientific literature |
title | causalizeR: a text mining algorithm to identify causal relationships in scientific literature |
title_full | causalizeR: a text mining algorithm to identify causal relationships in scientific literature |
title_fullStr | causalizeR: a text mining algorithm to identify causal relationships in scientific literature |
title_full_unstemmed | causalizeR: a text mining algorithm to identify causal relationships in scientific literature |
title_short | causalizeR: a text mining algorithm to identify causal relationships in scientific literature |
title_sort | causalizer: a text mining algorithm to identify causal relationships in scientific literature |
topic | Bioinformatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8300496/ https://www.ncbi.nlm.nih.gov/pubmed/34322328 http://dx.doi.org/10.7717/peerj.11850 |
work_keys_str_mv | AT ancinmurguzurfranciscoj causalizeratextminingalgorithmtoidentifycausalrelationshipsinscientificliterature AT hausnerverah causalizeratextminingalgorithmtoidentifycausalrelationshipsinscientificliterature |