Cargando…
Automated Network Assembly of Mechanistic Literature for Informed Evidence Identification to Support Cancer Risk Assessment
BACKGROUND: Mechanistic data is increasingly used in hazard identification of chemicals. However, the volume of data is large, challenging the efficient identification and clustering of relevant data. OBJECTIVES: We investigated whether evidence identification for hazard assessment can become more e...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Environmental Health Perspectives
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8893280/ https://www.ncbi.nlm.nih.gov/pubmed/35238605 http://dx.doi.org/10.1289/EHP9112 |
_version_ | 1784662356706459648 |
---|---|
author | Scholten, Bernice Simón, Laura Guerrero Krishnan, Shaji Vermeulen, Roel Pronk, Anjoeka Gyori, Benjamin M. Bachman, John A. Vlaanderen, Jelle Stierum, Rob |
author_facet | Scholten, Bernice Simón, Laura Guerrero Krishnan, Shaji Vermeulen, Roel Pronk, Anjoeka Gyori, Benjamin M. Bachman, John A. Vlaanderen, Jelle Stierum, Rob |
author_sort | Scholten, Bernice |
collection | PubMed |
description | BACKGROUND: Mechanistic data is increasingly used in hazard identification of chemicals. However, the volume of data is large, challenging the efficient identification and clustering of relevant data. OBJECTIVES: We investigated whether evidence identification for hazard assessment can become more efficient and informed through an automated approach that combines machine reading of publications with network visualization tools. METHODS: We chose 13 chemicals that were evaluated by the International Agency for Research on Cancer (IARC) Monographs program incorporating the key characteristics of carcinogens (KCCs) approach. Using established literature search terms for KCCs, we retrieved and analyzed literature using Integrated Network and Dynamical Reasoning Assembler (INDRA). INDRA combines large-scale literature processing with pathway databases and extracts relationships between biomolecules, bioprocesses, and chemicals into statements (e.g., “benzene activates DNA damage”). These statements were subsequently assembled into networks and compared with the KCC evaluation by the IARC, to evaluate the informativeness of our approach. RESULTS: We found, in general, larger networks for those chemicals which the IARC has evaluated the evidence to be strong for KCC induction. Larger networks were not directly linked to publication count, given that we retrieved small networks for several chemicals with little support for KCC activation according to the IARC, despite the significant volume of literature for these specific chemicals. In addition, interpreting networks for genotoxicity and DNA repair showed concordance with the IARC KCC evaluation. DISCUSSION: Our method is an automated approach to condense mechanistic literature into searchable and interpretable networks based on an a priori ontology. The approach is no replacement of expert evaluation but, instead, provides an informed structure for experts to quickly identify which statements are made in which papers and how these could connect. We focused on the KCCs because these are supported by well-described search terms. The method needs to be tested in other frameworks as well to demonstrate its generalizability. https://doi.org/10.1289/EHP9112 |
format | Online Article Text |
id | pubmed-8893280 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Environmental Health Perspectives |
record_format | MEDLINE/PubMed |
spelling | pubmed-88932802022-03-07 Automated Network Assembly of Mechanistic Literature for Informed Evidence Identification to Support Cancer Risk Assessment Scholten, Bernice Simón, Laura Guerrero Krishnan, Shaji Vermeulen, Roel Pronk, Anjoeka Gyori, Benjamin M. Bachman, John A. Vlaanderen, Jelle Stierum, Rob Environ Health Perspect Research BACKGROUND: Mechanistic data is increasingly used in hazard identification of chemicals. However, the volume of data is large, challenging the efficient identification and clustering of relevant data. OBJECTIVES: We investigated whether evidence identification for hazard assessment can become more efficient and informed through an automated approach that combines machine reading of publications with network visualization tools. METHODS: We chose 13 chemicals that were evaluated by the International Agency for Research on Cancer (IARC) Monographs program incorporating the key characteristics of carcinogens (KCCs) approach. Using established literature search terms for KCCs, we retrieved and analyzed literature using Integrated Network and Dynamical Reasoning Assembler (INDRA). INDRA combines large-scale literature processing with pathway databases and extracts relationships between biomolecules, bioprocesses, and chemicals into statements (e.g., “benzene activates DNA damage”). These statements were subsequently assembled into networks and compared with the KCC evaluation by the IARC, to evaluate the informativeness of our approach. RESULTS: We found, in general, larger networks for those chemicals which the IARC has evaluated the evidence to be strong for KCC induction. Larger networks were not directly linked to publication count, given that we retrieved small networks for several chemicals with little support for KCC activation according to the IARC, despite the significant volume of literature for these specific chemicals. In addition, interpreting networks for genotoxicity and DNA repair showed concordance with the IARC KCC evaluation. DISCUSSION: Our method is an automated approach to condense mechanistic literature into searchable and interpretable networks based on an a priori ontology. The approach is no replacement of expert evaluation but, instead, provides an informed structure for experts to quickly identify which statements are made in which papers and how these could connect. We focused on the KCCs because these are supported by well-described search terms. The method needs to be tested in other frameworks as well to demonstrate its generalizability. https://doi.org/10.1289/EHP9112 Environmental Health Perspectives 2022-03-03 /pmc/articles/PMC8893280/ /pubmed/35238605 http://dx.doi.org/10.1289/EHP9112 Text en https://ehp.niehs.nih.gov/about-ehp/licenseEHP is an open-access journal published with support from the National Institute of Environmental Health Sciences, National Institutes of Health. All content is public domain unless otherwise noted. |
spellingShingle | Research Scholten, Bernice Simón, Laura Guerrero Krishnan, Shaji Vermeulen, Roel Pronk, Anjoeka Gyori, Benjamin M. Bachman, John A. Vlaanderen, Jelle Stierum, Rob Automated Network Assembly of Mechanistic Literature for Informed Evidence Identification to Support Cancer Risk Assessment |
title | Automated Network Assembly of Mechanistic Literature for Informed Evidence Identification to Support Cancer Risk Assessment |
title_full | Automated Network Assembly of Mechanistic Literature for Informed Evidence Identification to Support Cancer Risk Assessment |
title_fullStr | Automated Network Assembly of Mechanistic Literature for Informed Evidence Identification to Support Cancer Risk Assessment |
title_full_unstemmed | Automated Network Assembly of Mechanistic Literature for Informed Evidence Identification to Support Cancer Risk Assessment |
title_short | Automated Network Assembly of Mechanistic Literature for Informed Evidence Identification to Support Cancer Risk Assessment |
title_sort | automated network assembly of mechanistic literature for informed evidence identification to support cancer risk assessment |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8893280/ https://www.ncbi.nlm.nih.gov/pubmed/35238605 http://dx.doi.org/10.1289/EHP9112 |
work_keys_str_mv | AT scholtenbernice automatednetworkassemblyofmechanisticliteratureforinformedevidenceidentificationtosupportcancerriskassessment AT simonlauraguerrero automatednetworkassemblyofmechanisticliteratureforinformedevidenceidentificationtosupportcancerriskassessment AT krishnanshaji automatednetworkassemblyofmechanisticliteratureforinformedevidenceidentificationtosupportcancerriskassessment AT vermeulenroel automatednetworkassemblyofmechanisticliteratureforinformedevidenceidentificationtosupportcancerriskassessment AT pronkanjoeka automatednetworkassemblyofmechanisticliteratureforinformedevidenceidentificationtosupportcancerriskassessment AT gyoribenjaminm automatednetworkassemblyofmechanisticliteratureforinformedevidenceidentificationtosupportcancerriskassessment AT bachmanjohna automatednetworkassemblyofmechanisticliteratureforinformedevidenceidentificationtosupportcancerriskassessment AT vlaanderenjelle automatednetworkassemblyofmechanisticliteratureforinformedevidenceidentificationtosupportcancerriskassessment AT stierumrob automatednetworkassemblyofmechanisticliteratureforinformedevidenceidentificationtosupportcancerriskassessment |