Cargando…

Automated Network Assembly of Mechanistic Literature for Informed Evidence Identification to Support Cancer Risk Assessment

BACKGROUND: Mechanistic data is increasingly used in hazard identification of chemicals. However, the volume of data is large, challenging the efficient identification and clustering of relevant data. OBJECTIVES: We investigated whether evidence identification for hazard assessment can become more e...

Descripción completa

Detalles Bibliográficos
Autores principales: Scholten, Bernice, Simón, Laura Guerrero, Krishnan, Shaji, Vermeulen, Roel, Pronk, Anjoeka, Gyori, Benjamin M., Bachman, John A., Vlaanderen, Jelle, Stierum, Rob
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Environmental Health Perspectives 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8893280/
https://www.ncbi.nlm.nih.gov/pubmed/35238605
http://dx.doi.org/10.1289/EHP9112
_version_ 1784662356706459648
author Scholten, Bernice
Simón, Laura Guerrero
Krishnan, Shaji
Vermeulen, Roel
Pronk, Anjoeka
Gyori, Benjamin M.
Bachman, John A.
Vlaanderen, Jelle
Stierum, Rob
author_facet Scholten, Bernice
Simón, Laura Guerrero
Krishnan, Shaji
Vermeulen, Roel
Pronk, Anjoeka
Gyori, Benjamin M.
Bachman, John A.
Vlaanderen, Jelle
Stierum, Rob
author_sort Scholten, Bernice
collection PubMed
description BACKGROUND: Mechanistic data is increasingly used in hazard identification of chemicals. However, the volume of data is large, challenging the efficient identification and clustering of relevant data. OBJECTIVES: We investigated whether evidence identification for hazard assessment can become more efficient and informed through an automated approach that combines machine reading of publications with network visualization tools. METHODS: We chose 13 chemicals that were evaluated by the International Agency for Research on Cancer (IARC) Monographs program incorporating the key characteristics of carcinogens (KCCs) approach. Using established literature search terms for KCCs, we retrieved and analyzed literature using Integrated Network and Dynamical Reasoning Assembler (INDRA). INDRA combines large-scale literature processing with pathway databases and extracts relationships between biomolecules, bioprocesses, and chemicals into statements (e.g., “benzene activates DNA damage”). These statements were subsequently assembled into networks and compared with the KCC evaluation by the IARC, to evaluate the informativeness of our approach. RESULTS: We found, in general, larger networks for those chemicals which the IARC has evaluated the evidence to be strong for KCC induction. Larger networks were not directly linked to publication count, given that we retrieved small networks for several chemicals with little support for KCC activation according to the IARC, despite the significant volume of literature for these specific chemicals. In addition, interpreting networks for genotoxicity and DNA repair showed concordance with the IARC KCC evaluation. DISCUSSION: Our method is an automated approach to condense mechanistic literature into searchable and interpretable networks based on an a priori ontology. The approach is no replacement of expert evaluation but, instead, provides an informed structure for experts to quickly identify which statements are made in which papers and how these could connect. We focused on the KCCs because these are supported by well-described search terms. The method needs to be tested in other frameworks as well to demonstrate its generalizability. https://doi.org/10.1289/EHP9112
format Online
Article
Text
id pubmed-8893280
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Environmental Health Perspectives
record_format MEDLINE/PubMed
spelling pubmed-88932802022-03-07 Automated Network Assembly of Mechanistic Literature for Informed Evidence Identification to Support Cancer Risk Assessment Scholten, Bernice Simón, Laura Guerrero Krishnan, Shaji Vermeulen, Roel Pronk, Anjoeka Gyori, Benjamin M. Bachman, John A. Vlaanderen, Jelle Stierum, Rob Environ Health Perspect Research BACKGROUND: Mechanistic data is increasingly used in hazard identification of chemicals. However, the volume of data is large, challenging the efficient identification and clustering of relevant data. OBJECTIVES: We investigated whether evidence identification for hazard assessment can become more efficient and informed through an automated approach that combines machine reading of publications with network visualization tools. METHODS: We chose 13 chemicals that were evaluated by the International Agency for Research on Cancer (IARC) Monographs program incorporating the key characteristics of carcinogens (KCCs) approach. Using established literature search terms for KCCs, we retrieved and analyzed literature using Integrated Network and Dynamical Reasoning Assembler (INDRA). INDRA combines large-scale literature processing with pathway databases and extracts relationships between biomolecules, bioprocesses, and chemicals into statements (e.g., “benzene activates DNA damage”). These statements were subsequently assembled into networks and compared with the KCC evaluation by the IARC, to evaluate the informativeness of our approach. RESULTS: We found, in general, larger networks for those chemicals which the IARC has evaluated the evidence to be strong for KCC induction. Larger networks were not directly linked to publication count, given that we retrieved small networks for several chemicals with little support for KCC activation according to the IARC, despite the significant volume of literature for these specific chemicals. In addition, interpreting networks for genotoxicity and DNA repair showed concordance with the IARC KCC evaluation. DISCUSSION: Our method is an automated approach to condense mechanistic literature into searchable and interpretable networks based on an a priori ontology. The approach is no replacement of expert evaluation but, instead, provides an informed structure for experts to quickly identify which statements are made in which papers and how these could connect. We focused on the KCCs because these are supported by well-described search terms. The method needs to be tested in other frameworks as well to demonstrate its generalizability. https://doi.org/10.1289/EHP9112 Environmental Health Perspectives 2022-03-03 /pmc/articles/PMC8893280/ /pubmed/35238605 http://dx.doi.org/10.1289/EHP9112 Text en https://ehp.niehs.nih.gov/about-ehp/licenseEHP is an open-access journal published with support from the National Institute of Environmental Health Sciences, National Institutes of Health. All content is public domain unless otherwise noted.
spellingShingle Research
Scholten, Bernice
Simón, Laura Guerrero
Krishnan, Shaji
Vermeulen, Roel
Pronk, Anjoeka
Gyori, Benjamin M.
Bachman, John A.
Vlaanderen, Jelle
Stierum, Rob
Automated Network Assembly of Mechanistic Literature for Informed Evidence Identification to Support Cancer Risk Assessment
title Automated Network Assembly of Mechanistic Literature for Informed Evidence Identification to Support Cancer Risk Assessment
title_full Automated Network Assembly of Mechanistic Literature for Informed Evidence Identification to Support Cancer Risk Assessment
title_fullStr Automated Network Assembly of Mechanistic Literature for Informed Evidence Identification to Support Cancer Risk Assessment
title_full_unstemmed Automated Network Assembly of Mechanistic Literature for Informed Evidence Identification to Support Cancer Risk Assessment
title_short Automated Network Assembly of Mechanistic Literature for Informed Evidence Identification to Support Cancer Risk Assessment
title_sort automated network assembly of mechanistic literature for informed evidence identification to support cancer risk assessment
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8893280/
https://www.ncbi.nlm.nih.gov/pubmed/35238605
http://dx.doi.org/10.1289/EHP9112
work_keys_str_mv AT scholtenbernice automatednetworkassemblyofmechanisticliteratureforinformedevidenceidentificationtosupportcancerriskassessment
AT simonlauraguerrero automatednetworkassemblyofmechanisticliteratureforinformedevidenceidentificationtosupportcancerriskassessment
AT krishnanshaji automatednetworkassemblyofmechanisticliteratureforinformedevidenceidentificationtosupportcancerriskassessment
AT vermeulenroel automatednetworkassemblyofmechanisticliteratureforinformedevidenceidentificationtosupportcancerriskassessment
AT pronkanjoeka automatednetworkassemblyofmechanisticliteratureforinformedevidenceidentificationtosupportcancerriskassessment
AT gyoribenjaminm automatednetworkassemblyofmechanisticliteratureforinformedevidenceidentificationtosupportcancerriskassessment
AT bachmanjohna automatednetworkassemblyofmechanisticliteratureforinformedevidenceidentificationtosupportcancerriskassessment
AT vlaanderenjelle automatednetworkassemblyofmechanisticliteratureforinformedevidenceidentificationtosupportcancerriskassessment
AT stierumrob automatednetworkassemblyofmechanisticliteratureforinformedevidenceidentificationtosupportcancerriskassessment