Cargando…

Data-Driven Quantitative Structure–Activity Relationship Modeling for Human Carcinogenicity by Chronic Oral Exposure

[Image: see text] Traditional methodologies for assessing chemical toxicity are expensive and time-consuming. Computational modeling approaches have emerged as low-cost alternatives, especially those used to develop quantitative structure–activity relationship (QSAR) models. However, conventional QS...

Descripción completa

Detalles Bibliográficos
Autores principales: Chung, Elena, Russo, Daniel P., Ciallella, Heather L., Wang, Yu-Tang, Wu, Min, Aleksunes, Lauren M., Zhu, Hao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2023
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10134506/
https://www.ncbi.nlm.nih.gov/pubmed/37040559
http://dx.doi.org/10.1021/acs.est.3c00648
_version_ 1785031778265726976
author Chung, Elena
Russo, Daniel P.
Ciallella, Heather L.
Wang, Yu-Tang
Wu, Min
Aleksunes, Lauren M.
Zhu, Hao
author_facet Chung, Elena
Russo, Daniel P.
Ciallella, Heather L.
Wang, Yu-Tang
Wu, Min
Aleksunes, Lauren M.
Zhu, Hao
author_sort Chung, Elena
collection PubMed
description [Image: see text] Traditional methodologies for assessing chemical toxicity are expensive and time-consuming. Computational modeling approaches have emerged as low-cost alternatives, especially those used to develop quantitative structure–activity relationship (QSAR) models. However, conventional QSAR models have limited training data, leading to low predictivity for new compounds. We developed a data-driven modeling approach for constructing carcinogenicity-related models and used these models to identify potential new human carcinogens. To this goal, we used a probe carcinogen dataset from the US Environmental Protection Agency’s Integrated Risk Information System (IRIS) to identify relevant PubChem bioassays. Responses of 25 PubChem assays were significantly relevant to carcinogenicity. Eight assays inferred carcinogenicity predictivity and were selected for QSAR model training. Using 5 machine learning algorithms and 3 types of chemical fingerprints, 15 QSAR models were developed for each PubChem assay dataset. These models showed acceptable predictivity during 5-fold cross-validation (average CCR = 0.71). Using our QSAR models, we can correctly predict and rank 342 IRIS compounds’ carcinogenic potentials (PPV = 0.72). The models predicted potential new carcinogens, which were validated by a literature search. This study portends an automated technique that can be applied to prioritize potential toxicants using validated QSAR models based on extensive training sets from public data resources.
format Online
Article
Text
id pubmed-10134506
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-101345062023-04-28 Data-Driven Quantitative Structure–Activity Relationship Modeling for Human Carcinogenicity by Chronic Oral Exposure Chung, Elena Russo, Daniel P. Ciallella, Heather L. Wang, Yu-Tang Wu, Min Aleksunes, Lauren M. Zhu, Hao Environ Sci Technol [Image: see text] Traditional methodologies for assessing chemical toxicity are expensive and time-consuming. Computational modeling approaches have emerged as low-cost alternatives, especially those used to develop quantitative structure–activity relationship (QSAR) models. However, conventional QSAR models have limited training data, leading to low predictivity for new compounds. We developed a data-driven modeling approach for constructing carcinogenicity-related models and used these models to identify potential new human carcinogens. To this goal, we used a probe carcinogen dataset from the US Environmental Protection Agency’s Integrated Risk Information System (IRIS) to identify relevant PubChem bioassays. Responses of 25 PubChem assays were significantly relevant to carcinogenicity. Eight assays inferred carcinogenicity predictivity and were selected for QSAR model training. Using 5 machine learning algorithms and 3 types of chemical fingerprints, 15 QSAR models were developed for each PubChem assay dataset. These models showed acceptable predictivity during 5-fold cross-validation (average CCR = 0.71). Using our QSAR models, we can correctly predict and rank 342 IRIS compounds’ carcinogenic potentials (PPV = 0.72). The models predicted potential new carcinogens, which were validated by a literature search. This study portends an automated technique that can be applied to prioritize potential toxicants using validated QSAR models based on extensive training sets from public data resources. American Chemical Society 2023-04-11 /pmc/articles/PMC10134506/ /pubmed/37040559 http://dx.doi.org/10.1021/acs.est.3c00648 Text en © 2023 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Chung, Elena
Russo, Daniel P.
Ciallella, Heather L.
Wang, Yu-Tang
Wu, Min
Aleksunes, Lauren M.
Zhu, Hao
Data-Driven Quantitative Structure–Activity Relationship Modeling for Human Carcinogenicity by Chronic Oral Exposure
title Data-Driven Quantitative Structure–Activity Relationship Modeling for Human Carcinogenicity by Chronic Oral Exposure
title_full Data-Driven Quantitative Structure–Activity Relationship Modeling for Human Carcinogenicity by Chronic Oral Exposure
title_fullStr Data-Driven Quantitative Structure–Activity Relationship Modeling for Human Carcinogenicity by Chronic Oral Exposure
title_full_unstemmed Data-Driven Quantitative Structure–Activity Relationship Modeling for Human Carcinogenicity by Chronic Oral Exposure
title_short Data-Driven Quantitative Structure–Activity Relationship Modeling for Human Carcinogenicity by Chronic Oral Exposure
title_sort data-driven quantitative structure–activity relationship modeling for human carcinogenicity by chronic oral exposure
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10134506/
https://www.ncbi.nlm.nih.gov/pubmed/37040559
http://dx.doi.org/10.1021/acs.est.3c00648
work_keys_str_mv AT chungelena datadrivenquantitativestructureactivityrelationshipmodelingforhumancarcinogenicitybychronicoralexposure
AT russodanielp datadrivenquantitativestructureactivityrelationshipmodelingforhumancarcinogenicitybychronicoralexposure
AT ciallellaheatherl datadrivenquantitativestructureactivityrelationshipmodelingforhumancarcinogenicitybychronicoralexposure
AT wangyutang datadrivenquantitativestructureactivityrelationshipmodelingforhumancarcinogenicitybychronicoralexposure
AT wumin datadrivenquantitativestructureactivityrelationshipmodelingforhumancarcinogenicitybychronicoralexposure
AT aleksuneslaurenm datadrivenquantitativestructureactivityrelationshipmodelingforhumancarcinogenicitybychronicoralexposure
AT zhuhao datadrivenquantitativestructureactivityrelationshipmodelingforhumancarcinogenicitybychronicoralexposure