
A New CSRML Structure-Based Fingerprint Method for Profiling and Categorizing Per- and Polyfluoroalkyl Substances (PFAS)

The term PFAS encompasses diverse per- and polyfluorinated alkyl (and increasingly aromatic) chemicals spanning industrial processes, commercial uses, environmental occurrence, and potential concerns. With increased chemical curation, currently exceeding 14,000 structures in the PF...


Bibliographic Details
Main authors: Richard, Ann M., Lougee, Ryan, Adams, Matthew, Hidle, Hannah, Yang, Chihae, Rathman, James, Magdziarz, Tomasz, Bienfait, Bruno, Williams, Antony J., Patlewicz, Grace
Format: Online Article Text
Language: English
Published: American Chemical Society 2023
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10031568/
https://www.ncbi.nlm.nih.gov/pubmed/36862450
http://dx.doi.org/10.1021/acs.chemrestox.2c00403
collection PubMed
description The term PFAS encompasses diverse per- and polyfluorinated alkyl (and increasingly aromatic) chemicals spanning industrial processes, commercial uses, environmental occurrence, and potential concerns. With increased chemical curation, currently exceeding 14,000 structures in the PFASSTRUCTV5 inventory on EPA’s CompTox Chemicals Dashboard, has come increased motivation to profile, categorize, and analyze the PFAS structure space using modern cheminformatics approaches. Making use of the publicly available ToxPrint chemotypes and ChemoTyper application, we have developed a new PFAS-specific fingerprint set consisting of 129 TxP_PFAS chemotypes coded in CSRML, a chemical-based XML-query language. These are split into two groups, the first containing 56 mostly bond-type ToxPrints modified to incorporate attachment to either a CF group or F atom to enforce proximity to the fluorinated portion of the chemical. This focus resulted in a dramatic reduction in TxP_PFAS chemotype counts relative to the corresponding ToxPrint counts (averaging 54%). The remaining TxP_PFAS chemotypes consist of various lengths and types of fluorinated chains, rings, and bonding patterns covering indications of branching, alternate halogenation, and fluorotelomers. Both groups of chemotypes are well represented across the PFASSTRUCT inventory. Using the ChemoTyper application, we show how the TxP_PFAS chemotypes can be visualized, filtered, and used to profile the PFASSTRUCT inventory, as well as to construct chemically intuitive, structure-based PFAS categories. Lastly, we used a selection of expert-based PFAS categories from the OECD Global PFAS list to evaluate a small set of analogous structure-based TxP_PFAS categories. TxP_PFAS chemotypes were able to recapitulate the expert-based PFAS category concepts based on clearly defined structure rules that can be computationally implemented and reproducibly applied to process large PFAS inventories without need to consult an expert. The TxP_PFAS chemotypes have the potential to support computational modeling, harmonize PFAS structure-based categories, facilitate communication, and allow for more efficient and chemically informed exploration of PFAS chemicals moving forward.
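Illustrative note: the abstract's idea of restricting a generic chemotype to matches attached to a CF group can be sketched roughly as below. This is a toy, not CSRML and not an actual TxP_PFAS definition: the molecule encoding (a list of element symbols plus index pairs for bonds, hydrogens implicit, bond orders ignored), the example molecule, and all function names are assumptions made up for this example.

```python
from collections import defaultdict

def build_adjacency(bonds):
    """Map each atom index to the set of atom indices it is bonded to."""
    adjacency = defaultdict(set)
    for a, b in bonds:
        adjacency[a].add(b)
        adjacency[b].add(a)
    return adjacency

def is_cf_carbon(index, atoms, adjacency):
    """True if the atom is a carbon bearing at least one fluorine."""
    return atoms[index] == "C" and any(atoms[n] == "F" for n in adjacency[index])

def ether_oxygens(atoms, adjacency):
    """Generic feature: oxygen bonded to exactly two carbons (C-O-C).

    Bond orders are ignored in this toy, so this is only a rough ether test.
    """
    return [
        i for i, element in enumerate(atoms)
        if element == "O"
        and len(adjacency[i]) == 2
        and all(atoms[n] == "C" for n in adjacency[i])
    ]

def cf_attached_ether_oxygens(atoms, adjacency):
    """Restricted feature: C-O-C where at least one carbon carries fluorine."""
    return [
        i for i in ether_oxygens(atoms, adjacency)
        if any(is_cf_carbon(n, atoms, adjacency) for n in adjacency[i])
    ]

# Hypothetical example molecule: CF3-CF2-O-CH2-CH2-O-CH3 (hydrogens omitted).
atoms = ["C", "F", "F", "F", "C", "F", "F", "O", "C", "C", "O", "C"]
bonds = [(0, 1), (0, 2), (0, 3), (0, 4), (4, 5), (4, 6),
         (4, 7), (7, 8), (8, 9), (9, 10), (10, 11)]

adjacency = build_adjacency(bonds)
generic_count = len(ether_oxygens(atoms, adjacency))
pfas_count = len(cf_attached_ether_oxygens(atoms, adjacency))
print(generic_count, pfas_count)
```

In this toy molecule both oxygens match the generic pattern, but only the oxygen next to the CF2 carbon matches the restricted one, which mirrors in miniature why the restricted counts reported in the abstract are lower than the corresponding generic counts.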
format Online
Article
Text
id pubmed-10031568
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-10031568 2023-03-23 A New CSRML Structure-Based Fingerprint Method for Profiling and Categorizing Per- and Polyfluoroalkyl Substances (PFAS) Richard, Ann M. Lougee, Ryan Adams, Matthew Hidle, Hannah Yang, Chihae Rathman, James Magdziarz, Tomasz Bienfait, Bruno Williams, Antony J. Patlewicz, Grace Chem Res Toxicol American Chemical Society 2023-03-02 /pmc/articles/PMC10031568/ /pubmed/36862450 http://dx.doi.org/10.1021/acs.chemrestox.2c00403 Text en © 2023 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by-nc-nd/4.0/ Permits non-commercial access and re-use, provided that author attribution and integrity are maintained; but does not permit creation of adaptations or other derivative works (https://creativecommons.org/licenses/by-nc-nd/4.0/).