Cargando…
A New CSRML Structure-Based Fingerprint Method for Profiling and Categorizing Per- and Polyfluoroalkyl Substances (PFAS)
[Image: see text] The term PFAS encompasses diverse per- and polyfluorinated alkyl (and increasingly aromatic) chemicals spanning industrial processes, commercial uses, environmental occurrence, and potential concerns. With increased chemical curation, currently exceeding 14,000 structures in the PF...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Chemical Society
2023
|
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10031568/ https://www.ncbi.nlm.nih.gov/pubmed/36862450 http://dx.doi.org/10.1021/acs.chemrestox.2c00403 |
_version_ | 1784910633172467712 |
---|---|
author | Richard, Ann M. Lougee, Ryan Adams, Matthew Hidle, Hannah Yang, Chihae Rathman, James Magdziarz, Tomasz Bienfait, Bruno Williams, Antony J. Patlewicz, Grace |
author_facet | Richard, Ann M. Lougee, Ryan Adams, Matthew Hidle, Hannah Yang, Chihae Rathman, James Magdziarz, Tomasz Bienfait, Bruno Williams, Antony J. Patlewicz, Grace |
author_sort | Richard, Ann M. |
collection | PubMed |
description | [Image: see text] The term PFAS encompasses diverse per- and polyfluorinated alkyl (and increasingly aromatic) chemicals spanning industrial processes, commercial uses, environmental occurrence, and potential concerns. With increased chemical curation, currently exceeding 14,000 structures in the PFASSTRUCTV5 inventory on EPA’s CompTox Chemicals Dashboard, has come increased motivation to profile, categorize, and analyze the PFAS structure space using modern cheminformatics approaches. Making use of the publicly available ToxPrint chemotypes and ChemoTyper application, we have developed a new PFAS-specific fingerprint set consisting of 129 TxP_PFAS chemotypes coded in CSRML, a chemical-based XML-query language. These are split into two groups, the first containing 56 mostly bond-type ToxPrints modified to incorporate attachment to either a CF group or F atom to enforce proximity to the fluorinated portion of the chemical. This focus resulted in a dramatic reduction in TxP_PFAS chemotype counts relative to the corresponding ToxPrint counts (averaging 54%). The remaining TxP_PFAS chemotypes consist of various lengths and types of fluorinated chains, rings, and bonding patterns covering indications of branching, alternate halogenation, and fluorotelomers. Both groups of chemotypes are well represented across the PFASSTRUCT inventory. Using the ChemoTyper application, we show how the TxP_PFAS chemotypes can be visualized, filtered, and used to profile the PFASSTRUCT inventory, as well as to construct chemically intuitive, structure-based PFAS categories. Lastly, we used a selection of expert-based PFAS categories from the OECD Global PFAS list to evaluate a small set of analogous structure-based TxP_PFAS categories. TxP_PFAS chemotypes were able to recapitulate the expert-based PFAS category concepts based on clearly defined structure rules that can be computationally implemented and reproducibly applied to process large PFAS inventories without need to consult an expert. The TxP_PFAS chemotypes have the potential to support computational modeling, harmonize PFAS structure-based categories, facilitate communication, and allow for more efficient and chemically informed exploration of PFAS chemicals moving forward. |
format | Online Article Text |
id | pubmed-10031568 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | American Chemical Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-100315682023-03-23 A New CSRML Structure-Based Fingerprint Method for Profiling and Categorizing Per- and Polyfluoroalkyl Substances (PFAS) Richard, Ann M. Lougee, Ryan Adams, Matthew Hidle, Hannah Yang, Chihae Rathman, James Magdziarz, Tomasz Bienfait, Bruno Williams, Antony J. Patlewicz, Grace Chem Res Toxicol [Image: see text] The term PFAS encompasses diverse per- and polyfluorinated alkyl (and increasingly aromatic) chemicals spanning industrial processes, commercial uses, environmental occurrence, and potential concerns. With increased chemical curation, currently exceeding 14,000 structures in the PFASSTRUCTV5 inventory on EPA’s CompTox Chemicals Dashboard, has come increased motivation to profile, categorize, and analyze the PFAS structure space using modern cheminformatics approaches. Making use of the publicly available ToxPrint chemotypes and ChemoTyper application, we have developed a new PFAS-specific fingerprint set consisting of 129 TxP_PFAS chemotypes coded in CSRML, a chemical-based XML-query language. These are split into two groups, the first containing 56 mostly bond-type ToxPrints modified to incorporate attachment to either a CF group or F atom to enforce proximity to the fluorinated portion of the chemical. This focus resulted in a dramatic reduction in TxP_PFAS chemotype counts relative to the corresponding ToxPrint counts (averaging 54%). The remaining TxP_PFAS chemotypes consist of various lengths and types of fluorinated chains, rings, and bonding patterns covering indications of branching, alternate halogenation, and fluorotelomers. Both groups of chemotypes are well represented across the PFASSTRUCT inventory. Using the ChemoTyper application, we show how the TxP_PFAS chemotypes can be visualized, filtered, and used to profile the PFASSTRUCT inventory, as well as to construct chemically intuitive, structure-based PFAS categories. Lastly, we used a selection of expert-based PFAS categories from the OECD Global PFAS list to evaluate a small set of analogous structure-based TxP_PFAS categories. TxP_PFAS chemotypes were able to recapitulate the expert-based PFAS category concepts based on clearly defined structure rules that can be computationally implemented and reproducibly applied to process large PFAS inventories without need to consult an expert. The TxP_PFAS chemotypes have the potential to support computational modeling, harmonize PFAS structure-based categories, facilitate communication, and allow for more efficient and chemically informed exploration of PFAS chemicals moving forward. American Chemical Society 2023-03-02 /pmc/articles/PMC10031568/ /pubmed/36862450 http://dx.doi.org/10.1021/acs.chemrestox.2c00403 Text en © 2023 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by-nc-nd/4.0/Permits non-commercial access and re-use, provided that author attribution and integrity are maintained; but does not permit creation of adaptations or other derivative works (https://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Richard, Ann M. Lougee, Ryan Adams, Matthew Hidle, Hannah Yang, Chihae Rathman, James Magdziarz, Tomasz Bienfait, Bruno Williams, Antony J. Patlewicz, Grace A New CSRML Structure-Based Fingerprint Method for Profiling and Categorizing Per- and Polyfluoroalkyl Substances (PFAS) |
title | A New CSRML
Structure-Based Fingerprint Method for
Profiling and Categorizing Per- and Polyfluoroalkyl Substances (PFAS) |
title_full | A New CSRML
Structure-Based Fingerprint Method for
Profiling and Categorizing Per- and Polyfluoroalkyl Substances (PFAS) |
title_fullStr | A New CSRML
Structure-Based Fingerprint Method for
Profiling and Categorizing Per- and Polyfluoroalkyl Substances (PFAS) |
title_full_unstemmed | A New CSRML
Structure-Based Fingerprint Method for
Profiling and Categorizing Per- and Polyfluoroalkyl Substances (PFAS) |
title_short | A New CSRML
Structure-Based Fingerprint Method for
Profiling and Categorizing Per- and Polyfluoroalkyl Substances (PFAS) |
title_sort | new csrml
structure-based fingerprint method for
profiling and categorizing per- and polyfluoroalkyl substances (pfas) |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10031568/ https://www.ncbi.nlm.nih.gov/pubmed/36862450 http://dx.doi.org/10.1021/acs.chemrestox.2c00403 |
work_keys_str_mv | AT richardannm anewcsrmlstructurebasedfingerprintmethodforprofilingandcategorizingperandpolyfluoroalkylsubstancespfas AT lougeeryan anewcsrmlstructurebasedfingerprintmethodforprofilingandcategorizingperandpolyfluoroalkylsubstancespfas AT adamsmatthew anewcsrmlstructurebasedfingerprintmethodforprofilingandcategorizingperandpolyfluoroalkylsubstancespfas AT hidlehannah anewcsrmlstructurebasedfingerprintmethodforprofilingandcategorizingperandpolyfluoroalkylsubstancespfas AT yangchihae anewcsrmlstructurebasedfingerprintmethodforprofilingandcategorizingperandpolyfluoroalkylsubstancespfas AT rathmanjames anewcsrmlstructurebasedfingerprintmethodforprofilingandcategorizingperandpolyfluoroalkylsubstancespfas AT magdziarztomasz anewcsrmlstructurebasedfingerprintmethodforprofilingandcategorizingperandpolyfluoroalkylsubstancespfas AT bienfaitbruno anewcsrmlstructurebasedfingerprintmethodforprofilingandcategorizingperandpolyfluoroalkylsubstancespfas AT williamsantonyj anewcsrmlstructurebasedfingerprintmethodforprofilingandcategorizingperandpolyfluoroalkylsubstancespfas AT patlewiczgrace anewcsrmlstructurebasedfingerprintmethodforprofilingandcategorizingperandpolyfluoroalkylsubstancespfas AT richardannm newcsrmlstructurebasedfingerprintmethodforprofilingandcategorizingperandpolyfluoroalkylsubstancespfas AT lougeeryan newcsrmlstructurebasedfingerprintmethodforprofilingandcategorizingperandpolyfluoroalkylsubstancespfas AT adamsmatthew newcsrmlstructurebasedfingerprintmethodforprofilingandcategorizingperandpolyfluoroalkylsubstancespfas AT hidlehannah newcsrmlstructurebasedfingerprintmethodforprofilingandcategorizingperandpolyfluoroalkylsubstancespfas AT yangchihae newcsrmlstructurebasedfingerprintmethodforprofilingandcategorizingperandpolyfluoroalkylsubstancespfas AT rathmanjames newcsrmlstructurebasedfingerprintmethodforprofilingandcategorizingperandpolyfluoroalkylsubstancespfas AT magdziarztomasz newcsrmlstructurebasedfingerprintmethodforprofilingandcategorizingperandpolyfluoroalkylsubstancespfas AT bienfaitbruno newcsrmlstructurebasedfingerprintmethodforprofilingandcategorizingperandpolyfluoroalkylsubstancespfas AT williamsantonyj newcsrmlstructurebasedfingerprintmethodforprofilingandcategorizingperandpolyfluoroalkylsubstancespfas AT patlewiczgrace newcsrmlstructurebasedfingerprintmethodforprofilingandcategorizingperandpolyfluoroalkylsubstancespfas |