Cargando…
The Role of Structural Representation in the Performance of a Deep Neural Network for X-ray Spectroscopy
An important consideration when developing a deep neural network (DNN) for the prediction of molecular properties is the representation of the chemical space. Herein we explore the effect of the representation on the performance of our DNN engineered to predict Fe K-edge X-ray absorption near-edge s...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7321082/ https://www.ncbi.nlm.nih.gov/pubmed/32545393 http://dx.doi.org/10.3390/molecules25112715 |
_version_ | 1783551382568042496 |
---|---|
author | Madkhali, Marwah M.M. Rankine, Conor D. Penfold, Thomas J. |
author_facet | Madkhali, Marwah M.M. Rankine, Conor D. Penfold, Thomas J. |
author_sort | Madkhali, Marwah M.M. |
collection | PubMed |
description | An important consideration when developing a deep neural network (DNN) for the prediction of molecular properties is the representation of the chemical space. Herein we explore the effect of the representation on the performance of our DNN engineered to predict Fe K-edge X-ray absorption near-edge structure (XANES) spectra, and address the question: How important is the choice of representation for the local environment around an arbitrary Fe absorption site? Using two popular representations of chemical space—the Coulomb matrix (CM) and pair-distribution/radial distribution curve (RDC)—we investigate the effect that the choice of representation has on the performance of our DNN. While CM and RDC featurisation are demonstrably robust descriptors, it is possible to obtain a smaller mean squared error (MSE) between the target and estimated XANES spectra when using RDC featurisation, and converge to this state a) faster and b) using fewer data samples. This is advantageous for future extension of our DNN to other X-ray absorption edges, and for reoptimisation of our DNN to reproduce results from higher levels of theory. In the latter case, dataset sizes will be limited more strongly by the resource-intensive nature of the underlying theoretical calculations. |
format | Online Article Text |
id | pubmed-7321082 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-73210822020-07-06 The Role of Structural Representation in the Performance of a Deep Neural Network for X-ray Spectroscopy Madkhali, Marwah M.M. Rankine, Conor D. Penfold, Thomas J. Molecules Article An important consideration when developing a deep neural network (DNN) for the prediction of molecular properties is the representation of the chemical space. Herein we explore the effect of the representation on the performance of our DNN engineered to predict Fe K-edge X-ray absorption near-edge structure (XANES) spectra, and address the question: How important is the choice of representation for the local environment around an arbitrary Fe absorption site? Using two popular representations of chemical space—the Coulomb matrix (CM) and pair-distribution/radial distribution curve (RDC)—we investigate the effect that the choice of representation has on the performance of our DNN. While CM and RDC featurisation are demonstrably robust descriptors, it is possible to obtain a smaller mean squared error (MSE) between the target and estimated XANES spectra when using RDC featurisation, and converge to this state a) faster and b) using fewer data samples. This is advantageous for future extension of our DNN to other X-ray absorption edges, and for reoptimisation of our DNN to reproduce results from higher levels of theory. In the latter case, dataset sizes will be limited more strongly by the resource-intensive nature of the underlying theoretical calculations. MDPI 2020-06-11 /pmc/articles/PMC7321082/ /pubmed/32545393 http://dx.doi.org/10.3390/molecules25112715 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Madkhali, Marwah M.M. Rankine, Conor D. Penfold, Thomas J. The Role of Structural Representation in the Performance of a Deep Neural Network for X-ray Spectroscopy |
title | The Role of Structural Representation in the Performance of a Deep Neural Network for X-ray Spectroscopy |
title_full | The Role of Structural Representation in the Performance of a Deep Neural Network for X-ray Spectroscopy |
title_fullStr | The Role of Structural Representation in the Performance of a Deep Neural Network for X-ray Spectroscopy |
title_full_unstemmed | The Role of Structural Representation in the Performance of a Deep Neural Network for X-ray Spectroscopy |
title_short | The Role of Structural Representation in the Performance of a Deep Neural Network for X-ray Spectroscopy |
title_sort | role of structural representation in the performance of a deep neural network for x-ray spectroscopy |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7321082/ https://www.ncbi.nlm.nih.gov/pubmed/32545393 http://dx.doi.org/10.3390/molecules25112715 |
work_keys_str_mv | AT madkhalimarwahmm theroleofstructuralrepresentationintheperformanceofadeepneuralnetworkforxrayspectroscopy AT rankineconord theroleofstructuralrepresentationintheperformanceofadeepneuralnetworkforxrayspectroscopy AT penfoldthomasj theroleofstructuralrepresentationintheperformanceofadeepneuralnetworkforxrayspectroscopy AT madkhalimarwahmm roleofstructuralrepresentationintheperformanceofadeepneuralnetworkforxrayspectroscopy AT rankineconord roleofstructuralrepresentationintheperformanceofadeepneuralnetworkforxrayspectroscopy AT penfoldthomasj roleofstructuralrepresentationintheperformanceofadeepneuralnetworkforxrayspectroscopy |