Cargando…

The Role of Structural Representation in the Performance of a Deep Neural Network for X-ray Spectroscopy

An important consideration when developing a deep neural network (DNN) for the prediction of molecular properties is the representation of the chemical space. Herein we explore the effect of the representation on the performance of our DNN engineered to predict Fe K-edge X-ray absorption near-edge s...

Descripción completa

Detalles Bibliográficos
Autores principales: Madkhali, Marwah M.M., Rankine, Conor D., Penfold, Thomas J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7321082/
https://www.ncbi.nlm.nih.gov/pubmed/32545393
http://dx.doi.org/10.3390/molecules25112715
_version_ 1783551382568042496
author Madkhali, Marwah M.M.
Rankine, Conor D.
Penfold, Thomas J.
author_facet Madkhali, Marwah M.M.
Rankine, Conor D.
Penfold, Thomas J.
author_sort Madkhali, Marwah M.M.
collection PubMed
description An important consideration when developing a deep neural network (DNN) for the prediction of molecular properties is the representation of the chemical space. Herein we explore the effect of the representation on the performance of our DNN engineered to predict Fe K-edge X-ray absorption near-edge structure (XANES) spectra, and address the question: How important is the choice of representation for the local environment around an arbitrary Fe absorption site? Using two popular representations of chemical space—the Coulomb matrix (CM) and pair-distribution/radial distribution curve (RDC)—we investigate the effect that the choice of representation has on the performance of our DNN. While CM and RDC featurisation are demonstrably robust descriptors, it is possible to obtain a smaller mean squared error (MSE) between the target and estimated XANES spectra when using RDC featurisation, and converge to this state a) faster and b) using fewer data samples. This is advantageous for future extension of our DNN to other X-ray absorption edges, and for reoptimisation of our DNN to reproduce results from higher levels of theory. In the latter case, dataset sizes will be limited more strongly by the resource-intensive nature of the underlying theoretical calculations.
format Online
Article
Text
id pubmed-7321082
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-73210822020-07-06 The Role of Structural Representation in the Performance of a Deep Neural Network for X-ray Spectroscopy Madkhali, Marwah M.M. Rankine, Conor D. Penfold, Thomas J. Molecules Article An important consideration when developing a deep neural network (DNN) for the prediction of molecular properties is the representation of the chemical space. Herein we explore the effect of the representation on the performance of our DNN engineered to predict Fe K-edge X-ray absorption near-edge structure (XANES) spectra, and address the question: How important is the choice of representation for the local environment around an arbitrary Fe absorption site? Using two popular representations of chemical space—the Coulomb matrix (CM) and pair-distribution/radial distribution curve (RDC)—we investigate the effect that the choice of representation has on the performance of our DNN. While CM and RDC featurisation are demonstrably robust descriptors, it is possible to obtain a smaller mean squared error (MSE) between the target and estimated XANES spectra when using RDC featurisation, and converge to this state a) faster and b) using fewer data samples. This is advantageous for future extension of our DNN to other X-ray absorption edges, and for reoptimisation of our DNN to reproduce results from higher levels of theory. In the latter case, dataset sizes will be limited more strongly by the resource-intensive nature of the underlying theoretical calculations. MDPI 2020-06-11 /pmc/articles/PMC7321082/ /pubmed/32545393 http://dx.doi.org/10.3390/molecules25112715 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Madkhali, Marwah M.M.
Rankine, Conor D.
Penfold, Thomas J.
The Role of Structural Representation in the Performance of a Deep Neural Network for X-ray Spectroscopy
title The Role of Structural Representation in the Performance of a Deep Neural Network for X-ray Spectroscopy
title_full The Role of Structural Representation in the Performance of a Deep Neural Network for X-ray Spectroscopy
title_fullStr The Role of Structural Representation in the Performance of a Deep Neural Network for X-ray Spectroscopy
title_full_unstemmed The Role of Structural Representation in the Performance of a Deep Neural Network for X-ray Spectroscopy
title_short The Role of Structural Representation in the Performance of a Deep Neural Network for X-ray Spectroscopy
title_sort role of structural representation in the performance of a deep neural network for x-ray spectroscopy
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7321082/
https://www.ncbi.nlm.nih.gov/pubmed/32545393
http://dx.doi.org/10.3390/molecules25112715
work_keys_str_mv AT madkhalimarwahmm theroleofstructuralrepresentationintheperformanceofadeepneuralnetworkforxrayspectroscopy
AT rankineconord theroleofstructuralrepresentationintheperformanceofadeepneuralnetworkforxrayspectroscopy
AT penfoldthomasj theroleofstructuralrepresentationintheperformanceofadeepneuralnetworkforxrayspectroscopy
AT madkhalimarwahmm roleofstructuralrepresentationintheperformanceofadeepneuralnetworkforxrayspectroscopy
AT rankineconord roleofstructuralrepresentationintheperformanceofadeepneuralnetworkforxrayspectroscopy
AT penfoldthomasj roleofstructuralrepresentationintheperformanceofadeepneuralnetworkforxrayspectroscopy