Cargando…

BESTox: A Convolutional Neural Network Regression Model Based on Binary-Encoded SMILES for Acute Oral Toxicity Prediction of Chemical Compounds

Compound toxicity prediction is a very challenging and critical task in the drug discovery and design field. Traditionally, cell or animal-based experiments are required to confirm the acute oral toxicity of chemical compounds. However, these methods are often restricted by availability of experimen...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Jiarui, Cheong, Hong-Hin, Siu, Shirley Weng In
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7197065/
http://dx.doi.org/10.1007/978-3-030-42266-0_12
_version_ 1783528810313940992
author Chen, Jiarui
Cheong, Hong-Hin
Siu, Shirley Weng In
author_facet Chen, Jiarui
Cheong, Hong-Hin
Siu, Shirley Weng In
author_sort Chen, Jiarui
collection PubMed
description Compound toxicity prediction is a very challenging and critical task in the drug discovery and design field. Traditionally, cell or animal-based experiments are required to confirm the acute oral toxicity of chemical compounds. However, these methods are often restricted by availability of experimental facilities, long experimentation time, and high cost. In this paper, we propose a novel convolutional neural network regression model, named BESTox, to predict the acute oral toxicity ([Formula: see text]) of chemical compounds. This model learns the compositional and chemical properties of compounds from their two-dimensional binary matrices. Each matrix encodes the occurrences of certain atom types, number of bonded hydrogens, atom charge, valence, ring, degree, aromaticity, chirality, and hybridization along the SMILES string of a given compound. In a benchmark experiment using a dataset of 7413 observations (train/test 5931/1482), BESTox achieved a squared correlation coefficient ([Formula: see text]) of 0.619, root-mean-squared error (RMSE) of 0.603, and mean absolute error (MAE) of 0.433. Despite of the use of a shallow model architecture and simple molecular descriptors, our method performs comparably against two recently published models.
format Online
Article
Text
id pubmed-7197065
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-71970652020-05-04 BESTox: A Convolutional Neural Network Regression Model Based on Binary-Encoded SMILES for Acute Oral Toxicity Prediction of Chemical Compounds Chen, Jiarui Cheong, Hong-Hin Siu, Shirley Weng In Algorithms for Computational Biology Article Compound toxicity prediction is a very challenging and critical task in the drug discovery and design field. Traditionally, cell or animal-based experiments are required to confirm the acute oral toxicity of chemical compounds. However, these methods are often restricted by availability of experimental facilities, long experimentation time, and high cost. In this paper, we propose a novel convolutional neural network regression model, named BESTox, to predict the acute oral toxicity ([Formula: see text]) of chemical compounds. This model learns the compositional and chemical properties of compounds from their two-dimensional binary matrices. Each matrix encodes the occurrences of certain atom types, number of bonded hydrogens, atom charge, valence, ring, degree, aromaticity, chirality, and hybridization along the SMILES string of a given compound. In a benchmark experiment using a dataset of 7413 observations (train/test 5931/1482), BESTox achieved a squared correlation coefficient ([Formula: see text]) of 0.619, root-mean-squared error (RMSE) of 0.603, and mean absolute error (MAE) of 0.433. Despite of the use of a shallow model architecture and simple molecular descriptors, our method performs comparably against two recently published models. 2020-02-01 /pmc/articles/PMC7197065/ http://dx.doi.org/10.1007/978-3-030-42266-0_12 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Chen, Jiarui
Cheong, Hong-Hin
Siu, Shirley Weng In
BESTox: A Convolutional Neural Network Regression Model Based on Binary-Encoded SMILES for Acute Oral Toxicity Prediction of Chemical Compounds
title BESTox: A Convolutional Neural Network Regression Model Based on Binary-Encoded SMILES for Acute Oral Toxicity Prediction of Chemical Compounds
title_full BESTox: A Convolutional Neural Network Regression Model Based on Binary-Encoded SMILES for Acute Oral Toxicity Prediction of Chemical Compounds
title_fullStr BESTox: A Convolutional Neural Network Regression Model Based on Binary-Encoded SMILES for Acute Oral Toxicity Prediction of Chemical Compounds
title_full_unstemmed BESTox: A Convolutional Neural Network Regression Model Based on Binary-Encoded SMILES for Acute Oral Toxicity Prediction of Chemical Compounds
title_short BESTox: A Convolutional Neural Network Regression Model Based on Binary-Encoded SMILES for Acute Oral Toxicity Prediction of Chemical Compounds
title_sort bestox: a convolutional neural network regression model based on binary-encoded smiles for acute oral toxicity prediction of chemical compounds
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7197065/
http://dx.doi.org/10.1007/978-3-030-42266-0_12
work_keys_str_mv AT chenjiarui bestoxaconvolutionalneuralnetworkregressionmodelbasedonbinaryencodedsmilesforacuteoraltoxicitypredictionofchemicalcompounds
AT cheonghonghin bestoxaconvolutionalneuralnetworkregressionmodelbasedonbinaryencodedsmilesforacuteoraltoxicitypredictionofchemicalcompounds
AT siushirleywengin bestoxaconvolutionalneuralnetworkregressionmodelbasedonbinaryencodedsmilesforacuteoraltoxicitypredictionofchemicalcompounds