Cargando…

Feature Reduction for Molecular Similarity Searching Based on Autoencoder Deep Learning

The concept of molecular similarity has been commonly used in rational drug design, where structurally similar molecules are examined in molecular databases to retrieve functionally similar molecules. The most used conventional similarity methods used two-dimensional (2D) fingerprints to evaluate th...

Descripción completa

Detalles Bibliográficos
Autores principales: Nasser, Maged, Salim, Naomie, Saeed, Faisal, Basurra, Shadi, Rabiu, Idris, Hamza, Hentabli, Alsoufi, Muaadh A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9029813/
https://www.ncbi.nlm.nih.gov/pubmed/35454097
http://dx.doi.org/10.3390/biom12040508
_version_ 1784691991657840640
author Nasser, Maged
Salim, Naomie
Saeed, Faisal
Basurra, Shadi
Rabiu, Idris
Hamza, Hentabli
Alsoufi, Muaadh A.
author_facet Nasser, Maged
Salim, Naomie
Saeed, Faisal
Basurra, Shadi
Rabiu, Idris
Hamza, Hentabli
Alsoufi, Muaadh A.
author_sort Nasser, Maged
collection PubMed
description The concept of molecular similarity has been commonly used in rational drug design, where structurally similar molecules are examined in molecular databases to retrieve functionally similar molecules. The most used conventional similarity methods used two-dimensional (2D) fingerprints to evaluate the similarity of molecules towards a target query. However, these descriptors include redundant and irrelevant features that might impact the performance of similarity searching methods. Thus, this study proposed a new approach for identifying the important features of molecules in chemical datasets based on the representation of the molecular features using Autoencoder (AE), with the aim of removing irrelevant and redundant features. The proposed approach experimented using the MDL Data Drug Report standard dataset (MDDR). Based on experimental findings, the proposed approach performed better than several existing benchmark similarity methods such as Tanimoto Similarity Method (TAN), Adapted Similarity Measure of Text Processing (ASMTP), and Quantum-Based Similarity Method (SQB). The results demonstrated that the performance achieved by the proposed approach has proven to be superior, particularly with the use of structurally heterogeneous datasets, where it yielded improved results compared to other previously used methods with the similar goal of improving molecular similarity searching.
format Online
Article
Text
id pubmed-9029813
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-90298132022-04-23 Feature Reduction for Molecular Similarity Searching Based on Autoencoder Deep Learning Nasser, Maged Salim, Naomie Saeed, Faisal Basurra, Shadi Rabiu, Idris Hamza, Hentabli Alsoufi, Muaadh A. Biomolecules Article The concept of molecular similarity has been commonly used in rational drug design, where structurally similar molecules are examined in molecular databases to retrieve functionally similar molecules. The most used conventional similarity methods used two-dimensional (2D) fingerprints to evaluate the similarity of molecules towards a target query. However, these descriptors include redundant and irrelevant features that might impact the performance of similarity searching methods. Thus, this study proposed a new approach for identifying the important features of molecules in chemical datasets based on the representation of the molecular features using Autoencoder (AE), with the aim of removing irrelevant and redundant features. The proposed approach experimented using the MDL Data Drug Report standard dataset (MDDR). Based on experimental findings, the proposed approach performed better than several existing benchmark similarity methods such as Tanimoto Similarity Method (TAN), Adapted Similarity Measure of Text Processing (ASMTP), and Quantum-Based Similarity Method (SQB). The results demonstrated that the performance achieved by the proposed approach has proven to be superior, particularly with the use of structurally heterogeneous datasets, where it yielded improved results compared to other previously used methods with the similar goal of improving molecular similarity searching. MDPI 2022-03-27 /pmc/articles/PMC9029813/ /pubmed/35454097 http://dx.doi.org/10.3390/biom12040508 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Nasser, Maged
Salim, Naomie
Saeed, Faisal
Basurra, Shadi
Rabiu, Idris
Hamza, Hentabli
Alsoufi, Muaadh A.
Feature Reduction for Molecular Similarity Searching Based on Autoencoder Deep Learning
title Feature Reduction for Molecular Similarity Searching Based on Autoencoder Deep Learning
title_full Feature Reduction for Molecular Similarity Searching Based on Autoencoder Deep Learning
title_fullStr Feature Reduction for Molecular Similarity Searching Based on Autoencoder Deep Learning
title_full_unstemmed Feature Reduction for Molecular Similarity Searching Based on Autoencoder Deep Learning
title_short Feature Reduction for Molecular Similarity Searching Based on Autoencoder Deep Learning
title_sort feature reduction for molecular similarity searching based on autoencoder deep learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9029813/
https://www.ncbi.nlm.nih.gov/pubmed/35454097
http://dx.doi.org/10.3390/biom12040508
work_keys_str_mv AT nassermaged featurereductionformolecularsimilaritysearchingbasedonautoencoderdeeplearning
AT salimnaomie featurereductionformolecularsimilaritysearchingbasedonautoencoderdeeplearning
AT saeedfaisal featurereductionformolecularsimilaritysearchingbasedonautoencoderdeeplearning
AT basurrashadi featurereductionformolecularsimilaritysearchingbasedonautoencoderdeeplearning
AT rabiuidris featurereductionformolecularsimilaritysearchingbasedonautoencoderdeeplearning
AT hamzahentabli featurereductionformolecularsimilaritysearchingbasedonautoencoderdeeplearning
AT alsoufimuaadha featurereductionformolecularsimilaritysearchingbasedonautoencoderdeeplearning