Cargando…

Recent Advances in Machine-Learning-Based Chemoinformatics: A Comprehensive Review

In modern drug discovery, the combination of chemoinformatics and quantitative structure–activity relationship (QSAR) modeling has emerged as a formidable alliance, enabling researchers to harness the vast potential of machine learning (ML) techniques for predictive molecular design and analysis. Th...

Descripción completa

Detalles Bibliográficos
Autores principales: Niazi, Sarfaraz K., Mariam, Zamara
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10380192/
https://www.ncbi.nlm.nih.gov/pubmed/37511247
http://dx.doi.org/10.3390/ijms241411488
_version_ 1785080125420732416
author Niazi, Sarfaraz K.
Mariam, Zamara
author_facet Niazi, Sarfaraz K.
Mariam, Zamara
author_sort Niazi, Sarfaraz K.
collection PubMed
description In modern drug discovery, the combination of chemoinformatics and quantitative structure–activity relationship (QSAR) modeling has emerged as a formidable alliance, enabling researchers to harness the vast potential of machine learning (ML) techniques for predictive molecular design and analysis. This review delves into the fundamental aspects of chemoinformatics, elucidating the intricate nature of chemical data and the crucial role of molecular descriptors in unveiling the underlying molecular properties. Molecular descriptors, including 2D fingerprints and topological indices, in conjunction with the structure–activity relationships (SARs), are pivotal in unlocking the pathway to small-molecule drug discovery. Technical intricacies of developing robust ML-QSAR models, including feature selection, model validation, and performance evaluation, are discussed herewith. Various ML algorithms, such as regression analysis and support vector machines, are showcased in the text for their ability to predict and comprehend the relationships between molecular structures and biological activities. This review serves as a comprehensive guide for researchers, providing an understanding of the synergy between chemoinformatics, QSAR, and ML. Due to embracing these cutting-edge technologies, predictive molecular analysis holds promise for expediting the discovery of novel therapeutic agents in the pharmaceutical sciences.
format Online
Article
Text
id pubmed-10380192
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-103801922023-07-29 Recent Advances in Machine-Learning-Based Chemoinformatics: A Comprehensive Review Niazi, Sarfaraz K. Mariam, Zamara Int J Mol Sci Review In modern drug discovery, the combination of chemoinformatics and quantitative structure–activity relationship (QSAR) modeling has emerged as a formidable alliance, enabling researchers to harness the vast potential of machine learning (ML) techniques for predictive molecular design and analysis. This review delves into the fundamental aspects of chemoinformatics, elucidating the intricate nature of chemical data and the crucial role of molecular descriptors in unveiling the underlying molecular properties. Molecular descriptors, including 2D fingerprints and topological indices, in conjunction with the structure–activity relationships (SARs), are pivotal in unlocking the pathway to small-molecule drug discovery. Technical intricacies of developing robust ML-QSAR models, including feature selection, model validation, and performance evaluation, are discussed herewith. Various ML algorithms, such as regression analysis and support vector machines, are showcased in the text for their ability to predict and comprehend the relationships between molecular structures and biological activities. This review serves as a comprehensive guide for researchers, providing an understanding of the synergy between chemoinformatics, QSAR, and ML. Due to embracing these cutting-edge technologies, predictive molecular analysis holds promise for expediting the discovery of novel therapeutic agents in the pharmaceutical sciences. MDPI 2023-07-15 /pmc/articles/PMC10380192/ /pubmed/37511247 http://dx.doi.org/10.3390/ijms241411488 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Review
Niazi, Sarfaraz K.
Mariam, Zamara
Recent Advances in Machine-Learning-Based Chemoinformatics: A Comprehensive Review
title Recent Advances in Machine-Learning-Based Chemoinformatics: A Comprehensive Review
title_full Recent Advances in Machine-Learning-Based Chemoinformatics: A Comprehensive Review
title_fullStr Recent Advances in Machine-Learning-Based Chemoinformatics: A Comprehensive Review
title_full_unstemmed Recent Advances in Machine-Learning-Based Chemoinformatics: A Comprehensive Review
title_short Recent Advances in Machine-Learning-Based Chemoinformatics: A Comprehensive Review
title_sort recent advances in machine-learning-based chemoinformatics: a comprehensive review
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10380192/
https://www.ncbi.nlm.nih.gov/pubmed/37511247
http://dx.doi.org/10.3390/ijms241411488
work_keys_str_mv AT niazisarfarazk recentadvancesinmachinelearningbasedchemoinformaticsacomprehensivereview
AT mariamzamara recentadvancesinmachinelearningbasedchemoinformaticsacomprehensivereview