Cargando…

Molecular function recognition by supervised projection pursuit machine learning

Identifying mechanisms that control molecular function is a significant challenge in pharmaceutical science and molecular engineering. Here, we present a novel projection pursuit recurrent neural network to identify functional mechanisms in the context of iterative supervised machine learning for di...

Descripción completa

Detalles Bibliográficos
Autores principales: Grear, Tyler, Avery, Chris, Patterson, John, Jacobs, Donald J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7895977/
https://www.ncbi.nlm.nih.gov/pubmed/33608593
http://dx.doi.org/10.1038/s41598-021-83269-y
_version_ 1783653467138555904
author Grear, Tyler
Avery, Chris
Patterson, John
Jacobs, Donald J.
author_facet Grear, Tyler
Avery, Chris
Patterson, John
Jacobs, Donald J.
author_sort Grear, Tyler
collection PubMed
description Identifying mechanisms that control molecular function is a significant challenge in pharmaceutical science and molecular engineering. Here, we present a novel projection pursuit recurrent neural network to identify functional mechanisms in the context of iterative supervised machine learning for discovery-based design optimization. Molecular function recognition is achieved by pairing experiments that categorize systems with digital twin molecular dynamics simulations to generate working hypotheses. Feature extraction decomposes emergent properties of a system into a complete set of basis vectors. Feature selection requires signal-to-noise, statistical significance, and clustering quality to concurrently surpass acceptance levels. Formulated as a multivariate description of differences and similarities between systems, the data-driven working hypothesis is refined by analyzing new systems prioritized by a discovery-likelihood. Utility and generality are demonstrated on several benchmarks, including the elucidation of antibiotic resistance in TEM-52 beta-lactamase. The software is freely available, enabling turnkey analysis of massive data streams found in computational biology and material science.
format Online
Article
Text
id pubmed-7895977
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-78959772021-02-24 Molecular function recognition by supervised projection pursuit machine learning Grear, Tyler Avery, Chris Patterson, John Jacobs, Donald J. Sci Rep Article Identifying mechanisms that control molecular function is a significant challenge in pharmaceutical science and molecular engineering. Here, we present a novel projection pursuit recurrent neural network to identify functional mechanisms in the context of iterative supervised machine learning for discovery-based design optimization. Molecular function recognition is achieved by pairing experiments that categorize systems with digital twin molecular dynamics simulations to generate working hypotheses. Feature extraction decomposes emergent properties of a system into a complete set of basis vectors. Feature selection requires signal-to-noise, statistical significance, and clustering quality to concurrently surpass acceptance levels. Formulated as a multivariate description of differences and similarities between systems, the data-driven working hypothesis is refined by analyzing new systems prioritized by a discovery-likelihood. Utility and generality are demonstrated on several benchmarks, including the elucidation of antibiotic resistance in TEM-52 beta-lactamase. The software is freely available, enabling turnkey analysis of massive data streams found in computational biology and material science. Nature Publishing Group UK 2021-02-19 /pmc/articles/PMC7895977/ /pubmed/33608593 http://dx.doi.org/10.1038/s41598-021-83269-y Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Grear, Tyler
Avery, Chris
Patterson, John
Jacobs, Donald J.
Molecular function recognition by supervised projection pursuit machine learning
title Molecular function recognition by supervised projection pursuit machine learning
title_full Molecular function recognition by supervised projection pursuit machine learning
title_fullStr Molecular function recognition by supervised projection pursuit machine learning
title_full_unstemmed Molecular function recognition by supervised projection pursuit machine learning
title_short Molecular function recognition by supervised projection pursuit machine learning
title_sort molecular function recognition by supervised projection pursuit machine learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7895977/
https://www.ncbi.nlm.nih.gov/pubmed/33608593
http://dx.doi.org/10.1038/s41598-021-83269-y
work_keys_str_mv AT greartyler molecularfunctionrecognitionbysupervisedprojectionpursuitmachinelearning
AT averychris molecularfunctionrecognitionbysupervisedprojectionpursuitmachinelearning
AT pattersonjohn molecularfunctionrecognitionbysupervisedprojectionpursuitmachinelearning
AT jacobsdonaldj molecularfunctionrecognitionbysupervisedprojectionpursuitmachinelearning