Cargando…

Comparative Exploratory Analysis of Intrinsically Disordered Protein Dynamics Using Machine Learning and Network Analytic Methods

Simulations of intrinsically disordered proteins (IDPs) pose numerous challenges to comparative analysis, prominently including highly dynamic conformational states and a lack of well-defined secondary structure. Machine learning (ML) algorithms are especially effective at discriminating among high-...

Descripción completa

Detalles Bibliográficos
Autores principales: Grazioli, Gianmarc, Martin, Rachel W., Butts, Carter T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6581705/
https://www.ncbi.nlm.nih.gov/pubmed/31245383
http://dx.doi.org/10.3389/fmolb.2019.00042
_version_ 1783428194283552768
author Grazioli, Gianmarc
Martin, Rachel W.
Butts, Carter T.
author_facet Grazioli, Gianmarc
Martin, Rachel W.
Butts, Carter T.
author_sort Grazioli, Gianmarc
collection PubMed
description Simulations of intrinsically disordered proteins (IDPs) pose numerous challenges to comparative analysis, prominently including highly dynamic conformational states and a lack of well-defined secondary structure. Machine learning (ML) algorithms are especially effective at discriminating among high-dimensional inputs whose differences are extremely subtle, making them well suited to the study of IDPs. In this work, we apply various ML techniques, including support vector machines (SVM) and clustering, as well as related methods such as principal component analysis (PCA) and protein structure network (PSN) analysis, to the problem of uncovering differences between configurational data from molecular dynamics simulations of two variants of the same IDP. We examine molecular dynamics (MD) trajectories of wild-type amyloid beta (Aβ(1−40)) and its “Arctic” variant (E22G), systems that play a central role in the etiology of Alzheimer's disease. Our analyses demonstrate ways in which ML and related approaches can be used to elucidate subtle differences between these proteins, including transient structure that is poorly captured by conventional metrics.
format Online
Article
Text
id pubmed-6581705
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-65817052019-06-26 Comparative Exploratory Analysis of Intrinsically Disordered Protein Dynamics Using Machine Learning and Network Analytic Methods Grazioli, Gianmarc Martin, Rachel W. Butts, Carter T. Front Mol Biosci Molecular Biosciences Simulations of intrinsically disordered proteins (IDPs) pose numerous challenges to comparative analysis, prominently including highly dynamic conformational states and a lack of well-defined secondary structure. Machine learning (ML) algorithms are especially effective at discriminating among high-dimensional inputs whose differences are extremely subtle, making them well suited to the study of IDPs. In this work, we apply various ML techniques, including support vector machines (SVM) and clustering, as well as related methods such as principal component analysis (PCA) and protein structure network (PSN) analysis, to the problem of uncovering differences between configurational data from molecular dynamics simulations of two variants of the same IDP. We examine molecular dynamics (MD) trajectories of wild-type amyloid beta (Aβ(1−40)) and its “Arctic” variant (E22G), systems that play a central role in the etiology of Alzheimer's disease. Our analyses demonstrate ways in which ML and related approaches can be used to elucidate subtle differences between these proteins, including transient structure that is poorly captured by conventional metrics. Frontiers Media S.A. 2019-06-12 /pmc/articles/PMC6581705/ /pubmed/31245383 http://dx.doi.org/10.3389/fmolb.2019.00042 Text en Copyright © 2019 Grazioli, Martin and Butts. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Molecular Biosciences
Grazioli, Gianmarc
Martin, Rachel W.
Butts, Carter T.
Comparative Exploratory Analysis of Intrinsically Disordered Protein Dynamics Using Machine Learning and Network Analytic Methods
title Comparative Exploratory Analysis of Intrinsically Disordered Protein Dynamics Using Machine Learning and Network Analytic Methods
title_full Comparative Exploratory Analysis of Intrinsically Disordered Protein Dynamics Using Machine Learning and Network Analytic Methods
title_fullStr Comparative Exploratory Analysis of Intrinsically Disordered Protein Dynamics Using Machine Learning and Network Analytic Methods
title_full_unstemmed Comparative Exploratory Analysis of Intrinsically Disordered Protein Dynamics Using Machine Learning and Network Analytic Methods
title_short Comparative Exploratory Analysis of Intrinsically Disordered Protein Dynamics Using Machine Learning and Network Analytic Methods
title_sort comparative exploratory analysis of intrinsically disordered protein dynamics using machine learning and network analytic methods
topic Molecular Biosciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6581705/
https://www.ncbi.nlm.nih.gov/pubmed/31245383
http://dx.doi.org/10.3389/fmolb.2019.00042
work_keys_str_mv AT grazioligianmarc comparativeexploratoryanalysisofintrinsicallydisorderedproteindynamicsusingmachinelearningandnetworkanalyticmethods
AT martinrachelw comparativeexploratoryanalysisofintrinsicallydisorderedproteindynamicsusingmachinelearningandnetworkanalyticmethods
AT buttscartert comparativeexploratoryanalysisofintrinsicallydisorderedproteindynamicsusingmachinelearningandnetworkanalyticmethods