Cargando…
Comparative Exploratory Analysis of Intrinsically Disordered Protein Dynamics Using Machine Learning and Network Analytic Methods
Simulations of intrinsically disordered proteins (IDPs) pose numerous challenges to comparative analysis, prominently including highly dynamic conformational states and a lack of well-defined secondary structure. Machine learning (ML) algorithms are especially effective at discriminating among high-...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6581705/ https://www.ncbi.nlm.nih.gov/pubmed/31245383 http://dx.doi.org/10.3389/fmolb.2019.00042 |
_version_ | 1783428194283552768 |
---|---|
author | Grazioli, Gianmarc Martin, Rachel W. Butts, Carter T. |
author_facet | Grazioli, Gianmarc Martin, Rachel W. Butts, Carter T. |
author_sort | Grazioli, Gianmarc |
collection | PubMed |
description | Simulations of intrinsically disordered proteins (IDPs) pose numerous challenges to comparative analysis, prominently including highly dynamic conformational states and a lack of well-defined secondary structure. Machine learning (ML) algorithms are especially effective at discriminating among high-dimensional inputs whose differences are extremely subtle, making them well suited to the study of IDPs. In this work, we apply various ML techniques, including support vector machines (SVM) and clustering, as well as related methods such as principal component analysis (PCA) and protein structure network (PSN) analysis, to the problem of uncovering differences between configurational data from molecular dynamics simulations of two variants of the same IDP. We examine molecular dynamics (MD) trajectories of wild-type amyloid beta (Aβ(1−40)) and its “Arctic” variant (E22G), systems that play a central role in the etiology of Alzheimer's disease. Our analyses demonstrate ways in which ML and related approaches can be used to elucidate subtle differences between these proteins, including transient structure that is poorly captured by conventional metrics. |
format | Online Article Text |
id | pubmed-6581705 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-65817052019-06-26 Comparative Exploratory Analysis of Intrinsically Disordered Protein Dynamics Using Machine Learning and Network Analytic Methods Grazioli, Gianmarc Martin, Rachel W. Butts, Carter T. Front Mol Biosci Molecular Biosciences Simulations of intrinsically disordered proteins (IDPs) pose numerous challenges to comparative analysis, prominently including highly dynamic conformational states and a lack of well-defined secondary structure. Machine learning (ML) algorithms are especially effective at discriminating among high-dimensional inputs whose differences are extremely subtle, making them well suited to the study of IDPs. In this work, we apply various ML techniques, including support vector machines (SVM) and clustering, as well as related methods such as principal component analysis (PCA) and protein structure network (PSN) analysis, to the problem of uncovering differences between configurational data from molecular dynamics simulations of two variants of the same IDP. We examine molecular dynamics (MD) trajectories of wild-type amyloid beta (Aβ(1−40)) and its “Arctic” variant (E22G), systems that play a central role in the etiology of Alzheimer's disease. Our analyses demonstrate ways in which ML and related approaches can be used to elucidate subtle differences between these proteins, including transient structure that is poorly captured by conventional metrics. Frontiers Media S.A. 2019-06-12 /pmc/articles/PMC6581705/ /pubmed/31245383 http://dx.doi.org/10.3389/fmolb.2019.00042 Text en Copyright © 2019 Grazioli, Martin and Butts. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Molecular Biosciences Grazioli, Gianmarc Martin, Rachel W. Butts, Carter T. Comparative Exploratory Analysis of Intrinsically Disordered Protein Dynamics Using Machine Learning and Network Analytic Methods |
title | Comparative Exploratory Analysis of Intrinsically Disordered Protein Dynamics Using Machine Learning and Network Analytic Methods |
title_full | Comparative Exploratory Analysis of Intrinsically Disordered Protein Dynamics Using Machine Learning and Network Analytic Methods |
title_fullStr | Comparative Exploratory Analysis of Intrinsically Disordered Protein Dynamics Using Machine Learning and Network Analytic Methods |
title_full_unstemmed | Comparative Exploratory Analysis of Intrinsically Disordered Protein Dynamics Using Machine Learning and Network Analytic Methods |
title_short | Comparative Exploratory Analysis of Intrinsically Disordered Protein Dynamics Using Machine Learning and Network Analytic Methods |
title_sort | comparative exploratory analysis of intrinsically disordered protein dynamics using machine learning and network analytic methods |
topic | Molecular Biosciences |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6581705/ https://www.ncbi.nlm.nih.gov/pubmed/31245383 http://dx.doi.org/10.3389/fmolb.2019.00042 |
work_keys_str_mv | AT grazioligianmarc comparativeexploratoryanalysisofintrinsicallydisorderedproteindynamicsusingmachinelearningandnetworkanalyticmethods AT martinrachelw comparativeexploratoryanalysisofintrinsicallydisorderedproteindynamicsusingmachinelearningandnetworkanalyticmethods AT buttscartert comparativeexploratoryanalysisofintrinsicallydisorderedproteindynamicsusingmachinelearningandnetworkanalyticmethods |