Cargando…

Mapping HDX-MS Data to Protein Conformations through Training Ensemble-Based Models

[Image: see text] An original approach that adopts machine learning inference to predict protein structural information using hydrogen–deuterium exchange mass spectrometry (HDX-MS) is described. The method exploits an in-house optimization program that increases the resolution of HDX-MS data from pe...

Descripción completa

Detalles Bibliográficos
Autores principales: Salmas, Ramin E., Harris, Matthew J., Borysik, Antoni J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2023
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10485923/
https://www.ncbi.nlm.nih.gov/pubmed/37550799
http://dx.doi.org/10.1021/jasms.3c00145
_version_ 1785102888842821632
author Salmas, Ramin E.
Harris, Matthew J.
Borysik, Antoni J.
author_facet Salmas, Ramin E.
Harris, Matthew J.
Borysik, Antoni J.
author_sort Salmas, Ramin E.
collection PubMed
description [Image: see text] An original approach that adopts machine learning inference to predict protein structural information using hydrogen–deuterium exchange mass spectrometry (HDX-MS) is described. The method exploits an in-house optimization program that increases the resolution of HDX-MS data from peptides to amino acids. A system is trained using Gradient Tree Boosting as a type of machine learning ensemble technique to assign a protein secondary structure. Using limited training data we generate a discriminative model that uses optimized HDX-MS data to predict protein secondary structure with an accuracy of 75%. This research could form the basis for new methods exploiting artificial intelligence to model protein conformations by HDX-MS.
format Online
Article
Text
id pubmed-10485923
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-104859232023-09-09 Mapping HDX-MS Data to Protein Conformations through Training Ensemble-Based Models Salmas, Ramin E. Harris, Matthew J. Borysik, Antoni J. J Am Soc Mass Spectrom [Image: see text] An original approach that adopts machine learning inference to predict protein structural information using hydrogen–deuterium exchange mass spectrometry (HDX-MS) is described. The method exploits an in-house optimization program that increases the resolution of HDX-MS data from peptides to amino acids. A system is trained using Gradient Tree Boosting as a type of machine learning ensemble technique to assign a protein secondary structure. Using limited training data we generate a discriminative model that uses optimized HDX-MS data to predict protein secondary structure with an accuracy of 75%. This research could form the basis for new methods exploiting artificial intelligence to model protein conformations by HDX-MS. American Chemical Society 2023-08-07 /pmc/articles/PMC10485923/ /pubmed/37550799 http://dx.doi.org/10.1021/jasms.3c00145 Text en © 2023 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Salmas, Ramin E.
Harris, Matthew J.
Borysik, Antoni J.
Mapping HDX-MS Data to Protein Conformations through Training Ensemble-Based Models
title Mapping HDX-MS Data to Protein Conformations through Training Ensemble-Based Models
title_full Mapping HDX-MS Data to Protein Conformations through Training Ensemble-Based Models
title_fullStr Mapping HDX-MS Data to Protein Conformations through Training Ensemble-Based Models
title_full_unstemmed Mapping HDX-MS Data to Protein Conformations through Training Ensemble-Based Models
title_short Mapping HDX-MS Data to Protein Conformations through Training Ensemble-Based Models
title_sort mapping hdx-ms data to protein conformations through training ensemble-based models
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10485923/
https://www.ncbi.nlm.nih.gov/pubmed/37550799
http://dx.doi.org/10.1021/jasms.3c00145
work_keys_str_mv AT salmasramine mappinghdxmsdatatoproteinconformationsthroughtrainingensemblebasedmodels
AT harrismatthewj mappinghdxmsdatatoproteinconformationsthroughtrainingensemblebasedmodels
AT borysikantonij mappinghdxmsdatatoproteinconformationsthroughtrainingensemblebasedmodels