Cargando…

Using Machine Learning in Accuracy Assessment of Knowledge-Based Energy and Frequency Base Likelihood in Protein Structures

Many aspects of the study of protein folding and dynamics have been affected by the accumulation of data about native protein structures and recent advances in machine learning. Computational methods for predicting protein structures from their sequences are now heavily based on machine learning too...

Descripción completa

Detalles Bibliográficos
Autores principales: Serafimova, Katerina, Mihaylov, Iliyan, Vassilev, Dimitar, Avdjieva, Irena, Zielenkiewicz, Piotr, Kaczanowski, Szymon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7304015/
http://dx.doi.org/10.1007/978-3-030-50420-5_43
_version_ 1783548180264124416
author Serafimova, Katerina
Mihaylov, Iliyan
Vassilev, Dimitar
Avdjieva, Irena
Zielenkiewicz, Piotr
Kaczanowski, Szymon
author_facet Serafimova, Katerina
Mihaylov, Iliyan
Vassilev, Dimitar
Avdjieva, Irena
Zielenkiewicz, Piotr
Kaczanowski, Szymon
author_sort Serafimova, Katerina
collection PubMed
description Many aspects of the study of protein folding and dynamics have been affected by the accumulation of data about native protein structures and recent advances in machine learning. Computational methods for predicting protein structures from their sequences are now heavily based on machine learning tools and on approaches that extract knowledge and rules from data using probabilistic models. Many of these methods use scoring functions to determine which structure best fits a native protein sequence. Using computational approaches, we obtained two scoring functions: knowledge-based energy and likelihood of base frequency, and we compared their accuracy in measuring the sequence structure fit. We compared the machine learning models’ accuracy of predictions for knowledge-based energy and likelihood values to validate our results, showing that likelihood is a more accurate scoring function than knowledge-based energy.
format Online
Article
Text
id pubmed-7304015
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-73040152020-06-19 Using Machine Learning in Accuracy Assessment of Knowledge-Based Energy and Frequency Base Likelihood in Protein Structures Serafimova, Katerina Mihaylov, Iliyan Vassilev, Dimitar Avdjieva, Irena Zielenkiewicz, Piotr Kaczanowski, Szymon Computational Science – ICCS 2020 Article Many aspects of the study of protein folding and dynamics have been affected by the accumulation of data about native protein structures and recent advances in machine learning. Computational methods for predicting protein structures from their sequences are now heavily based on machine learning tools and on approaches that extract knowledge and rules from data using probabilistic models. Many of these methods use scoring functions to determine which structure best fits a native protein sequence. Using computational approaches, we obtained two scoring functions: knowledge-based energy and likelihood of base frequency, and we compared their accuracy in measuring the sequence structure fit. We compared the machine learning models’ accuracy of predictions for knowledge-based energy and likelihood values to validate our results, showing that likelihood is a more accurate scoring function than knowledge-based energy. 2020-05-22 /pmc/articles/PMC7304015/ http://dx.doi.org/10.1007/978-3-030-50420-5_43 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Serafimova, Katerina
Mihaylov, Iliyan
Vassilev, Dimitar
Avdjieva, Irena
Zielenkiewicz, Piotr
Kaczanowski, Szymon
Using Machine Learning in Accuracy Assessment of Knowledge-Based Energy and Frequency Base Likelihood in Protein Structures
title Using Machine Learning in Accuracy Assessment of Knowledge-Based Energy and Frequency Base Likelihood in Protein Structures
title_full Using Machine Learning in Accuracy Assessment of Knowledge-Based Energy and Frequency Base Likelihood in Protein Structures
title_fullStr Using Machine Learning in Accuracy Assessment of Knowledge-Based Energy and Frequency Base Likelihood in Protein Structures
title_full_unstemmed Using Machine Learning in Accuracy Assessment of Knowledge-Based Energy and Frequency Base Likelihood in Protein Structures
title_short Using Machine Learning in Accuracy Assessment of Knowledge-Based Energy and Frequency Base Likelihood in Protein Structures
title_sort using machine learning in accuracy assessment of knowledge-based energy and frequency base likelihood in protein structures
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7304015/
http://dx.doi.org/10.1007/978-3-030-50420-5_43
work_keys_str_mv AT serafimovakaterina usingmachinelearninginaccuracyassessmentofknowledgebasedenergyandfrequencybaselikelihoodinproteinstructures
AT mihayloviliyan usingmachinelearninginaccuracyassessmentofknowledgebasedenergyandfrequencybaselikelihoodinproteinstructures
AT vassilevdimitar usingmachinelearninginaccuracyassessmentofknowledgebasedenergyandfrequencybaselikelihoodinproteinstructures
AT avdjievairena usingmachinelearninginaccuracyassessmentofknowledgebasedenergyandfrequencybaselikelihoodinproteinstructures
AT zielenkiewiczpiotr usingmachinelearninginaccuracyassessmentofknowledgebasedenergyandfrequencybaselikelihoodinproteinstructures
AT kaczanowskiszymon usingmachinelearninginaccuracyassessmentofknowledgebasedenergyandfrequencybaselikelihoodinproteinstructures