Cargando…
Using Machine Learning in Accuracy Assessment of Knowledge-Based Energy and Frequency Base Likelihood in Protein Structures
Many aspects of the study of protein folding and dynamics have been affected by the accumulation of data about native protein structures and recent advances in machine learning. Computational methods for predicting protein structures from their sequences are now heavily based on machine learning too...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7304015/ http://dx.doi.org/10.1007/978-3-030-50420-5_43 |
_version_ | 1783548180264124416 |
---|---|
author | Serafimova, Katerina Mihaylov, Iliyan Vassilev, Dimitar Avdjieva, Irena Zielenkiewicz, Piotr Kaczanowski, Szymon |
author_facet | Serafimova, Katerina Mihaylov, Iliyan Vassilev, Dimitar Avdjieva, Irena Zielenkiewicz, Piotr Kaczanowski, Szymon |
author_sort | Serafimova, Katerina |
collection | PubMed |
description | Many aspects of the study of protein folding and dynamics have been affected by the accumulation of data about native protein structures and recent advances in machine learning. Computational methods for predicting protein structures from their sequences are now heavily based on machine learning tools and on approaches that extract knowledge and rules from data using probabilistic models. Many of these methods use scoring functions to determine which structure best fits a native protein sequence. Using computational approaches, we obtained two scoring functions: knowledge-based energy and likelihood of base frequency, and we compared their accuracy in measuring the sequence structure fit. We compared the machine learning models’ accuracy of predictions for knowledge-based energy and likelihood values to validate our results, showing that likelihood is a more accurate scoring function than knowledge-based energy. |
format | Online Article Text |
id | pubmed-7304015 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
record_format | MEDLINE/PubMed |
spelling | pubmed-73040152020-06-19 Using Machine Learning in Accuracy Assessment of Knowledge-Based Energy and Frequency Base Likelihood in Protein Structures Serafimova, Katerina Mihaylov, Iliyan Vassilev, Dimitar Avdjieva, Irena Zielenkiewicz, Piotr Kaczanowski, Szymon Computational Science – ICCS 2020 Article Many aspects of the study of protein folding and dynamics have been affected by the accumulation of data about native protein structures and recent advances in machine learning. Computational methods for predicting protein structures from their sequences are now heavily based on machine learning tools and on approaches that extract knowledge and rules from data using probabilistic models. Many of these methods use scoring functions to determine which structure best fits a native protein sequence. Using computational approaches, we obtained two scoring functions: knowledge-based energy and likelihood of base frequency, and we compared their accuracy in measuring the sequence structure fit. We compared the machine learning models’ accuracy of predictions for knowledge-based energy and likelihood values to validate our results, showing that likelihood is a more accurate scoring function than knowledge-based energy. 2020-05-22 /pmc/articles/PMC7304015/ http://dx.doi.org/10.1007/978-3-030-50420-5_43 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Article Serafimova, Katerina Mihaylov, Iliyan Vassilev, Dimitar Avdjieva, Irena Zielenkiewicz, Piotr Kaczanowski, Szymon Using Machine Learning in Accuracy Assessment of Knowledge-Based Energy and Frequency Base Likelihood in Protein Structures |
title | Using Machine Learning in Accuracy Assessment of Knowledge-Based Energy and Frequency Base Likelihood in Protein Structures |
title_full | Using Machine Learning in Accuracy Assessment of Knowledge-Based Energy and Frequency Base Likelihood in Protein Structures |
title_fullStr | Using Machine Learning in Accuracy Assessment of Knowledge-Based Energy and Frequency Base Likelihood in Protein Structures |
title_full_unstemmed | Using Machine Learning in Accuracy Assessment of Knowledge-Based Energy and Frequency Base Likelihood in Protein Structures |
title_short | Using Machine Learning in Accuracy Assessment of Knowledge-Based Energy and Frequency Base Likelihood in Protein Structures |
title_sort | using machine learning in accuracy assessment of knowledge-based energy and frequency base likelihood in protein structures |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7304015/ http://dx.doi.org/10.1007/978-3-030-50420-5_43 |
work_keys_str_mv | AT serafimovakaterina usingmachinelearninginaccuracyassessmentofknowledgebasedenergyandfrequencybaselikelihoodinproteinstructures AT mihayloviliyan usingmachinelearninginaccuracyassessmentofknowledgebasedenergyandfrequencybaselikelihoodinproteinstructures AT vassilevdimitar usingmachinelearninginaccuracyassessmentofknowledgebasedenergyandfrequencybaselikelihoodinproteinstructures AT avdjievairena usingmachinelearninginaccuracyassessmentofknowledgebasedenergyandfrequencybaselikelihoodinproteinstructures AT zielenkiewiczpiotr usingmachinelearninginaccuracyassessmentofknowledgebasedenergyandfrequencybaselikelihoodinproteinstructures AT kaczanowskiszymon usingmachinelearninginaccuracyassessmentofknowledgebasedenergyandfrequencybaselikelihoodinproteinstructures |