Cargando…

Machine Learning Approaches for Quality Assessment of Protein Structures

Protein structures play a very important role in biomedical research, especially in drug discovery and design, which require accurate protein structures in advance. However, experimental determinations of protein structure are prohibitively costly and time-consuming, and computational predictions of...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Jiarui, Siu, Shirley W. I.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7226485/
https://www.ncbi.nlm.nih.gov/pubmed/32316682
http://dx.doi.org/10.3390/biom10040626
_version_ 1783534299218182144
author Chen, Jiarui
Siu, Shirley W. I.
author_facet Chen, Jiarui
Siu, Shirley W. I.
author_sort Chen, Jiarui
collection PubMed
description Protein structures play a very important role in biomedical research, especially in drug discovery and design, which require accurate protein structures in advance. However, experimental determinations of protein structure are prohibitively costly and time-consuming, and computational predictions of protein structures have not been perfected. Methods that assess the quality of protein models can help in selecting the most accurate candidates for further work. Driven by this demand, many structural bioinformatics laboratories have developed methods for estimating model accuracy (EMA). In recent years, EMA by machine learning (ML) have consistently ranked among the top-performing methods in the community-wide CASP challenge. Accordingly, we systematically review all the major ML-based EMA methods developed within the past ten years. The methods are grouped by their employed ML approach—support vector machine, artificial neural networks, ensemble learning, or Bayesian learning—and their significances are discussed from a methodology viewpoint. To orient the reader, we also briefly describe the background of EMA, including the CASP challenge and its evaluation metrics, and introduce the major ML/DL techniques. Overall, this review provides an introductory guide to modern research on protein quality assessment and directions for future research in this area.
format Online
Article
Text
id pubmed-7226485
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-72264852020-05-18 Machine Learning Approaches for Quality Assessment of Protein Structures Chen, Jiarui Siu, Shirley W. I. Biomolecules Review Protein structures play a very important role in biomedical research, especially in drug discovery and design, which require accurate protein structures in advance. However, experimental determinations of protein structure are prohibitively costly and time-consuming, and computational predictions of protein structures have not been perfected. Methods that assess the quality of protein models can help in selecting the most accurate candidates for further work. Driven by this demand, many structural bioinformatics laboratories have developed methods for estimating model accuracy (EMA). In recent years, EMA by machine learning (ML) have consistently ranked among the top-performing methods in the community-wide CASP challenge. Accordingly, we systematically review all the major ML-based EMA methods developed within the past ten years. The methods are grouped by their employed ML approach—support vector machine, artificial neural networks, ensemble learning, or Bayesian learning—and their significances are discussed from a methodology viewpoint. To orient the reader, we also briefly describe the background of EMA, including the CASP challenge and its evaluation metrics, and introduce the major ML/DL techniques. Overall, this review provides an introductory guide to modern research on protein quality assessment and directions for future research in this area. MDPI 2020-04-17 /pmc/articles/PMC7226485/ /pubmed/32316682 http://dx.doi.org/10.3390/biom10040626 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Review
Chen, Jiarui
Siu, Shirley W. I.
Machine Learning Approaches for Quality Assessment of Protein Structures
title Machine Learning Approaches for Quality Assessment of Protein Structures
title_full Machine Learning Approaches for Quality Assessment of Protein Structures
title_fullStr Machine Learning Approaches for Quality Assessment of Protein Structures
title_full_unstemmed Machine Learning Approaches for Quality Assessment of Protein Structures
title_short Machine Learning Approaches for Quality Assessment of Protein Structures
title_sort machine learning approaches for quality assessment of protein structures
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7226485/
https://www.ncbi.nlm.nih.gov/pubmed/32316682
http://dx.doi.org/10.3390/biom10040626
work_keys_str_mv AT chenjiarui machinelearningapproachesforqualityassessmentofproteinstructures
AT siushirleywi machinelearningapproachesforqualityassessmentofproteinstructures