Cargando…
Machine Learning Approaches for Quality Assessment of Protein Structures
Protein structures play a very important role in biomedical research, especially in drug discovery and design, which require accurate protein structures in advance. However, experimental determinations of protein structure are prohibitively costly and time-consuming, and computational predictions of...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7226485/ https://www.ncbi.nlm.nih.gov/pubmed/32316682 http://dx.doi.org/10.3390/biom10040626 |
_version_ | 1783534299218182144 |
---|---|
author | Chen, Jiarui Siu, Shirley W. I. |
author_facet | Chen, Jiarui Siu, Shirley W. I. |
author_sort | Chen, Jiarui |
collection | PubMed |
description | Protein structures play a very important role in biomedical research, especially in drug discovery and design, which require accurate protein structures in advance. However, experimental determinations of protein structure are prohibitively costly and time-consuming, and computational predictions of protein structures have not been perfected. Methods that assess the quality of protein models can help in selecting the most accurate candidates for further work. Driven by this demand, many structural bioinformatics laboratories have developed methods for estimating model accuracy (EMA). In recent years, EMA by machine learning (ML) have consistently ranked among the top-performing methods in the community-wide CASP challenge. Accordingly, we systematically review all the major ML-based EMA methods developed within the past ten years. The methods are grouped by their employed ML approach—support vector machine, artificial neural networks, ensemble learning, or Bayesian learning—and their significances are discussed from a methodology viewpoint. To orient the reader, we also briefly describe the background of EMA, including the CASP challenge and its evaluation metrics, and introduce the major ML/DL techniques. Overall, this review provides an introductory guide to modern research on protein quality assessment and directions for future research in this area. |
format | Online Article Text |
id | pubmed-7226485 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-72264852020-05-18 Machine Learning Approaches for Quality Assessment of Protein Structures Chen, Jiarui Siu, Shirley W. I. Biomolecules Review Protein structures play a very important role in biomedical research, especially in drug discovery and design, which require accurate protein structures in advance. However, experimental determinations of protein structure are prohibitively costly and time-consuming, and computational predictions of protein structures have not been perfected. Methods that assess the quality of protein models can help in selecting the most accurate candidates for further work. Driven by this demand, many structural bioinformatics laboratories have developed methods for estimating model accuracy (EMA). In recent years, EMA by machine learning (ML) have consistently ranked among the top-performing methods in the community-wide CASP challenge. Accordingly, we systematically review all the major ML-based EMA methods developed within the past ten years. The methods are grouped by their employed ML approach—support vector machine, artificial neural networks, ensemble learning, or Bayesian learning—and their significances are discussed from a methodology viewpoint. To orient the reader, we also briefly describe the background of EMA, including the CASP challenge and its evaluation metrics, and introduce the major ML/DL techniques. Overall, this review provides an introductory guide to modern research on protein quality assessment and directions for future research in this area. MDPI 2020-04-17 /pmc/articles/PMC7226485/ /pubmed/32316682 http://dx.doi.org/10.3390/biom10040626 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Review Chen, Jiarui Siu, Shirley W. I. Machine Learning Approaches for Quality Assessment of Protein Structures |
title | Machine Learning Approaches for Quality Assessment of Protein Structures |
title_full | Machine Learning Approaches for Quality Assessment of Protein Structures |
title_fullStr | Machine Learning Approaches for Quality Assessment of Protein Structures |
title_full_unstemmed | Machine Learning Approaches for Quality Assessment of Protein Structures |
title_short | Machine Learning Approaches for Quality Assessment of Protein Structures |
title_sort | machine learning approaches for quality assessment of protein structures |
topic | Review |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7226485/ https://www.ncbi.nlm.nih.gov/pubmed/32316682 http://dx.doi.org/10.3390/biom10040626 |
work_keys_str_mv | AT chenjiarui machinelearningapproachesforqualityassessmentofproteinstructures AT siushirleywi machinelearningapproachesforqualityassessmentofproteinstructures |