Cargando…
A Visualization Tool for Cryo-EM Protein Validation with an Unsupervised Machine Learning Model in Chimera Platform
Background: Cryo-electron microscopy (cryo-EM) has become a major technique for protein structure determination. However, due to the low quality of cryo-EM density maps, many protein structures derived from cryo-EM contain outliers introduced during the modeling process. The current protein model va...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6789601/ https://www.ncbi.nlm.nih.gov/pubmed/31390767 http://dx.doi.org/10.3390/medicines6030086 |
_version_ | 1783458657232486400 |
---|---|
author | Chen, Lin Baker, Brandon Santos, Eduardo Sheep, Michell Daftarian, Darius |
author_facet | Chen, Lin Baker, Brandon Santos, Eduardo Sheep, Michell Daftarian, Darius |
author_sort | Chen, Lin |
collection | PubMed |
description | Background: Cryo-electron microscopy (cryo-EM) has become a major technique for protein structure determination. However, due to the low quality of cryo-EM density maps, many protein structures derived from cryo-EM contain outliers introduced during the modeling process. The current protein model validation system lacks identification features for cryo-EM proteins making it not enough to identify outliers in cryo-EM proteins. Methods: This study introduces an efficient unsupervised outlier detection model for validating protein models built from cryo-EM technique. The current model uses a high-resolution X-ray dataset (<1.5 Å) as the reference dataset. The distal block distance, side-chain length, phi, psi, and first chi angle of the residues in the reference dataset are collected and saved as a database of the histogram-based outlier score (HBOS). The HBOS value of the residues in target cryo-EM proteins can be read from this HBOS database. Results: Protein residues with a HBOS value greater than ten are labeled as outliers by default. Four datasets containing proteins derived from cryo-EM density maps were tested with this probabilistic anomaly detection model. Conclusions: According to the proposed model, a visualization assistant tool was designed for Chimera, a protein visualization platform. |
format | Online Article Text |
id | pubmed-6789601 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-67896012019-10-16 A Visualization Tool for Cryo-EM Protein Validation with an Unsupervised Machine Learning Model in Chimera Platform Chen, Lin Baker, Brandon Santos, Eduardo Sheep, Michell Daftarian, Darius Medicines (Basel) Article Background: Cryo-electron microscopy (cryo-EM) has become a major technique for protein structure determination. However, due to the low quality of cryo-EM density maps, many protein structures derived from cryo-EM contain outliers introduced during the modeling process. The current protein model validation system lacks identification features for cryo-EM proteins making it not enough to identify outliers in cryo-EM proteins. Methods: This study introduces an efficient unsupervised outlier detection model for validating protein models built from cryo-EM technique. The current model uses a high-resolution X-ray dataset (<1.5 Å) as the reference dataset. The distal block distance, side-chain length, phi, psi, and first chi angle of the residues in the reference dataset are collected and saved as a database of the histogram-based outlier score (HBOS). The HBOS value of the residues in target cryo-EM proteins can be read from this HBOS database. Results: Protein residues with a HBOS value greater than ten are labeled as outliers by default. Four datasets containing proteins derived from cryo-EM density maps were tested with this probabilistic anomaly detection model. Conclusions: According to the proposed model, a visualization assistant tool was designed for Chimera, a protein visualization platform. MDPI 2019-08-06 /pmc/articles/PMC6789601/ /pubmed/31390767 http://dx.doi.org/10.3390/medicines6030086 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Chen, Lin Baker, Brandon Santos, Eduardo Sheep, Michell Daftarian, Darius A Visualization Tool for Cryo-EM Protein Validation with an Unsupervised Machine Learning Model in Chimera Platform |
title | A Visualization Tool for Cryo-EM Protein Validation with an Unsupervised Machine Learning Model in Chimera Platform |
title_full | A Visualization Tool for Cryo-EM Protein Validation with an Unsupervised Machine Learning Model in Chimera Platform |
title_fullStr | A Visualization Tool for Cryo-EM Protein Validation with an Unsupervised Machine Learning Model in Chimera Platform |
title_full_unstemmed | A Visualization Tool for Cryo-EM Protein Validation with an Unsupervised Machine Learning Model in Chimera Platform |
title_short | A Visualization Tool for Cryo-EM Protein Validation with an Unsupervised Machine Learning Model in Chimera Platform |
title_sort | visualization tool for cryo-em protein validation with an unsupervised machine learning model in chimera platform |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6789601/ https://www.ncbi.nlm.nih.gov/pubmed/31390767 http://dx.doi.org/10.3390/medicines6030086 |
work_keys_str_mv | AT chenlin avisualizationtoolforcryoemproteinvalidationwithanunsupervisedmachinelearningmodelinchimeraplatform AT bakerbrandon avisualizationtoolforcryoemproteinvalidationwithanunsupervisedmachinelearningmodelinchimeraplatform AT santoseduardo avisualizationtoolforcryoemproteinvalidationwithanunsupervisedmachinelearningmodelinchimeraplatform AT sheepmichell avisualizationtoolforcryoemproteinvalidationwithanunsupervisedmachinelearningmodelinchimeraplatform AT daftariandarius avisualizationtoolforcryoemproteinvalidationwithanunsupervisedmachinelearningmodelinchimeraplatform AT chenlin visualizationtoolforcryoemproteinvalidationwithanunsupervisedmachinelearningmodelinchimeraplatform AT bakerbrandon visualizationtoolforcryoemproteinvalidationwithanunsupervisedmachinelearningmodelinchimeraplatform AT santoseduardo visualizationtoolforcryoemproteinvalidationwithanunsupervisedmachinelearningmodelinchimeraplatform AT sheepmichell visualizationtoolforcryoemproteinvalidationwithanunsupervisedmachinelearningmodelinchimeraplatform AT daftariandarius visualizationtoolforcryoemproteinvalidationwithanunsupervisedmachinelearningmodelinchimeraplatform |