
A Visualization Tool for Cryo-EM Protein Validation with an Unsupervised Machine Learning Model in Chimera Platform

Background: Cryo-electron microscopy (cryo-EM) has become a major technique for protein structure determination. However, due to the low quality of cryo-EM density maps, many protein structures derived from cryo-EM contain outliers introduced during the modeling process. The current protein model va...


Bibliographic Details
Main authors: Chen, Lin; Baker, Brandon; Santos, Eduardo; Sheep, Michell; Daftarian, Darius
Format: Online Article Text
Language: English
Published: MDPI 2019
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6789601/
https://www.ncbi.nlm.nih.gov/pubmed/31390767
http://dx.doi.org/10.3390/medicines6030086
author Chen, Lin
Baker, Brandon
Santos, Eduardo
Sheep, Michell
Daftarian, Darius
collection PubMed
description Background: Cryo-electron microscopy (cryo-EM) has become a major technique for protein structure determination. However, because cryo-EM density maps are often of low quality, many protein structures derived from cryo-EM contain outliers introduced during the modeling process. Current protein model validation systems lack features specific to cryo-EM proteins and are therefore insufficient for identifying these outliers. Methods: This study introduces an efficient unsupervised outlier detection model for validating protein models built from cryo-EM data. The model uses a high-resolution X-ray dataset (<1.5 Å) as the reference dataset. The distal block distance, side-chain length, phi, psi, and first chi angle of the residues in the reference dataset are collected and stored as a histogram-based outlier score (HBOS) database. The HBOS value of each residue in a target cryo-EM protein can then be read from this database. Results: By default, protein residues with an HBOS value greater than ten are labeled as outliers. Four datasets containing proteins derived from cryo-EM density maps were tested with this probabilistic anomaly detection model. Conclusions: Based on the proposed model, a visualization assistant tool was designed for Chimera, a protein visualization platform.
format Online
Article
Text
id pubmed-6789601
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-6789601 2019-10-16 Medicines (Basel) Article MDPI 2019-08-06 /pmc/articles/PMC6789601/ /pubmed/31390767 http://dx.doi.org/10.3390/medicines6030086 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
title A Visualization Tool for Cryo-EM Protein Validation with an Unsupervised Machine Learning Model in Chimera Platform
topic Article
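
The description in this record outlines a histogram-based outlier score (HBOS) built from a reference dataset and looked up for target residues. The sketch below illustrates the general HBOS idea only and is not the authors' implementation: the reference values are synthetic, and the bin count, the density floor, the natural logarithm, and the function names fit_hbos and hbos_scores are all assumptions made for illustration. Only the five feature names and the default threshold of ten come from the description.

```python
import numpy as np

# Feature names follow the description; every numeric value below is synthetic.
FEATURES = ["distal_block_distance", "side_chain_length", "phi", "psi", "chi1"]
THRESHOLD = 10.0  # default outlier cutoff stated in the description


def fit_hbos(reference, bins=20):
    """Build one histogram per feature column, scaled so the tallest bin is 1."""
    model = []
    for column in reference.T:
        counts, edges = np.histogram(column, bins=bins)
        model.append((counts / counts.max(), edges))
    return model


def hbos_scores(model, targets, floor=1e-3):
    """Sum log(1 / bin height) over features; empty or out-of-range bins use `floor`."""
    targets = np.atleast_2d(targets)
    scores = np.zeros(targets.shape[0])
    for j, (heights, edges) in enumerate(model):
        x = targets[:, j]
        idx = np.searchsorted(edges, x, side="right") - 1
        idx[x == edges[-1]] = len(heights) - 1  # the last bin includes its right edge
        inside = (idx >= 0) & (idx < len(heights))
        h = np.full(x.shape, floor)
        h[inside] = np.maximum(heights[idx[inside]], floor)
        scores += np.log(1.0 / h)
    return scores


# Synthetic stand-in for the high-resolution reference set (hypothetical values).
rng = np.random.default_rng(0)
center = np.array([3.0, 2.5, -60.0, -45.0, -65.0])
spread = np.array([0.3, 0.3, 15.0, 15.0, 15.0])
reference = rng.normal(center, spread, size=(5000, len(FEATURES)))

model = fit_hbos(reference)
typical = center                  # a residue at the centre of every distribution
extreme = center + 20.0 * spread  # a residue far outside every distribution
scores = hbos_scores(model, np.array([typical, extreme]))
is_outlier = scores > THRESHOLD
```

In this toy setup the residue at the centre of the reference distributions stays below the threshold, while the residue far outside all five distributions is flagged. With real data, the reference array would hold the per-residue features extracted from the X-ray structures.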