Cargando…

Using support vector machines to improve elemental ion identification in macromolecular crystal structures

In the process of macromolecular model building, crystallographers must examine electron density for isolated atoms and differentiate sites containing structured solvent molecules from those containing elemental ions. This task requires specific knowledge of metal-binding chemistry and scattering pr...

Descripción completa

Detalles Bibliográficos
Autores principales: Morshed, Nader, Echols, Nathaniel, Adams, Paul D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: International Union of Crystallography 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4427199/
https://www.ncbi.nlm.nih.gov/pubmed/25945580
http://dx.doi.org/10.1107/S1399004715004241
_version_ 1782370690013331456
author Morshed, Nader
Echols, Nathaniel
Adams, Paul D.
author_facet Morshed, Nader
Echols, Nathaniel
Adams, Paul D.
author_sort Morshed, Nader
collection PubMed
description In the process of macromolecular model building, crystallographers must examine electron density for isolated atoms and differentiate sites containing structured solvent molecules from those containing elemental ions. This task requires specific knowledge of metal-binding chemistry and scattering properties and is prone to error. A method has previously been described to identify ions based on manually chosen criteria for a number of elements. Here, the use of support vector machines (SVMs) to automatically classify isolated atoms as either solvent or one of various ions is described. Two data sets of protein crystal structures, one containing manually curated structures deposited with anomalous diffraction data and another with automatically filtered, high-resolution structures, were constructed. On the manually curated data set, an SVM classifier was able to distinguish calcium from manganese, zinc, iron and nickel, as well as all five of these ions from water molecules, with a high degree of accuracy. Additionally, SVMs trained on the automatically curated set of high-resolution structures were able to successfully classify most common elemental ions in an independent validation test set. This method is readily extensible to other elemental ions and can also be used in conjunction with previous methods based on a priori expectations of the chemical environment and X-ray scattering.
format Online
Article
Text
id pubmed-4427199
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher International Union of Crystallography
record_format MEDLINE/PubMed
spelling pubmed-44271992015-05-25 Using support vector machines to improve elemental ion identification in macromolecular crystal structures Morshed, Nader Echols, Nathaniel Adams, Paul D. Acta Crystallogr D Biol Crystallogr Research Papers In the process of macromolecular model building, crystallographers must examine electron density for isolated atoms and differentiate sites containing structured solvent molecules from those containing elemental ions. This task requires specific knowledge of metal-binding chemistry and scattering properties and is prone to error. A method has previously been described to identify ions based on manually chosen criteria for a number of elements. Here, the use of support vector machines (SVMs) to automatically classify isolated atoms as either solvent or one of various ions is described. Two data sets of protein crystal structures, one containing manually curated structures deposited with anomalous diffraction data and another with automatically filtered, high-resolution structures, were constructed. On the manually curated data set, an SVM classifier was able to distinguish calcium from manganese, zinc, iron and nickel, as well as all five of these ions from water molecules, with a high degree of accuracy. Additionally, SVMs trained on the automatically curated set of high-resolution structures were able to successfully classify most common elemental ions in an independent validation test set. This method is readily extensible to other elemental ions and can also be used in conjunction with previous methods based on a priori expectations of the chemical environment and X-ray scattering. International Union of Crystallography 2015-04-25 /pmc/articles/PMC4427199/ /pubmed/25945580 http://dx.doi.org/10.1107/S1399004715004241 Text en © Morshed et al. 2015 http://creativecommons.org/licenses/by/2.0/uk/ This is an open-access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.
spellingShingle Research Papers
Morshed, Nader
Echols, Nathaniel
Adams, Paul D.
Using support vector machines to improve elemental ion identification in macromolecular crystal structures
title Using support vector machines to improve elemental ion identification in macromolecular crystal structures
title_full Using support vector machines to improve elemental ion identification in macromolecular crystal structures
title_fullStr Using support vector machines to improve elemental ion identification in macromolecular crystal structures
title_full_unstemmed Using support vector machines to improve elemental ion identification in macromolecular crystal structures
title_short Using support vector machines to improve elemental ion identification in macromolecular crystal structures
title_sort using support vector machines to improve elemental ion identification in macromolecular crystal structures
topic Research Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4427199/
https://www.ncbi.nlm.nih.gov/pubmed/25945580
http://dx.doi.org/10.1107/S1399004715004241
work_keys_str_mv AT morshednader usingsupportvectormachinestoimproveelementalionidentificationinmacromolecularcrystalstructures
AT echolsnathaniel usingsupportvectormachinestoimproveelementalionidentificationinmacromolecularcrystalstructures
AT adamspauld usingsupportvectormachinestoimproveelementalionidentificationinmacromolecularcrystalstructures