Cargando…

Vagabond: bond-based parametrization reduces overfitting for refinement of proteins

Structural biology methods have delivered over 150 000 high-resolution structures of macromolecules, which have fundamentally altered our understanding of biology and our approach to developing new medicines. However, the description of molecular flexibility is instrinsically flawed and in almost al...

Descripción completa

Detalles Bibliográficos
Autor principal: Ginn, Helen M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: International Union of Crystallography 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8025884/
https://www.ncbi.nlm.nih.gov/pubmed/33825703
http://dx.doi.org/10.1107/S2059798321000826
_version_ 1783675574498099200
author Ginn, Helen M.
author_facet Ginn, Helen M.
author_sort Ginn, Helen M.
collection PubMed
description Structural biology methods have delivered over 150 000 high-resolution structures of macromolecules, which have fundamentally altered our understanding of biology and our approach to developing new medicines. However, the description of molecular flexibility is instrinsically flawed and in almost all cases, regardless of the experimental method used for structure determination, there remains a strong overfitting bias during molecular model building and refinement. In the worst case this can lead to wholly incorrect structures and thus incorrect biological interpretations. Here, by reparametrizing the description of these complex structures in terms of bonds rather than atomic positions, and by modelling flexibility using a deterministic ensemble of structures, it is demonstrated that structures can be described using fewer parameters than in conventional refinement. The current implementation, applied to X-ray diffraction data, significantly reduces the extent of overfitting, allowing the experimental data to reveal more biological information in electron-density maps.
format Online
Article
Text
id pubmed-8025884
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher International Union of Crystallography
record_format MEDLINE/PubMed
spelling pubmed-80258842021-04-30 Vagabond: bond-based parametrization reduces overfitting for refinement of proteins Ginn, Helen M. Acta Crystallogr D Struct Biol Ccp4 Structural biology methods have delivered over 150 000 high-resolution structures of macromolecules, which have fundamentally altered our understanding of biology and our approach to developing new medicines. However, the description of molecular flexibility is instrinsically flawed and in almost all cases, regardless of the experimental method used for structure determination, there remains a strong overfitting bias during molecular model building and refinement. In the worst case this can lead to wholly incorrect structures and thus incorrect biological interpretations. Here, by reparametrizing the description of these complex structures in terms of bonds rather than atomic positions, and by modelling flexibility using a deterministic ensemble of structures, it is demonstrated that structures can be described using fewer parameters than in conventional refinement. The current implementation, applied to X-ray diffraction data, significantly reduces the extent of overfitting, allowing the experimental data to reveal more biological information in electron-density maps. International Union of Crystallography 2021-03-30 /pmc/articles/PMC8025884/ /pubmed/33825703 http://dx.doi.org/10.1107/S2059798321000826 Text en © Ginn 2021 http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.http://creativecommons.org/licenses/by/4.0/
spellingShingle Ccp4
Ginn, Helen M.
Vagabond: bond-based parametrization reduces overfitting for refinement of proteins
title Vagabond: bond-based parametrization reduces overfitting for refinement of proteins
title_full Vagabond: bond-based parametrization reduces overfitting for refinement of proteins
title_fullStr Vagabond: bond-based parametrization reduces overfitting for refinement of proteins
title_full_unstemmed Vagabond: bond-based parametrization reduces overfitting for refinement of proteins
title_short Vagabond: bond-based parametrization reduces overfitting for refinement of proteins
title_sort vagabond: bond-based parametrization reduces overfitting for refinement of proteins
topic Ccp4
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8025884/
https://www.ncbi.nlm.nih.gov/pubmed/33825703
http://dx.doi.org/10.1107/S2059798321000826
work_keys_str_mv AT ginnhelenm vagabondbondbasedparametrizationreducesoverfittingforrefinementofproteins