Cargando…

The R-factor gap in macromolecular crystallography: an untapped potential for insights on accurate structures

In macromolecular crystallography, the agreement between observed and predicted structure factors (R(cryst) and R(free)) is seldom better than 20%. This is much larger than the estimate of experimental error (R(merge)). The difference between R(cryst) and R(merge) is the R-factor gap. There is no su...

Descripción completa

Detalles Bibliográficos
Autores principales: Holton, James M, Classen, Scott, Frankel, Kenneth A, Tainer, John A
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BlackWell Publishing Ltd 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4282448/
https://www.ncbi.nlm.nih.gov/pubmed/25040949
http://dx.doi.org/10.1111/febs.12922
Descripción
Sumario:In macromolecular crystallography, the agreement between observed and predicted structure factors (R(cryst) and R(free)) is seldom better than 20%. This is much larger than the estimate of experimental error (R(merge)). The difference between R(cryst) and R(merge) is the R-factor gap. There is no such gap in small-molecule crystallography, for which calculated structure factors are generally considered more accurate than the experimental measurements. Perhaps the true noise level of macromolecular data is higher than expected? Or is the gap caused by inaccurate phases that trap refined models in local minima? By generating simulated diffraction patterns using the program MLFSOM, and including every conceivable source of experimental error, we show that neither is the case. Processing our simulated data yielded values that were indistinguishable from those of real data for all crystallographic statistics except the final R(cryst) and R(free). These values decreased to 3.8% and 5.5% for simulated data, suggesting that the reason for high R-factors in macromolecular crystallography is neither experimental error nor phase bias, but rather an underlying inadequacy in the models used to explain our observations. The present inability to accurately represent the entire macromolecule with both its flexibility and its protein-solvent interface may be improved by synergies between small-angle X-ray scattering, computational chemistry and crystallography. The exciting implication of our finding is that macromolecular data contain substantial hidden and untapped potential to resolve ambiguities in the true nature of the nanoscale, a task that the second century of crystallography promises to fulfill. DATABASE: Coordinates and structure factors for the real data have been submitted to the Protein Data Bank under accession 4tws.