Cargando…

Machine learning deciphers structural features of RNA duplexes measured with solution X-ray scattering

Macromolecular structures can be determined from solution X-ray scattering. Small-angle X-ray scattering (SAXS) provides global structural information on length scales of 10s to 100s of Ångstroms, and many algorithms are available to convert SAXS data into low-resolution structural envelopes. Extens...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Yen-Lin, Pollack, Lois
Formato: Online Artículo Texto
Lenguaje:English
Publicado: International Union of Crystallography 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7467162/
https://www.ncbi.nlm.nih.gov/pubmed/32939279
http://dx.doi.org/10.1107/S2052252520008830
_version_ 1783577959296139264
author Chen, Yen-Lin
Pollack, Lois
author_facet Chen, Yen-Lin
Pollack, Lois
author_sort Chen, Yen-Lin
collection PubMed
description Macromolecular structures can be determined from solution X-ray scattering. Small-angle X-ray scattering (SAXS) provides global structural information on length scales of 10s to 100s of Ångstroms, and many algorithms are available to convert SAXS data into low-resolution structural envelopes. Extension of measurements to wider scattering angles (WAXS or wide-angle X-ray scattering) can sharpen the resolution to below 10 Å, filling in structural details that can be critical for biological function. These WAXS profiles are especially challenging to interpret because of the significant contribution of solvent in addition to solute on these smaller length scales. Based on training with molecular dynamics generated models, the application of extreme gradient boosting (XGBoost) is discussed, which is a supervised machine learning (ML) approach to interpret features in solution scattering profiles. These ML methods are applied to predict key structural parameters of double-stranded ribonucleic acid (dsRNA) duplexes. Duplex conformations vary with salt and sequence and directly impact the foldability of functional RNA molecules. The strong structural periodicities in these duplexes yield scattering profiles with rich sets of features at intermediate-to-wide scattering angles. In the ML models, these profiles are treated as 1D images or features. These ML models identify specific scattering angles, or regions of scattering angles, which correspond with and successfully predict distinct structural parameters. Thus, this work demonstrates that ML strategies can integrate theoretical molecular models with experimental solution scattering data, providing a new framework for extracting highly relevant structural information from solution experiments on biological macromolecules.
format Online
Article
Text
id pubmed-7467162
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher International Union of Crystallography
record_format MEDLINE/PubMed
spelling pubmed-74671622020-09-15 Machine learning deciphers structural features of RNA duplexes measured with solution X-ray scattering Chen, Yen-Lin Pollack, Lois IUCrJ Research Papers Macromolecular structures can be determined from solution X-ray scattering. Small-angle X-ray scattering (SAXS) provides global structural information on length scales of 10s to 100s of Ångstroms, and many algorithms are available to convert SAXS data into low-resolution structural envelopes. Extension of measurements to wider scattering angles (WAXS or wide-angle X-ray scattering) can sharpen the resolution to below 10 Å, filling in structural details that can be critical for biological function. These WAXS profiles are especially challenging to interpret because of the significant contribution of solvent in addition to solute on these smaller length scales. Based on training with molecular dynamics generated models, the application of extreme gradient boosting (XGBoost) is discussed, which is a supervised machine learning (ML) approach to interpret features in solution scattering profiles. These ML methods are applied to predict key structural parameters of double-stranded ribonucleic acid (dsRNA) duplexes. Duplex conformations vary with salt and sequence and directly impact the foldability of functional RNA molecules. The strong structural periodicities in these duplexes yield scattering profiles with rich sets of features at intermediate-to-wide scattering angles. In the ML models, these profiles are treated as 1D images or features. These ML models identify specific scattering angles, or regions of scattering angles, which correspond with and successfully predict distinct structural parameters. Thus, this work demonstrates that ML strategies can integrate theoretical molecular models with experimental solution scattering data, providing a new framework for extracting highly relevant structural information from solution experiments on biological macromolecules. International Union of Crystallography 2020-08-12 /pmc/articles/PMC7467162/ /pubmed/32939279 http://dx.doi.org/10.1107/S2052252520008830 Text en © Chen and Pollack 2020 http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.http://creativecommons.org/licenses/by/4.0/
spellingShingle Research Papers
Chen, Yen-Lin
Pollack, Lois
Machine learning deciphers structural features of RNA duplexes measured with solution X-ray scattering
title Machine learning deciphers structural features of RNA duplexes measured with solution X-ray scattering
title_full Machine learning deciphers structural features of RNA duplexes measured with solution X-ray scattering
title_fullStr Machine learning deciphers structural features of RNA duplexes measured with solution X-ray scattering
title_full_unstemmed Machine learning deciphers structural features of RNA duplexes measured with solution X-ray scattering
title_short Machine learning deciphers structural features of RNA duplexes measured with solution X-ray scattering
title_sort machine learning deciphers structural features of rna duplexes measured with solution x-ray scattering
topic Research Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7467162/
https://www.ncbi.nlm.nih.gov/pubmed/32939279
http://dx.doi.org/10.1107/S2052252520008830
work_keys_str_mv AT chenyenlin machinelearningdeciphersstructuralfeaturesofrnaduplexesmeasuredwithsolutionxrayscattering
AT pollacklois machinelearningdeciphersstructuralfeaturesofrnaduplexesmeasuredwithsolutionxrayscattering