Cargando…
Machine learning deciphers structural features of RNA duplexes measured with solution X-ray scattering
Macromolecular structures can be determined from solution X-ray scattering. Small-angle X-ray scattering (SAXS) provides global structural information on length scales of 10s to 100s of Ångstroms, and many algorithms are available to convert SAXS data into low-resolution structural envelopes. Extens...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
International Union of Crystallography
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7467162/ https://www.ncbi.nlm.nih.gov/pubmed/32939279 http://dx.doi.org/10.1107/S2052252520008830 |
_version_ | 1783577959296139264 |
---|---|
author | Chen, Yen-Lin Pollack, Lois |
author_facet | Chen, Yen-Lin Pollack, Lois |
author_sort | Chen, Yen-Lin |
collection | PubMed |
description | Macromolecular structures can be determined from solution X-ray scattering. Small-angle X-ray scattering (SAXS) provides global structural information on length scales of 10s to 100s of Ångstroms, and many algorithms are available to convert SAXS data into low-resolution structural envelopes. Extension of measurements to wider scattering angles (WAXS or wide-angle X-ray scattering) can sharpen the resolution to below 10 Å, filling in structural details that can be critical for biological function. These WAXS profiles are especially challenging to interpret because of the significant contribution of solvent in addition to solute on these smaller length scales. Based on training with molecular dynamics generated models, the application of extreme gradient boosting (XGBoost) is discussed, which is a supervised machine learning (ML) approach to interpret features in solution scattering profiles. These ML methods are applied to predict key structural parameters of double-stranded ribonucleic acid (dsRNA) duplexes. Duplex conformations vary with salt and sequence and directly impact the foldability of functional RNA molecules. The strong structural periodicities in these duplexes yield scattering profiles with rich sets of features at intermediate-to-wide scattering angles. In the ML models, these profiles are treated as 1D images or features. These ML models identify specific scattering angles, or regions of scattering angles, which correspond with and successfully predict distinct structural parameters. Thus, this work demonstrates that ML strategies can integrate theoretical molecular models with experimental solution scattering data, providing a new framework for extracting highly relevant structural information from solution experiments on biological macromolecules. |
format | Online Article Text |
id | pubmed-7467162 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | International Union of Crystallography |
record_format | MEDLINE/PubMed |
spelling | pubmed-74671622020-09-15 Machine learning deciphers structural features of RNA duplexes measured with solution X-ray scattering Chen, Yen-Lin Pollack, Lois IUCrJ Research Papers Macromolecular structures can be determined from solution X-ray scattering. Small-angle X-ray scattering (SAXS) provides global structural information on length scales of 10s to 100s of Ångstroms, and many algorithms are available to convert SAXS data into low-resolution structural envelopes. Extension of measurements to wider scattering angles (WAXS or wide-angle X-ray scattering) can sharpen the resolution to below 10 Å, filling in structural details that can be critical for biological function. These WAXS profiles are especially challenging to interpret because of the significant contribution of solvent in addition to solute on these smaller length scales. Based on training with molecular dynamics generated models, the application of extreme gradient boosting (XGBoost) is discussed, which is a supervised machine learning (ML) approach to interpret features in solution scattering profiles. These ML methods are applied to predict key structural parameters of double-stranded ribonucleic acid (dsRNA) duplexes. Duplex conformations vary with salt and sequence and directly impact the foldability of functional RNA molecules. The strong structural periodicities in these duplexes yield scattering profiles with rich sets of features at intermediate-to-wide scattering angles. In the ML models, these profiles are treated as 1D images or features. These ML models identify specific scattering angles, or regions of scattering angles, which correspond with and successfully predict distinct structural parameters. Thus, this work demonstrates that ML strategies can integrate theoretical molecular models with experimental solution scattering data, providing a new framework for extracting highly relevant structural information from solution experiments on biological macromolecules. International Union of Crystallography 2020-08-12 /pmc/articles/PMC7467162/ /pubmed/32939279 http://dx.doi.org/10.1107/S2052252520008830 Text en © Chen and Pollack 2020 http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Research Papers Chen, Yen-Lin Pollack, Lois Machine learning deciphers structural features of RNA duplexes measured with solution X-ray scattering |
title | Machine learning deciphers structural features of RNA duplexes measured with solution X-ray scattering |
title_full | Machine learning deciphers structural features of RNA duplexes measured with solution X-ray scattering |
title_fullStr | Machine learning deciphers structural features of RNA duplexes measured with solution X-ray scattering |
title_full_unstemmed | Machine learning deciphers structural features of RNA duplexes measured with solution X-ray scattering |
title_short | Machine learning deciphers structural features of RNA duplexes measured with solution X-ray scattering |
title_sort | machine learning deciphers structural features of rna duplexes measured with solution x-ray scattering |
topic | Research Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7467162/ https://www.ncbi.nlm.nih.gov/pubmed/32939279 http://dx.doi.org/10.1107/S2052252520008830 |
work_keys_str_mv | AT chenyenlin machinelearningdeciphersstructuralfeaturesofrnaduplexesmeasuredwithsolutionxrayscattering AT pollacklois machinelearningdeciphersstructuralfeaturesofrnaduplexesmeasuredwithsolutionxrayscattering |