Cargando…

Detecting and accounting for multiple sources of positional variance in peak list registration analysis and spin system grouping

Peak lists derived from nuclear magnetic resonance (NMR) spectra are commonly used as input data for a variety of computer assisted and automated analyses. These include automated protein resonance assignment and protein structure calculation software tools. Prior to these analyses, peak lists must...

Descripción completa

Detalles Bibliográficos
Autores principales: Smelter, Andrey, Rouchka, Eric C., Moseley, Hunter N. B.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Netherlands 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5587626/
https://www.ncbi.nlm.nih.gov/pubmed/28815397
http://dx.doi.org/10.1007/s10858-017-0126-5
_version_ 1783262023456391168
author Smelter, Andrey
Rouchka, Eric C.
Moseley, Hunter N. B.
author_facet Smelter, Andrey
Rouchka, Eric C.
Moseley, Hunter N. B.
author_sort Smelter, Andrey
collection PubMed
description Peak lists derived from nuclear magnetic resonance (NMR) spectra are commonly used as input data for a variety of computer assisted and automated analyses. These include automated protein resonance assignment and protein structure calculation software tools. Prior to these analyses, peak lists must be aligned to each other and sets of related peaks must be grouped based on common chemical shift dimensions. Even when programs can perform peak grouping, they require the user to provide uniform match tolerances or use default values. However, peak grouping is further complicated by multiple sources of variance in peak position limiting the effectiveness of grouping methods that utilize uniform match tolerances. In addition, no method currently exists for deriving peak positional variances from single peak lists for grouping peaks into spin systems, i.e. spin system grouping within a single peak list. Therefore, we developed a complementary pair of peak list registration analysis and spin system grouping algorithms designed to overcome these limitations. We have implemented these algorithms into an approach that can identify multiple dimension-specific positional variances that exist in a single peak list and group peaks from a single peak list into spin systems. The resulting software tools generate a variety of useful statistics on both a single peak list and pairwise peak list alignment, especially for quality assessment of peak list datasets. We used a range of low and high quality experimental solution NMR and solid-state NMR peak lists to assess performance of our registration analysis and grouping algorithms. Analyses show that an algorithm using a single iteration and uniform match tolerances approach is only able to recover from 50 to 80% of the spin systems due to the presence of multiple sources of variance. Our algorithm recovers additional spin systems by reevaluating match tolerances in multiple iterations. To facilitate evaluation of the algorithms, we developed a peak list simulator within our nmrstarlib package that generates user-defined assigned peak lists from a given BMRB entry or database of entries. In addition, over 100,000 simulated peak lists with one or two sources of variance were generated to evaluate the performance and robustness of these new registration analysis and peak grouping algorithms. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s10858-017-0126-5) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5587626
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Springer Netherlands
record_format MEDLINE/PubMed
spelling pubmed-55876262017-09-22 Detecting and accounting for multiple sources of positional variance in peak list registration analysis and spin system grouping Smelter, Andrey Rouchka, Eric C. Moseley, Hunter N. B. J Biomol NMR Article Peak lists derived from nuclear magnetic resonance (NMR) spectra are commonly used as input data for a variety of computer assisted and automated analyses. These include automated protein resonance assignment and protein structure calculation software tools. Prior to these analyses, peak lists must be aligned to each other and sets of related peaks must be grouped based on common chemical shift dimensions. Even when programs can perform peak grouping, they require the user to provide uniform match tolerances or use default values. However, peak grouping is further complicated by multiple sources of variance in peak position limiting the effectiveness of grouping methods that utilize uniform match tolerances. In addition, no method currently exists for deriving peak positional variances from single peak lists for grouping peaks into spin systems, i.e. spin system grouping within a single peak list. Therefore, we developed a complementary pair of peak list registration analysis and spin system grouping algorithms designed to overcome these limitations. We have implemented these algorithms into an approach that can identify multiple dimension-specific positional variances that exist in a single peak list and group peaks from a single peak list into spin systems. The resulting software tools generate a variety of useful statistics on both a single peak list and pairwise peak list alignment, especially for quality assessment of peak list datasets. We used a range of low and high quality experimental solution NMR and solid-state NMR peak lists to assess performance of our registration analysis and grouping algorithms. Analyses show that an algorithm using a single iteration and uniform match tolerances approach is only able to recover from 50 to 80% of the spin systems due to the presence of multiple sources of variance. Our algorithm recovers additional spin systems by reevaluating match tolerances in multiple iterations. To facilitate evaluation of the algorithms, we developed a peak list simulator within our nmrstarlib package that generates user-defined assigned peak lists from a given BMRB entry or database of entries. In addition, over 100,000 simulated peak lists with one or two sources of variance were generated to evaluate the performance and robustness of these new registration analysis and peak grouping algorithms. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s10858-017-0126-5) contains supplementary material, which is available to authorized users. Springer Netherlands 2017-08-16 2017 /pmc/articles/PMC5587626/ /pubmed/28815397 http://dx.doi.org/10.1007/s10858-017-0126-5 Text en © The Author(s) 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Article
Smelter, Andrey
Rouchka, Eric C.
Moseley, Hunter N. B.
Detecting and accounting for multiple sources of positional variance in peak list registration analysis and spin system grouping
title Detecting and accounting for multiple sources of positional variance in peak list registration analysis and spin system grouping
title_full Detecting and accounting for multiple sources of positional variance in peak list registration analysis and spin system grouping
title_fullStr Detecting and accounting for multiple sources of positional variance in peak list registration analysis and spin system grouping
title_full_unstemmed Detecting and accounting for multiple sources of positional variance in peak list registration analysis and spin system grouping
title_short Detecting and accounting for multiple sources of positional variance in peak list registration analysis and spin system grouping
title_sort detecting and accounting for multiple sources of positional variance in peak list registration analysis and spin system grouping
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5587626/
https://www.ncbi.nlm.nih.gov/pubmed/28815397
http://dx.doi.org/10.1007/s10858-017-0126-5
work_keys_str_mv AT smelterandrey detectingandaccountingformultiplesourcesofpositionalvarianceinpeaklistregistrationanalysisandspinsystemgrouping
AT rouchkaericc detectingandaccountingformultiplesourcesofpositionalvarianceinpeaklistregistrationanalysisandspinsystemgrouping
AT moseleyhunternb detectingandaccountingformultiplesourcesofpositionalvarianceinpeaklistregistrationanalysisandspinsystemgrouping