A Compositional Model to Predict the Aggregated Isotope Distribution for Average DNA and RNA Oligonucleotides

Bibliographic Details
Main authors: Agten, Annelies; Prostko, Piotr; Geubbelmans, Melvin; Liu, Youzhong; De Vijlder, Thomas; Valkenborg, Dirk
Format: Online Article Text
Language: English
Published: MDPI 2021
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8234063/
https://www.ncbi.nlm.nih.gov/pubmed/34207227
http://dx.doi.org/10.3390/metabo11060400
author Agten, Annelies
Prostko, Piotr
Geubbelmans, Melvin
Liu, Youzhong
De Vijlder, Thomas
Valkenborg, Dirk
collection PubMed
description Structural modifications of DNA and RNA molecules play a pivotal role in epigenetic and posttranscriptional regulation. To characterise these modifications, a growing number of MS and MS/MS-based tools for the analysis of nucleic acids are being developed. To identify an oligonucleotide in a mass spectrum, it is useful to compare the observed isotope pattern of the molecule of interest with the one that is theoretically expected from its elemental composition. However, this is not straightforward when the identity of the molecule under investigation is unknown. Here, we present a modelling approach for predicting the aggregated isotope distribution of an average DNA or RNA molecule when a particular (monoisotopic) mass is available. For this purpose, a theoretical database of all possible DNA/RNA oligonucleotides up to a mass of 25 kDa is created, and the aggregated isotope distribution for the entire database of oligonucleotides is generated using the BRAIN algorithm. Since this isotope information is compositional in nature, the modelling method is based on the additive log-ratio analysis of Aitchison. As a result, a univariate weighted polynomial regression model of order 10 is fitted to predict the first 20 isotope peaks for DNA and RNA molecules. The performance of the prediction model is assessed on experimental data using a mean squared error approach and a modified Pearson’s χ² goodness-of-fit measure. Our analysis indicated that the variability in spectral accuracy contributed more to the errors than the approximation of the theoretical isotope distribution by our proposed average DNA/RNA model. The prediction model is implemented as an online tool. An R function can be downloaded to incorporate the method in custom analysis workflows to process mass spectral data.
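The additive log-ratio step named in the description can be illustrated with a short sketch. This is not the authors' R function: the names `alr` and `alr_inverse`, the choice of the last peak as the reference part, and the example intensities are all assumptions made for illustration only. The idea is that a vector of relative isotope peak intensities (which must sum to one) is mapped to unconstrained log-ratios, where a regression could be fitted, and then mapped back to a valid distribution.

```python
import math

def alr(composition, ref=-1):
    """Additive log-ratio transform: log of each part divided by a reference part.

    `ref` is the index of the reference part (assumed here to be the last one).
    """
    total = sum(composition)
    parts = [p / total for p in composition]  # close the composition to sum 1
    idx = ref % len(parts)
    denom = parts[idx]
    # The reference part itself is dropped (its log-ratio is always 0).
    return [math.log(p / denom) for i, p in enumerate(parts) if i != idx]

def alr_inverse(coords, ref=-1):
    """Map ALR coordinates back to a composition that sums to 1."""
    idx = ref % (len(coords) + 1)
    expd = [math.exp(c) for c in coords]
    expd.insert(idx, 1.0)  # exp(0) for the reference part
    total = sum(expd)
    return [e / total for e in expd]

# Hypothetical relative intensities for four isotope peaks (not from the paper).
peaks = [0.50, 0.30, 0.15, 0.05]
coords = alr(peaks)          # three unconstrained real numbers
restored = alr_inverse(coords)
```

Because the inverse transform always returns positive values that sum to one, any prediction made in log-ratio space corresponds to a valid isotope distribution after back-transformation.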
format Online
Article
Text
id pubmed-8234063
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-8234063 2021-06-27 A Compositional Model to Predict the Aggregated Isotope Distribution for Average DNA and RNA Oligonucleotides. Agten, Annelies; Prostko, Piotr; Geubbelmans, Melvin; Liu, Youzhong; De Vijlder, Thomas; Valkenborg, Dirk. Metabolites, Article. MDPI 2021-06-18 /pmc/articles/PMC8234063/ /pubmed/34207227 http://dx.doi.org/10.3390/metabo11060400 Text en © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
title A Compositional Model to Predict the Aggregated Isotope Distribution for Average DNA and RNA Oligonucleotides
topic Article