Cargando…

A statistical learning approach to the modeling of chromatographic retention of oligonucleotides incorporating sequence and secondary structure data

We propose a new model for predicting the retention time of oligonucleotides. The model is based on ν support vector regression using features derived from base sequence and predicted secondary structure of oligonucleotides. Because of the secondary structure information, the model is applicable eve...

Descripción completa

Detalles Bibliográficos
Autores principales: Sturm, Marc, Quinten, Sascha, Huber, Christian G., Kohlbacher, Oliver
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1919494/
https://www.ncbi.nlm.nih.gov/pubmed/17567619
http://dx.doi.org/10.1093/nar/gkm338
_version_ 1782134181700042752
author Sturm, Marc
Quinten, Sascha
Huber, Christian G.
Kohlbacher, Oliver
author_facet Sturm, Marc
Quinten, Sascha
Huber, Christian G.
Kohlbacher, Oliver
author_sort Sturm, Marc
collection PubMed
description We propose a new model for predicting the retention time of oligonucleotides. The model is based on ν support vector regression using features derived from base sequence and predicted secondary structure of oligonucleotides. Because of the secondary structure information, the model is applicable even at relatively low temperatures where the secondary structure is not suppressed by thermal denaturing. This makes the prediction of oligonucleotide retention time for arbitrary temperatures possible, provided that the target temperature lies within the temperature range of the training data. We describe different possibilities of feature calculation from base sequence and secondary structure, present the results and compare our model to existing models.
format Text
id pubmed-1919494
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-19194942007-07-24 A statistical learning approach to the modeling of chromatographic retention of oligonucleotides incorporating sequence and secondary structure data Sturm, Marc Quinten, Sascha Huber, Christian G. Kohlbacher, Oliver Nucleic Acids Res Computational Biology We propose a new model for predicting the retention time of oligonucleotides. The model is based on ν support vector regression using features derived from base sequence and predicted secondary structure of oligonucleotides. Because of the secondary structure information, the model is applicable even at relatively low temperatures where the secondary structure is not suppressed by thermal denaturing. This makes the prediction of oligonucleotide retention time for arbitrary temperatures possible, provided that the target temperature lies within the temperature range of the training data. We describe different possibilities of feature calculation from base sequence and secondary structure, present the results and compare our model to existing models. Oxford University Press 2007-06 2007-06-13 /pmc/articles/PMC1919494/ /pubmed/17567619 http://dx.doi.org/10.1093/nar/gkm338 Text en © 2007 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Computational Biology
Sturm, Marc
Quinten, Sascha
Huber, Christian G.
Kohlbacher, Oliver
A statistical learning approach to the modeling of chromatographic retention of oligonucleotides incorporating sequence and secondary structure data
title A statistical learning approach to the modeling of chromatographic retention of oligonucleotides incorporating sequence and secondary structure data
title_full A statistical learning approach to the modeling of chromatographic retention of oligonucleotides incorporating sequence and secondary structure data
title_fullStr A statistical learning approach to the modeling of chromatographic retention of oligonucleotides incorporating sequence and secondary structure data
title_full_unstemmed A statistical learning approach to the modeling of chromatographic retention of oligonucleotides incorporating sequence and secondary structure data
title_short A statistical learning approach to the modeling of chromatographic retention of oligonucleotides incorporating sequence and secondary structure data
title_sort statistical learning approach to the modeling of chromatographic retention of oligonucleotides incorporating sequence and secondary structure data
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1919494/
https://www.ncbi.nlm.nih.gov/pubmed/17567619
http://dx.doi.org/10.1093/nar/gkm338
work_keys_str_mv AT sturmmarc astatisticallearningapproachtothemodelingofchromatographicretentionofoligonucleotidesincorporatingsequenceandsecondarystructuredata
AT quintensascha astatisticallearningapproachtothemodelingofchromatographicretentionofoligonucleotidesincorporatingsequenceandsecondarystructuredata
AT huberchristiang astatisticallearningapproachtothemodelingofchromatographicretentionofoligonucleotidesincorporatingsequenceandsecondarystructuredata
AT kohlbacheroliver astatisticallearningapproachtothemodelingofchromatographicretentionofoligonucleotidesincorporatingsequenceandsecondarystructuredata
AT sturmmarc statisticallearningapproachtothemodelingofchromatographicretentionofoligonucleotidesincorporatingsequenceandsecondarystructuredata
AT quintensascha statisticallearningapproachtothemodelingofchromatographicretentionofoligonucleotidesincorporatingsequenceandsecondarystructuredata
AT huberchristiang statisticallearningapproachtothemodelingofchromatographicretentionofoligonucleotidesincorporatingsequenceandsecondarystructuredata
AT kohlbacheroliver statisticallearningapproachtothemodelingofchromatographicretentionofoligonucleotidesincorporatingsequenceandsecondarystructuredata