Cargando…

Benchmarking and refining probability-based models for nucleosome-DNA interaction

BACKGROUND: In investigations of nucleosome positioning preferences, a model that assigns an affinity to a given sequence is necessary to make predictions. One important class of models, which treats a nucleosome sequence as a Markov chain, has been applied with success when informed with experiment...

Descripción completa

Detalles Bibliográficos
Autores principales: Tompitak, Marco, Barkema, Gerard T., Schiessel, Helmut
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5341481/
https://www.ncbi.nlm.nih.gov/pubmed/28270095
http://dx.doi.org/10.1186/s12859-017-1569-0
_version_ 1782513000281800704
author Tompitak, Marco
Barkema, Gerard T.
Schiessel, Helmut
author_facet Tompitak, Marco
Barkema, Gerard T.
Schiessel, Helmut
author_sort Tompitak, Marco
collection PubMed
description BACKGROUND: In investigations of nucleosome positioning preferences, a model that assigns an affinity to a given sequence is necessary to make predictions. One important class of models, which treats a nucleosome sequence as a Markov chain, has been applied with success when informed with experimentally measured nucleosomal sequence preferences. RESULTS: We find that we can also use such models as a fast approximative scheme for computationally expensive biophysical models, vastly increasing their reach. Employing these models in this way also allows us to benchmark them for the first time. Doing so for the approximative in silico models indirectly tells us about the accuracy we can expect of them when applied to real data. CONCLUSION: We find that models presented in the literature should perform well, but this performance depends on factors such as the order of the Markov model, the preprocessing of the probability distributions on which the model is based, and the size and quality of the sequence ensemble from which those distributions are calculated. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1569-0) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5341481
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-53414812017-03-10 Benchmarking and refining probability-based models for nucleosome-DNA interaction Tompitak, Marco Barkema, Gerard T. Schiessel, Helmut BMC Bioinformatics Methodology Article BACKGROUND: In investigations of nucleosome positioning preferences, a model that assigns an affinity to a given sequence is necessary to make predictions. One important class of models, which treats a nucleosome sequence as a Markov chain, has been applied with success when informed with experimentally measured nucleosomal sequence preferences. RESULTS: We find that we can also use such models as a fast approximative scheme for computationally expensive biophysical models, vastly increasing their reach. Employing these models in this way also allows us to benchmark them for the first time. Doing so for the approximative in silico models indirectly tells us about the accuracy we can expect of them when applied to real data. CONCLUSION: We find that models presented in the literature should perform well, but this performance depends on factors such as the order of the Markov model, the preprocessing of the probability distributions on which the model is based, and the size and quality of the sequence ensemble from which those distributions are calculated. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1569-0) contains supplementary material, which is available to authorized users. BioMed Central 2017-03-07 /pmc/articles/PMC5341481/ /pubmed/28270095 http://dx.doi.org/10.1186/s12859-017-1569-0 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Tompitak, Marco
Barkema, Gerard T.
Schiessel, Helmut
Benchmarking and refining probability-based models for nucleosome-DNA interaction
title Benchmarking and refining probability-based models for nucleosome-DNA interaction
title_full Benchmarking and refining probability-based models for nucleosome-DNA interaction
title_fullStr Benchmarking and refining probability-based models for nucleosome-DNA interaction
title_full_unstemmed Benchmarking and refining probability-based models for nucleosome-DNA interaction
title_short Benchmarking and refining probability-based models for nucleosome-DNA interaction
title_sort benchmarking and refining probability-based models for nucleosome-dna interaction
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5341481/
https://www.ncbi.nlm.nih.gov/pubmed/28270095
http://dx.doi.org/10.1186/s12859-017-1569-0
work_keys_str_mv AT tompitakmarco benchmarkingandrefiningprobabilitybasedmodelsfornucleosomednainteraction
AT barkemagerardt benchmarkingandrefiningprobabilitybasedmodelsfornucleosomednainteraction
AT schiesselhelmut benchmarkingandrefiningprobabilitybasedmodelsfornucleosomednainteraction