Cargando…

Detection of dispersed short tandem repeats using reversible jump Markov chain Monte Carlo

Tandem repeats occur frequently in biological sequences. They are important for studying genome evolution and human disease. A number of methods have been designed to detect a single tandem repeat in a sliding window. In this article, we focus on the case that an unknown number of tandem repeat segm...

Descripción completa

Detalles Bibliográficos
Autores principales: Liang, Tong, Fan, Xiaodan, Li, Qiwei, Li, Shuo-yen R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3479165/
https://www.ncbi.nlm.nih.gov/pubmed/22753023
http://dx.doi.org/10.1093/nar/gks644
_version_ 1782247417121341440
author Liang, Tong
Fan, Xiaodan
Li, Qiwei
Li, Shuo-yen R.
author_facet Liang, Tong
Fan, Xiaodan
Li, Qiwei
Li, Shuo-yen R.
author_sort Liang, Tong
collection PubMed
description Tandem repeats occur frequently in biological sequences. They are important for studying genome evolution and human disease. A number of methods have been designed to detect a single tandem repeat in a sliding window. In this article, we focus on the case that an unknown number of tandem repeat segments of the same pattern are dispersively distributed in a sequence. We construct a probabilistic generative model for the tandem repeats, where the sequence pattern is represented by a motif matrix. A Bayesian approach is adopted to compute this model. Markov chain Monte Carlo (MCMC) algorithms are used to explore the posterior distribution as an effort to infer both the motif matrix of tandem repeats and the location of repeat segments. Reversible jump Markov chain Monte Carlo (RJMCMC) algorithms are used to address the transdimensional model selection problem raised by the variable number of repeat segments. Experiments on both synthetic data and real data show that this new approach is powerful in detecting dispersed short tandem repeats. As far as we know, it is the first work to adopt RJMCMC algorithms in the detection of tandem repeats.
format Online
Article
Text
id pubmed-3479165
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-34791652012-10-24 Detection of dispersed short tandem repeats using reversible jump Markov chain Monte Carlo Liang, Tong Fan, Xiaodan Li, Qiwei Li, Shuo-yen R. Nucleic Acids Res Methods Online Tandem repeats occur frequently in biological sequences. They are important for studying genome evolution and human disease. A number of methods have been designed to detect a single tandem repeat in a sliding window. In this article, we focus on the case that an unknown number of tandem repeat segments of the same pattern are dispersively distributed in a sequence. We construct a probabilistic generative model for the tandem repeats, where the sequence pattern is represented by a motif matrix. A Bayesian approach is adopted to compute this model. Markov chain Monte Carlo (MCMC) algorithms are used to explore the posterior distribution as an effort to infer both the motif matrix of tandem repeats and the location of repeat segments. Reversible jump Markov chain Monte Carlo (RJMCMC) algorithms are used to address the transdimensional model selection problem raised by the variable number of repeat segments. Experiments on both synthetic data and real data show that this new approach is powerful in detecting dispersed short tandem repeats. As far as we know, it is the first work to adopt RJMCMC algorithms in the detection of tandem repeats. Oxford University Press 2012-10 2012-06-28 /pmc/articles/PMC3479165/ /pubmed/22753023 http://dx.doi.org/10.1093/nar/gks644 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Liang, Tong
Fan, Xiaodan
Li, Qiwei
Li, Shuo-yen R.
Detection of dispersed short tandem repeats using reversible jump Markov chain Monte Carlo
title Detection of dispersed short tandem repeats using reversible jump Markov chain Monte Carlo
title_full Detection of dispersed short tandem repeats using reversible jump Markov chain Monte Carlo
title_fullStr Detection of dispersed short tandem repeats using reversible jump Markov chain Monte Carlo
title_full_unstemmed Detection of dispersed short tandem repeats using reversible jump Markov chain Monte Carlo
title_short Detection of dispersed short tandem repeats using reversible jump Markov chain Monte Carlo
title_sort detection of dispersed short tandem repeats using reversible jump markov chain monte carlo
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3479165/
https://www.ncbi.nlm.nih.gov/pubmed/22753023
http://dx.doi.org/10.1093/nar/gks644
work_keys_str_mv AT liangtong detectionofdispersedshorttandemrepeatsusingreversiblejumpmarkovchainmontecarlo
AT fanxiaodan detectionofdispersedshorttandemrepeatsusingreversiblejumpmarkovchainmontecarlo
AT liqiwei detectionofdispersedshorttandemrepeatsusingreversiblejumpmarkovchainmontecarlo
AT lishuoyenr detectionofdispersedshorttandemrepeatsusingreversiblejumpmarkovchainmontecarlo