Cargando…
Detection of dispersed short tandem repeats using reversible jump Markov chain Monte Carlo
Tandem repeats occur frequently in biological sequences. They are important for studying genome evolution and human disease. A number of methods have been designed to detect a single tandem repeat in a sliding window. In this article, we focus on the case that an unknown number of tandem repeat segm...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3479165/ https://www.ncbi.nlm.nih.gov/pubmed/22753023 http://dx.doi.org/10.1093/nar/gks644 |
_version_ | 1782247417121341440 |
---|---|
author | Liang, Tong Fan, Xiaodan Li, Qiwei Li, Shuo-yen R. |
author_facet | Liang, Tong Fan, Xiaodan Li, Qiwei Li, Shuo-yen R. |
author_sort | Liang, Tong |
collection | PubMed |
description | Tandem repeats occur frequently in biological sequences. They are important for studying genome evolution and human disease. A number of methods have been designed to detect a single tandem repeat in a sliding window. In this article, we focus on the case that an unknown number of tandem repeat segments of the same pattern are dispersively distributed in a sequence. We construct a probabilistic generative model for the tandem repeats, where the sequence pattern is represented by a motif matrix. A Bayesian approach is adopted to compute this model. Markov chain Monte Carlo (MCMC) algorithms are used to explore the posterior distribution as an effort to infer both the motif matrix of tandem repeats and the location of repeat segments. Reversible jump Markov chain Monte Carlo (RJMCMC) algorithms are used to address the transdimensional model selection problem raised by the variable number of repeat segments. Experiments on both synthetic data and real data show that this new approach is powerful in detecting dispersed short tandem repeats. As far as we know, it is the first work to adopt RJMCMC algorithms in the detection of tandem repeats. |
format | Online Article Text |
id | pubmed-3479165 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-34791652012-10-24 Detection of dispersed short tandem repeats using reversible jump Markov chain Monte Carlo Liang, Tong Fan, Xiaodan Li, Qiwei Li, Shuo-yen R. Nucleic Acids Res Methods Online Tandem repeats occur frequently in biological sequences. They are important for studying genome evolution and human disease. A number of methods have been designed to detect a single tandem repeat in a sliding window. In this article, we focus on the case that an unknown number of tandem repeat segments of the same pattern are dispersively distributed in a sequence. We construct a probabilistic generative model for the tandem repeats, where the sequence pattern is represented by a motif matrix. A Bayesian approach is adopted to compute this model. Markov chain Monte Carlo (MCMC) algorithms are used to explore the posterior distribution as an effort to infer both the motif matrix of tandem repeats and the location of repeat segments. Reversible jump Markov chain Monte Carlo (RJMCMC) algorithms are used to address the transdimensional model selection problem raised by the variable number of repeat segments. Experiments on both synthetic data and real data show that this new approach is powerful in detecting dispersed short tandem repeats. As far as we know, it is the first work to adopt RJMCMC algorithms in the detection of tandem repeats. Oxford University Press 2012-10 2012-06-28 /pmc/articles/PMC3479165/ /pubmed/22753023 http://dx.doi.org/10.1093/nar/gks644 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methods Online Liang, Tong Fan, Xiaodan Li, Qiwei Li, Shuo-yen R. Detection of dispersed short tandem repeats using reversible jump Markov chain Monte Carlo |
title | Detection of dispersed short tandem repeats using reversible jump Markov chain Monte Carlo |
title_full | Detection of dispersed short tandem repeats using reversible jump Markov chain Monte Carlo |
title_fullStr | Detection of dispersed short tandem repeats using reversible jump Markov chain Monte Carlo |
title_full_unstemmed | Detection of dispersed short tandem repeats using reversible jump Markov chain Monte Carlo |
title_short | Detection of dispersed short tandem repeats using reversible jump Markov chain Monte Carlo |
title_sort | detection of dispersed short tandem repeats using reversible jump markov chain monte carlo |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3479165/ https://www.ncbi.nlm.nih.gov/pubmed/22753023 http://dx.doi.org/10.1093/nar/gks644 |
work_keys_str_mv | AT liangtong detectionofdispersedshorttandemrepeatsusingreversiblejumpmarkovchainmontecarlo AT fanxiaodan detectionofdispersedshorttandemrepeatsusingreversiblejumpmarkovchainmontecarlo AT liqiwei detectionofdispersedshorttandemrepeatsusingreversiblejumpmarkovchainmontecarlo AT lishuoyenr detectionofdispersedshorttandemrepeatsusingreversiblejumpmarkovchainmontecarlo |