Cargando…
A Gibbs sampling strategy applied to the mapping of ambiguous short-sequence tags
Motivation: Chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) is widely used in biological research. ChIP-seq experiments yield many ambiguous tags that can be mapped with equal probability to multiple genomic sites. Such ambiguous tags are typically eliminated from con...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2951085/ https://www.ncbi.nlm.nih.gov/pubmed/20871106 http://dx.doi.org/10.1093/bioinformatics/btq460 |
_version_ | 1782187683675635712 |
---|---|
author | Wang, Jianrong Huda, Ahsan Lunyak, Victoria V. Jordan, I. King |
author_facet | Wang, Jianrong Huda, Ahsan Lunyak, Victoria V. Jordan, I. King |
author_sort | Wang, Jianrong |
collection | PubMed |
description | Motivation: Chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) is widely used in biological research. ChIP-seq experiments yield many ambiguous tags that can be mapped with equal probability to multiple genomic sites. Such ambiguous tags are typically eliminated from consideration resulting in a potential loss of important biological information. Results: We have developed a Gibbs sampling-based algorithm for the genomic mapping of ambiguous sequence tags. Our algorithm relies on the local genomic tag context to guide the mapping of ambiguous tags. The Gibbs sampling procedure we use simultaneously maps ambiguous tags and updates the probabilities used to infer correct tag map positions. We show that our algorithm is able to correctly map more ambiguous tags than existing mapping methods. Our approach is also able to uncover mapped genomic sites from highly repetitive sequences that can not be detected based on unique tags alone, including transposable elements, segmental duplications and peri-centromeric regions. This mapping approach should prove to be useful for increasing biological knowledge on the too often neglected repetitive genomic regions. Availability: http://esbg.gatech.edu/jordan/software/map Contact: king.jordan@biology.gatech.edu Supplementary Information: Supplementary data are available at Bioinformatics online. |
format | Text |
id | pubmed-2951085 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-29510852010-10-08 A Gibbs sampling strategy applied to the mapping of ambiguous short-sequence tags Wang, Jianrong Huda, Ahsan Lunyak, Victoria V. Jordan, I. King Bioinformatics Original Paper Motivation: Chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) is widely used in biological research. ChIP-seq experiments yield many ambiguous tags that can be mapped with equal probability to multiple genomic sites. Such ambiguous tags are typically eliminated from consideration resulting in a potential loss of important biological information. Results: We have developed a Gibbs sampling-based algorithm for the genomic mapping of ambiguous sequence tags. Our algorithm relies on the local genomic tag context to guide the mapping of ambiguous tags. The Gibbs sampling procedure we use simultaneously maps ambiguous tags and updates the probabilities used to infer correct tag map positions. We show that our algorithm is able to correctly map more ambiguous tags than existing mapping methods. Our approach is also able to uncover mapped genomic sites from highly repetitive sequences that can not be detected based on unique tags alone, including transposable elements, segmental duplications and peri-centromeric regions. This mapping approach should prove to be useful for increasing biological knowledge on the too often neglected repetitive genomic regions. Availability: http://esbg.gatech.edu/jordan/software/map Contact: king.jordan@biology.gatech.edu Supplementary Information: Supplementary data are available at Bioinformatics online. Oxford University Press 2010-10-15 2010-09-24 /pmc/articles/PMC2951085/ /pubmed/20871106 http://dx.doi.org/10.1093/bioinformatics/btq460 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Paper Wang, Jianrong Huda, Ahsan Lunyak, Victoria V. Jordan, I. King A Gibbs sampling strategy applied to the mapping of ambiguous short-sequence tags |
title | A Gibbs sampling strategy applied to the mapping of ambiguous short-sequence tags |
title_full | A Gibbs sampling strategy applied to the mapping of ambiguous short-sequence tags |
title_fullStr | A Gibbs sampling strategy applied to the mapping of ambiguous short-sequence tags |
title_full_unstemmed | A Gibbs sampling strategy applied to the mapping of ambiguous short-sequence tags |
title_short | A Gibbs sampling strategy applied to the mapping of ambiguous short-sequence tags |
title_sort | gibbs sampling strategy applied to the mapping of ambiguous short-sequence tags |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2951085/ https://www.ncbi.nlm.nih.gov/pubmed/20871106 http://dx.doi.org/10.1093/bioinformatics/btq460 |
work_keys_str_mv | AT wangjianrong agibbssamplingstrategyappliedtothemappingofambiguousshortsequencetags AT hudaahsan agibbssamplingstrategyappliedtothemappingofambiguousshortsequencetags AT lunyakvictoriav agibbssamplingstrategyappliedtothemappingofambiguousshortsequencetags AT jordaniking agibbssamplingstrategyappliedtothemappingofambiguousshortsequencetags AT wangjianrong gibbssamplingstrategyappliedtothemappingofambiguousshortsequencetags AT hudaahsan gibbssamplingstrategyappliedtothemappingofambiguousshortsequencetags AT lunyakvictoriav gibbssamplingstrategyappliedtothemappingofambiguousshortsequencetags AT jordaniking gibbssamplingstrategyappliedtothemappingofambiguousshortsequencetags |