Cargando…

A Gibbs sampling strategy applied to the mapping of ambiguous short-sequence tags

Motivation: Chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) is widely used in biological research. ChIP-seq experiments yield many ambiguous tags that can be mapped with equal probability to multiple genomic sites. Such ambiguous tags are typically eliminated from con...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Jianrong, Huda, Ahsan, Lunyak, Victoria V., Jordan, I. King
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2951085/
https://www.ncbi.nlm.nih.gov/pubmed/20871106
http://dx.doi.org/10.1093/bioinformatics/btq460
_version_ 1782187683675635712
author Wang, Jianrong
Huda, Ahsan
Lunyak, Victoria V.
Jordan, I. King
author_facet Wang, Jianrong
Huda, Ahsan
Lunyak, Victoria V.
Jordan, I. King
author_sort Wang, Jianrong
collection PubMed
description Motivation: Chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) is widely used in biological research. ChIP-seq experiments yield many ambiguous tags that can be mapped with equal probability to multiple genomic sites. Such ambiguous tags are typically eliminated from consideration resulting in a potential loss of important biological information. Results: We have developed a Gibbs sampling-based algorithm for the genomic mapping of ambiguous sequence tags. Our algorithm relies on the local genomic tag context to guide the mapping of ambiguous tags. The Gibbs sampling procedure we use simultaneously maps ambiguous tags and updates the probabilities used to infer correct tag map positions. We show that our algorithm is able to correctly map more ambiguous tags than existing mapping methods. Our approach is also able to uncover mapped genomic sites from highly repetitive sequences that can not be detected based on unique tags alone, including transposable elements, segmental duplications and peri-centromeric regions. This mapping approach should prove to be useful for increasing biological knowledge on the too often neglected repetitive genomic regions. Availability: http://esbg.gatech.edu/jordan/software/map Contact: king.jordan@biology.gatech.edu Supplementary Information: Supplementary data are available at Bioinformatics online.
format Text
id pubmed-2951085
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-29510852010-10-08 A Gibbs sampling strategy applied to the mapping of ambiguous short-sequence tags Wang, Jianrong Huda, Ahsan Lunyak, Victoria V. Jordan, I. King Bioinformatics Original Paper Motivation: Chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) is widely used in biological research. ChIP-seq experiments yield many ambiguous tags that can be mapped with equal probability to multiple genomic sites. Such ambiguous tags are typically eliminated from consideration resulting in a potential loss of important biological information. Results: We have developed a Gibbs sampling-based algorithm for the genomic mapping of ambiguous sequence tags. Our algorithm relies on the local genomic tag context to guide the mapping of ambiguous tags. The Gibbs sampling procedure we use simultaneously maps ambiguous tags and updates the probabilities used to infer correct tag map positions. We show that our algorithm is able to correctly map more ambiguous tags than existing mapping methods. Our approach is also able to uncover mapped genomic sites from highly repetitive sequences that can not be detected based on unique tags alone, including transposable elements, segmental duplications and peri-centromeric regions. This mapping approach should prove to be useful for increasing biological knowledge on the too often neglected repetitive genomic regions. Availability: http://esbg.gatech.edu/jordan/software/map Contact: king.jordan@biology.gatech.edu Supplementary Information: Supplementary data are available at Bioinformatics online. Oxford University Press 2010-10-15 2010-09-24 /pmc/articles/PMC2951085/ /pubmed/20871106 http://dx.doi.org/10.1093/bioinformatics/btq460 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Wang, Jianrong
Huda, Ahsan
Lunyak, Victoria V.
Jordan, I. King
A Gibbs sampling strategy applied to the mapping of ambiguous short-sequence tags
title A Gibbs sampling strategy applied to the mapping of ambiguous short-sequence tags
title_full A Gibbs sampling strategy applied to the mapping of ambiguous short-sequence tags
title_fullStr A Gibbs sampling strategy applied to the mapping of ambiguous short-sequence tags
title_full_unstemmed A Gibbs sampling strategy applied to the mapping of ambiguous short-sequence tags
title_short A Gibbs sampling strategy applied to the mapping of ambiguous short-sequence tags
title_sort gibbs sampling strategy applied to the mapping of ambiguous short-sequence tags
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2951085/
https://www.ncbi.nlm.nih.gov/pubmed/20871106
http://dx.doi.org/10.1093/bioinformatics/btq460
work_keys_str_mv AT wangjianrong agibbssamplingstrategyappliedtothemappingofambiguousshortsequencetags
AT hudaahsan agibbssamplingstrategyappliedtothemappingofambiguousshortsequencetags
AT lunyakvictoriav agibbssamplingstrategyappliedtothemappingofambiguousshortsequencetags
AT jordaniking agibbssamplingstrategyappliedtothemappingofambiguousshortsequencetags
AT wangjianrong gibbssamplingstrategyappliedtothemappingofambiguousshortsequencetags
AT hudaahsan gibbssamplingstrategyappliedtothemappingofambiguousshortsequencetags
AT lunyakvictoriav gibbssamplingstrategyappliedtothemappingofambiguousshortsequencetags
AT jordaniking gibbssamplingstrategyappliedtothemappingofambiguousshortsequencetags