Cargando…

A Poisson mixture model to identify changes in RNA polymerase II binding quantity using high-throughput sequencing technology

We present a mixture model-based analysis for identifying differences in the distribution of RNA polymerase II (Pol II) in transcribed regions, measured using ChIP-seq (chromatin immunoprecipitation following massively parallel sequencing technology). The statistical model assumes that the number of...

Descripción completa

Detalles Bibliográficos
Autores principales: Feng, Weixing, Liu, Yunlong, Wu, Jiejun, Nephew, Kenneth P, Huang, Tim HM, Li, Lang
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2559888/
https://www.ncbi.nlm.nih.gov/pubmed/18831789
http://dx.doi.org/10.1186/1471-2164-9-S2-S23
_version_ 1782159687125303296
author Feng, Weixing
Liu, Yunlong
Wu, Jiejun
Nephew, Kenneth P
Huang, Tim HM
Li, Lang
author_facet Feng, Weixing
Liu, Yunlong
Wu, Jiejun
Nephew, Kenneth P
Huang, Tim HM
Li, Lang
author_sort Feng, Weixing
collection PubMed
description We present a mixture model-based analysis for identifying differences in the distribution of RNA polymerase II (Pol II) in transcribed regions, measured using ChIP-seq (chromatin immunoprecipitation following massively parallel sequencing technology). The statistical model assumes that the number of Pol II-targeted sequences contained within each genomic region follows a Poisson distribution. A Poisson mixture model was then developed to distinguish Pol II binding changes in transcribed region using an empirical approach and an expectation-maximization (EM) algorithm developed for estimation and inference. In order to achieve a global maximum in the M-step, a particle swarm optimization (PSO) was implemented. We applied this model to Pol II binding data generated from hormone-dependent MCF7 breast cancer cells and antiestrogen-resistant MCF7 breast cancer cells before and after treatment with 17β-estradiol (E2). We determined that in the hormone-dependent cells, ~9.9% (2527) genes showed significant changes in Pol II binding after E2 treatment. However, only ~0.7% (172) genes displayed significant Pol II binding changes in E2-treated antiestrogen-resistant cells. These results show that a Poisson mixture model can be used to analyze ChIP-seq data.
format Text
id pubmed-2559888
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-25598882008-10-04 A Poisson mixture model to identify changes in RNA polymerase II binding quantity using high-throughput sequencing technology Feng, Weixing Liu, Yunlong Wu, Jiejun Nephew, Kenneth P Huang, Tim HM Li, Lang BMC Genomics Research We present a mixture model-based analysis for identifying differences in the distribution of RNA polymerase II (Pol II) in transcribed regions, measured using ChIP-seq (chromatin immunoprecipitation following massively parallel sequencing technology). The statistical model assumes that the number of Pol II-targeted sequences contained within each genomic region follows a Poisson distribution. A Poisson mixture model was then developed to distinguish Pol II binding changes in transcribed region using an empirical approach and an expectation-maximization (EM) algorithm developed for estimation and inference. In order to achieve a global maximum in the M-step, a particle swarm optimization (PSO) was implemented. We applied this model to Pol II binding data generated from hormone-dependent MCF7 breast cancer cells and antiestrogen-resistant MCF7 breast cancer cells before and after treatment with 17β-estradiol (E2). We determined that in the hormone-dependent cells, ~9.9% (2527) genes showed significant changes in Pol II binding after E2 treatment. However, only ~0.7% (172) genes displayed significant Pol II binding changes in E2-treated antiestrogen-resistant cells. These results show that a Poisson mixture model can be used to analyze ChIP-seq data. BioMed Central 2008-09-16 /pmc/articles/PMC2559888/ /pubmed/18831789 http://dx.doi.org/10.1186/1471-2164-9-S2-S23 Text en Copyright © 2008 Feng et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Feng, Weixing
Liu, Yunlong
Wu, Jiejun
Nephew, Kenneth P
Huang, Tim HM
Li, Lang
A Poisson mixture model to identify changes in RNA polymerase II binding quantity using high-throughput sequencing technology
title A Poisson mixture model to identify changes in RNA polymerase II binding quantity using high-throughput sequencing technology
title_full A Poisson mixture model to identify changes in RNA polymerase II binding quantity using high-throughput sequencing technology
title_fullStr A Poisson mixture model to identify changes in RNA polymerase II binding quantity using high-throughput sequencing technology
title_full_unstemmed A Poisson mixture model to identify changes in RNA polymerase II binding quantity using high-throughput sequencing technology
title_short A Poisson mixture model to identify changes in RNA polymerase II binding quantity using high-throughput sequencing technology
title_sort poisson mixture model to identify changes in rna polymerase ii binding quantity using high-throughput sequencing technology
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2559888/
https://www.ncbi.nlm.nih.gov/pubmed/18831789
http://dx.doi.org/10.1186/1471-2164-9-S2-S23
work_keys_str_mv AT fengweixing apoissonmixturemodeltoidentifychangesinrnapolymeraseiibindingquantityusinghighthroughputsequencingtechnology
AT liuyunlong apoissonmixturemodeltoidentifychangesinrnapolymeraseiibindingquantityusinghighthroughputsequencingtechnology
AT wujiejun apoissonmixturemodeltoidentifychangesinrnapolymeraseiibindingquantityusinghighthroughputsequencingtechnology
AT nephewkennethp apoissonmixturemodeltoidentifychangesinrnapolymeraseiibindingquantityusinghighthroughputsequencingtechnology
AT huangtimhm apoissonmixturemodeltoidentifychangesinrnapolymeraseiibindingquantityusinghighthroughputsequencingtechnology
AT lilang apoissonmixturemodeltoidentifychangesinrnapolymeraseiibindingquantityusinghighthroughputsequencingtechnology
AT fengweixing poissonmixturemodeltoidentifychangesinrnapolymeraseiibindingquantityusinghighthroughputsequencingtechnology
AT liuyunlong poissonmixturemodeltoidentifychangesinrnapolymeraseiibindingquantityusinghighthroughputsequencingtechnology
AT wujiejun poissonmixturemodeltoidentifychangesinrnapolymeraseiibindingquantityusinghighthroughputsequencingtechnology
AT nephewkennethp poissonmixturemodeltoidentifychangesinrnapolymeraseiibindingquantityusinghighthroughputsequencingtechnology
AT huangtimhm poissonmixturemodeltoidentifychangesinrnapolymeraseiibindingquantityusinghighthroughputsequencingtechnology
AT lilang poissonmixturemodeltoidentifychangesinrnapolymeraseiibindingquantityusinghighthroughputsequencingtechnology