Exploring the Cooccurrence Patterns of Multiple Sets of Genomic Intervals

Background. Exploring the spatial relationship of different genomic features has been of great interest since the early days of genomic research. The relationship sometimes provides useful information for understanding certain biological processes. Recent advances in high-throughput technologies suc...

Bibliographic Details
Main authors: Wu, Hao; Qin, Zhaohui S.
Format: Online Article Text
Language: English
Published: Hindawi Publishing Corporation 2013
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3679813/
https://www.ncbi.nlm.nih.gov/pubmed/23781505
http://dx.doi.org/10.1155/2013/617545
author Wu, Hao
Qin, Zhaohui S.
collection PubMed
description Background. Exploring the spatial relationship of different genomic features has been of great interest since the early days of genomic research. The relationship sometimes provides useful information for understanding certain biological processes. Recent advances in high-throughput technologies such as ChIP-seq produce large amounts of data in the form of genomic intervals. Most of the existing methods for assessing spatial relationships among the intervals are designed for pairwise comparison and cannot be easily scaled up. Results. We present a statistical method and software tool to characterize the cooccurrence patterns of multiple sets of genomic intervals. The occurrences of genomic intervals are described by a simple finite mixture model, where each component represents a distinct cooccurrence pattern. The model parameters are estimated via an EM algorithm and can be viewed as sufficient statistics of the cooccurrence patterns. Simulation and real data results show that the model can accurately capture the patterns and provide biologically meaningful results. The method is implemented in a freely available R package giClust. Conclusions. The method and the software provide a convenient way for biologists to explore the cooccurrence patterns among a relatively large number of sets of genomic intervals.
format Online
Article
Text
id pubmed-3679813
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-3679813 2013-06-18 Exploring the Cooccurrence Patterns of Multiple Sets of Genomic Intervals Wu, Hao; Qin, Zhaohui S. Biomed Res Int Research Article Hindawi Publishing Corporation 2013 2013-05-28 /pmc/articles/PMC3679813/ /pubmed/23781505 http://dx.doi.org/10.1155/2013/617545 Text en Copyright © 2013 H. Wu and Z. S. Qin. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title Exploring the Cooccurrence Patterns of Multiple Sets of Genomic Intervals
topic Research Article
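
The abstract describes a finite mixture model fitted by EM, in which each component stands for one cooccurrence pattern. The record does not say which component distribution the authors use, so the sketch below is only an illustration under stated assumptions: each genomic region is encoded as a 0/1 vector (1 if an interval from a given set falls in that region), and each component is a product of independent Bernoulli distributions. The function name `fit_bernoulli_mixture` and the data layout are hypothetical and are not the giClust API.

```python
import math
import random


def fit_bernoulli_mixture(rows, n_components, n_iter=100, seed=0, eps=1e-6):
    """EM for a mixture of independent Bernoullis (an assumed model form).

    rows: list of equal-length 0/1 lists, one per genomic region
          (hypothetical encoding: column j is 1 if set j has an interval there).
    Returns (weights, probs, resp): mixing weights, per-component occurrence
    probabilities, and the responsibilities from the final E-step.
    """
    rng = random.Random(seed)
    n, d = len(rows), len(rows[0])
    weights = [1.0 / n_components] * n_components
    # Random starting probabilities, kept away from 0 and 1.
    probs = [[rng.uniform(0.25, 0.75) for _ in range(d)]
             for _ in range(n_components)]
    resp = []
    for _ in range(n_iter):
        # E-step: posterior probability of each component for each row,
        # computed in log space and normalised with the max-subtraction trick.
        resp = []
        for x in rows:
            logp = []
            for k in range(n_components):
                lp = math.log(weights[k])
                for j in range(d):
                    p = probs[k][j]
                    lp += math.log(p if x[j] else 1.0 - p)
                logp.append(lp)
            top = max(logp)
            unnorm = [math.exp(v - top) for v in logp]
            total = sum(unnorm)
            resp.append([u / total for u in unnorm])
        # M-step: re-estimate weights and occurrence probabilities.
        # eps keeps every weight positive and every probability inside (0, 1).
        for k in range(n_components):
            nk = sum(r[k] for r in resp) + eps
            weights[k] = nk / (n + n_components * eps)
            for j in range(d):
                num = sum(r[k] * x[j] for r, x in zip(resp, rows))
                probs[k][j] = min(max(num / nk, eps), 1.0 - eps)
    return weights, probs, resp


if __name__ == "__main__":
    # Toy data: two made-up patterns over four interval sets.
    toy = [[1, 1, 0, 0]] * 30 + [[0, 0, 1, 1]] * 20
    w, p, _ = fit_bernoulli_mixture(toy, n_components=2)
    for k in range(2):
        print(round(w[k], 3), [round(v, 3) for v in p[k]])
```

In this sketch the fitted `probs` rows play the role of the cooccurrence patterns and `weights` their relative frequencies; choosing the number of components and handling real interval overlaps are left out.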