Exploring the Cooccurrence Patterns of Multiple Sets of Genomic Intervals
Background. Exploring the spatial relationship of different genomic features has been of great interest since the early days of genomic research. The relationship sometimes provides useful information for understanding certain biological processes. Recent advances in high-throughput technologies suc...
Main Authors: | Wu, Hao; Qin, Zhaohui S. |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Hindawi Publishing Corporation 2013 |
Subjects: | Research Article |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3679813/ https://www.ncbi.nlm.nih.gov/pubmed/23781505 http://dx.doi.org/10.1155/2013/617545 |
_version_ | 1782273022379425792 |
---|---|
author | Wu, Hao Qin, Zhaohui S. |
collection | PubMed |
description | Background. Exploring the spatial relationship of different genomic features has been of great interest since the early days of genomic research. The relationship sometimes provides useful information for understanding certain biological processes. Recent advances in high-throughput technologies such as ChIP-seq produce large amounts of data in the form of genomic intervals. Most existing methods for assessing spatial relationships among the intervals are designed for pairwise comparison and cannot be easily scaled up. Results. We present a statistical method and software tool to characterize the cooccurrence patterns of multiple sets of genomic intervals. The occurrences of genomic intervals are described by a simple finite mixture model, where each component represents a distinct cooccurrence pattern. The model parameters are estimated via an EM algorithm and can be viewed as sufficient statistics of the cooccurrence patterns. Simulation and real data results show that the model can accurately capture the patterns and provide biologically meaningful results. The method is implemented in a freely available R package, giClust. Conclusions. The method and the software provide a convenient way for biologists to explore the cooccurrence patterns among a relatively large number of sets of genomic intervals. |
format | Online Article Text |
id | pubmed-3679813 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-36798132013-06-18 Exploring the Cooccurrence Patterns of Multiple Sets of Genomic Intervals Wu, Hao Qin, Zhaohui S. Biomed Res Int Research Article Background. Exploring the spatial relationship of different genomic features has been of great interest since the early days of genomic research. The relationship sometimes provides useful information for understanding certain biological processes. Recent advances in high-throughput technologies such as ChIP-seq produce large amounts of data in the form of genomic intervals. Most existing methods for assessing spatial relationships among the intervals are designed for pairwise comparison and cannot be easily scaled up. Results. We present a statistical method and software tool to characterize the cooccurrence patterns of multiple sets of genomic intervals. The occurrences of genomic intervals are described by a simple finite mixture model, where each component represents a distinct cooccurrence pattern. The model parameters are estimated via an EM algorithm and can be viewed as sufficient statistics of the cooccurrence patterns. Simulation and real data results show that the model can accurately capture the patterns and provide biologically meaningful results. The method is implemented in a freely available R package, giClust. Conclusions. The method and the software provide a convenient way for biologists to explore the cooccurrence patterns among a relatively large number of sets of genomic intervals. Hindawi Publishing Corporation 2013 2013-05-28 /pmc/articles/PMC3679813/ /pubmed/23781505 http://dx.doi.org/10.1155/2013/617545 Text en Copyright © 2013 H. Wu and Z. S. Qin. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
title | Exploring the Cooccurrence Patterns of Multiple Sets of Genomic Intervals |
topic | Research Article |