Cargando…
Hypergeometric analysis of tiling-array and sequence data: detection and interpretation of peaks
Probing protein-deoxyribonucleic acid (DNA) is gaining popularity as it sheds light on molecular mechanisms that regulate the expression of genes. Currently, tiling-arrays and next-generation sequencing technology can be used to measure these interactions. Both methods generate a signal over the gen...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Dove Medical Press
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3810201/ https://www.ncbi.nlm.nih.gov/pubmed/24187504 http://dx.doi.org/10.2147/AABC.S51271 |
_version_ | 1782288758939320320 |
---|---|
author | Taskesen, Erdogan Hoogeboezem, Remco Delwel, Ruud Reinders, Marcel JT |
author_facet | Taskesen, Erdogan Hoogeboezem, Remco Delwel, Ruud Reinders, Marcel JT |
author_sort | Taskesen, Erdogan |
collection | PubMed |
description | Probing protein-deoxyribonucleic acid (DNA) is gaining popularity as it sheds light on molecular mechanisms that regulate the expression of genes. Currently, tiling-arrays and next-generation sequencing technology can be used to measure these interactions. Both methods generate a signal over the genome in which contiguous regions of peaks on the genome represent the presence of an interacting molecule. Many methods do exist to identify functional regions of interest (ROIs) on the genome. However the detection of ROIs are often not an end-point in research questions and it therefore requires data dragging between tools to relate the ROIs to information present in databases, such as gene-ontology, pathway information, or enrichment of certain genomic content. We introduce hypergeometric analysis of tiling-array and sequence data (HATSEQ), a powerful tool that accurately identifies functional ROIs on the genome where a genomic signal significantly deviates from the general genome-wide behavior. HATSEQ also includes a number of built-in post-analyses with which biological meaning can be attached to the detected ROIs in terms of gene pathways and de-novo motif analysis, and provides different visualizations and statistical summaries for the detected ROIs. In addition, HATSEQ has an intuitive graphic user interface that lowers the barrier for researchers to analyze their data without the need of scripting languages. We compared the results of HATSEQ against two other popular chromatin immunoprecipitation sequencing (ChIP-Seq) methods and observed overlap in the detected ROIs but HATSEQ is more specific in delineating the peak boundaries. We also discuss the versatility of HATSEQ by using a Signal Transducer and Activator of Transcription 1 (STAT1) ChIP-Seq data-set, and show that the detected ROIs are highly specific for the expected STAT1 binding motif. HATSEQ is freely available at: http://hema13.erasmusmc.nl/index.php/HATSEQ. |
format | Online Article Text |
id | pubmed-3810201 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Dove Medical Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-38102012013-11-01 Hypergeometric analysis of tiling-array and sequence data: detection and interpretation of peaks Taskesen, Erdogan Hoogeboezem, Remco Delwel, Ruud Reinders, Marcel JT Adv Appl Bioinform Chem Original Research Probing protein-deoxyribonucleic acid (DNA) is gaining popularity as it sheds light on molecular mechanisms that regulate the expression of genes. Currently, tiling-arrays and next-generation sequencing technology can be used to measure these interactions. Both methods generate a signal over the genome in which contiguous regions of peaks on the genome represent the presence of an interacting molecule. Many methods do exist to identify functional regions of interest (ROIs) on the genome. However the detection of ROIs are often not an end-point in research questions and it therefore requires data dragging between tools to relate the ROIs to information present in databases, such as gene-ontology, pathway information, or enrichment of certain genomic content. We introduce hypergeometric analysis of tiling-array and sequence data (HATSEQ), a powerful tool that accurately identifies functional ROIs on the genome where a genomic signal significantly deviates from the general genome-wide behavior. HATSEQ also includes a number of built-in post-analyses with which biological meaning can be attached to the detected ROIs in terms of gene pathways and de-novo motif analysis, and provides different visualizations and statistical summaries for the detected ROIs. In addition, HATSEQ has an intuitive graphic user interface that lowers the barrier for researchers to analyze their data without the need of scripting languages. We compared the results of HATSEQ against two other popular chromatin immunoprecipitation sequencing (ChIP-Seq) methods and observed overlap in the detected ROIs but HATSEQ is more specific in delineating the peak boundaries. We also discuss the versatility of HATSEQ by using a Signal Transducer and Activator of Transcription 1 (STAT1) ChIP-Seq data-set, and show that the detected ROIs are highly specific for the expected STAT1 binding motif. HATSEQ is freely available at: http://hema13.erasmusmc.nl/index.php/HATSEQ. Dove Medical Press 2013-10-25 /pmc/articles/PMC3810201/ /pubmed/24187504 http://dx.doi.org/10.2147/AABC.S51271 Text en © 2013 Taskesen et al. This work is published by Dove Medical Press Ltd, and licensed under Creative Commons Attribution – Non Commercial (unported, v3.0) License The full terms of the License are available at http://creativecommons.org/licenses/by-nc/3.0/. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Ltd, provided the work is properly attributed. |
spellingShingle | Original Research Taskesen, Erdogan Hoogeboezem, Remco Delwel, Ruud Reinders, Marcel JT Hypergeometric analysis of tiling-array and sequence data: detection and interpretation of peaks |
title | Hypergeometric analysis of tiling-array and sequence data: detection and interpretation of peaks |
title_full | Hypergeometric analysis of tiling-array and sequence data: detection and interpretation of peaks |
title_fullStr | Hypergeometric analysis of tiling-array and sequence data: detection and interpretation of peaks |
title_full_unstemmed | Hypergeometric analysis of tiling-array and sequence data: detection and interpretation of peaks |
title_short | Hypergeometric analysis of tiling-array and sequence data: detection and interpretation of peaks |
title_sort | hypergeometric analysis of tiling-array and sequence data: detection and interpretation of peaks |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3810201/ https://www.ncbi.nlm.nih.gov/pubmed/24187504 http://dx.doi.org/10.2147/AABC.S51271 |
work_keys_str_mv | AT taskesenerdogan hypergeometricanalysisoftilingarrayandsequencedatadetectionandinterpretationofpeaks AT hoogeboezemremco hypergeometricanalysisoftilingarrayandsequencedatadetectionandinterpretationofpeaks AT delwelruud hypergeometricanalysisoftilingarrayandsequencedatadetectionandinterpretationofpeaks AT reindersmarceljt hypergeometricanalysisoftilingarrayandsequencedatadetectionandinterpretationofpeaks |