Cargando…

Hypergeometric analysis of tiling-array and sequence data: detection and interpretation of peaks

Probing protein-deoxyribonucleic acid (DNA) is gaining popularity as it sheds light on molecular mechanisms that regulate the expression of genes. Currently, tiling-arrays and next-generation sequencing technology can be used to measure these interactions. Both methods generate a signal over the gen...

Descripción completa

Detalles Bibliográficos
Autores principales: Taskesen, Erdogan, Hoogeboezem, Remco, Delwel, Ruud, Reinders, Marcel JT
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Dove Medical Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3810201/
https://www.ncbi.nlm.nih.gov/pubmed/24187504
http://dx.doi.org/10.2147/AABC.S51271
_version_ 1782288758939320320
author Taskesen, Erdogan
Hoogeboezem, Remco
Delwel, Ruud
Reinders, Marcel JT
author_facet Taskesen, Erdogan
Hoogeboezem, Remco
Delwel, Ruud
Reinders, Marcel JT
author_sort Taskesen, Erdogan
collection PubMed
description Probing protein-deoxyribonucleic acid (DNA) is gaining popularity as it sheds light on molecular mechanisms that regulate the expression of genes. Currently, tiling-arrays and next-generation sequencing technology can be used to measure these interactions. Both methods generate a signal over the genome in which contiguous regions of peaks on the genome represent the presence of an interacting molecule. Many methods do exist to identify functional regions of interest (ROIs) on the genome. However the detection of ROIs are often not an end-point in research questions and it therefore requires data dragging between tools to relate the ROIs to information present in databases, such as gene-ontology, pathway information, or enrichment of certain genomic content. We introduce hypergeometric analysis of tiling-array and sequence data (HATSEQ), a powerful tool that accurately identifies functional ROIs on the genome where a genomic signal significantly deviates from the general genome-wide behavior. HATSEQ also includes a number of built-in post-analyses with which biological meaning can be attached to the detected ROIs in terms of gene pathways and de-novo motif analysis, and provides different visualizations and statistical summaries for the detected ROIs. In addition, HATSEQ has an intuitive graphic user interface that lowers the barrier for researchers to analyze their data without the need of scripting languages. We compared the results of HATSEQ against two other popular chromatin immunoprecipitation sequencing (ChIP-Seq) methods and observed overlap in the detected ROIs but HATSEQ is more specific in delineating the peak boundaries. We also discuss the versatility of HATSEQ by using a Signal Transducer and Activator of Transcription 1 (STAT1) ChIP-Seq data-set, and show that the detected ROIs are highly specific for the expected STAT1 binding motif. HATSEQ is freely available at: http://hema13.erasmusmc.nl/index.php/HATSEQ.
format Online
Article
Text
id pubmed-3810201
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Dove Medical Press
record_format MEDLINE/PubMed
spelling pubmed-38102012013-11-01 Hypergeometric analysis of tiling-array and sequence data: detection and interpretation of peaks Taskesen, Erdogan Hoogeboezem, Remco Delwel, Ruud Reinders, Marcel JT Adv Appl Bioinform Chem Original Research Probing protein-deoxyribonucleic acid (DNA) is gaining popularity as it sheds light on molecular mechanisms that regulate the expression of genes. Currently, tiling-arrays and next-generation sequencing technology can be used to measure these interactions. Both methods generate a signal over the genome in which contiguous regions of peaks on the genome represent the presence of an interacting molecule. Many methods do exist to identify functional regions of interest (ROIs) on the genome. However the detection of ROIs are often not an end-point in research questions and it therefore requires data dragging between tools to relate the ROIs to information present in databases, such as gene-ontology, pathway information, or enrichment of certain genomic content. We introduce hypergeometric analysis of tiling-array and sequence data (HATSEQ), a powerful tool that accurately identifies functional ROIs on the genome where a genomic signal significantly deviates from the general genome-wide behavior. HATSEQ also includes a number of built-in post-analyses with which biological meaning can be attached to the detected ROIs in terms of gene pathways and de-novo motif analysis, and provides different visualizations and statistical summaries for the detected ROIs. In addition, HATSEQ has an intuitive graphic user interface that lowers the barrier for researchers to analyze their data without the need of scripting languages. We compared the results of HATSEQ against two other popular chromatin immunoprecipitation sequencing (ChIP-Seq) methods and observed overlap in the detected ROIs but HATSEQ is more specific in delineating the peak boundaries. We also discuss the versatility of HATSEQ by using a Signal Transducer and Activator of Transcription 1 (STAT1) ChIP-Seq data-set, and show that the detected ROIs are highly specific for the expected STAT1 binding motif. HATSEQ is freely available at: http://hema13.erasmusmc.nl/index.php/HATSEQ. Dove Medical Press 2013-10-25 /pmc/articles/PMC3810201/ /pubmed/24187504 http://dx.doi.org/10.2147/AABC.S51271 Text en © 2013 Taskesen et al. This work is published by Dove Medical Press Ltd, and licensed under Creative Commons Attribution – Non Commercial (unported, v3.0) License The full terms of the License are available at http://creativecommons.org/licenses/by-nc/3.0/. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Ltd, provided the work is properly attributed.
spellingShingle Original Research
Taskesen, Erdogan
Hoogeboezem, Remco
Delwel, Ruud
Reinders, Marcel JT
Hypergeometric analysis of tiling-array and sequence data: detection and interpretation of peaks
title Hypergeometric analysis of tiling-array and sequence data: detection and interpretation of peaks
title_full Hypergeometric analysis of tiling-array and sequence data: detection and interpretation of peaks
title_fullStr Hypergeometric analysis of tiling-array and sequence data: detection and interpretation of peaks
title_full_unstemmed Hypergeometric analysis of tiling-array and sequence data: detection and interpretation of peaks
title_short Hypergeometric analysis of tiling-array and sequence data: detection and interpretation of peaks
title_sort hypergeometric analysis of tiling-array and sequence data: detection and interpretation of peaks
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3810201/
https://www.ncbi.nlm.nih.gov/pubmed/24187504
http://dx.doi.org/10.2147/AABC.S51271
work_keys_str_mv AT taskesenerdogan hypergeometricanalysisoftilingarrayandsequencedatadetectionandinterpretationofpeaks
AT hoogeboezemremco hypergeometricanalysisoftilingarrayandsequencedatadetectionandinterpretationofpeaks
AT delwelruud hypergeometricanalysisoftilingarrayandsequencedatadetectionandinterpretationofpeaks
AT reindersmarceljt hypergeometricanalysisoftilingarrayandsequencedatadetectionandinterpretationofpeaks