Fish the ChIPs: a pipeline for automated genomic annotation of ChIP-Seq data

BACKGROUND: High-throughput sequencing is generating massive amounts of data at a pace that far exceeds the throughput of data analysis routines. Here we introduce Fish the ChIPs (FC), a computational pipeline aimed at a broad range of users and designed to perform complete ChIP-Seq data analys...

Bibliographic Details
Main authors: Barozzi, Iros; Termanini, Alberto; Minucci, Saverio; Natoli, Gioacchino
Format: Online Article Text
Language: English
Published: BioMed Central 2011
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3201895/
https://www.ncbi.nlm.nih.gov/pubmed/21978789
http://dx.doi.org/10.1186/1745-6150-6-51
author Barozzi, Iros
Termanini, Alberto
Minucci, Saverio
Natoli, Gioacchino
collection PubMed
description BACKGROUND: High-throughput sequencing is generating massive amounts of data at a pace that far exceeds the throughput of data analysis routines. Here we introduce Fish the ChIPs (FC), a computational pipeline aimed at a broad range of users and designed to perform complete ChIP-Seq data analysis of an unlimited number of samples, thus increasing throughput and reproducibility while saving time. RESULTS: Starting from short read sequences, FC performs the following steps: 1) quality control, 2) alignment to a reference genome, 3) peak calling, 4) genomic annotation, 5) generation of raw signal tracks for visualization on the UCSC and IGV genome browsers. FC exploits some of the fastest and most effective tools currently available. Installation on a Mac platform requires only very basic computational skills, while configuration and usage are supported by a user-friendly graphical user interface. Alternatively, FC can be compiled from the source code on any Unix machine and then run with the possibility of customizing every parameter through a simple configuration text file that can be generated using a dedicated user-friendly web form. In terms of execution time, FC can be run on a desktop machine, although a computer cluster is recommended for analyses of large batches of data. FC is well suited to data coming from Illumina Solexa Genome Analyzers or ABI SOLiD, and its usage can potentially be extended to any sequencing platform. CONCLUSIONS: Compared to existing tools, FC has two main advantages that make it suitable for a broad range of users. First, it can be installed and run by wet-lab biologists on a Mac machine. Second, it can handle an unlimited number of samples, which makes it convenient for large analyses. In this context, computational biologists can increase the reproducibility of their ChIP-Seq data analyses while saving time for downstream analyses.
REVIEWERS: This article was reviewed by Gavin Huttley, George Shpakovski and Sarah Teichmann.
format Online
Article
Text
id pubmed-3201895
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-3201895 2011-10-26 Fish the ChIPs: a pipeline for automated genomic annotation of ChIP-Seq data Barozzi, Iros; Termanini, Alberto; Minucci, Saverio; Natoli, Gioacchino Biol Direct Research BioMed Central 2011-10-06 /pmc/articles/PMC3201895/ /pubmed/21978789 http://dx.doi.org/10.1186/1745-6150-6-51 Text en Copyright ©2011 Barozzi et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title Fish the ChIPs: a pipeline for automated genomic annotation of ChIP-Seq data
topic Research