Cargando…

RECLU: a pipeline to discover reproducible transcriptional start sites and their alternative regulation using capped analysis of gene expression (CAGE)

BACKGROUND: Next generation sequencing based technologies are being extensively used to study transcriptomes. Among these, cap analysis of gene expression (CAGE) is specialized in detecting the most 5’ ends of RNA molecules. After mapping the sequenced reads back to a reference genome CAGE data high...

Descripción completa

Detalles Bibliográficos
Autores principales: Ohmiya, Hiroko, Vitezic, Morana, Frith, Martin C, Itoh, Masayoshi, Carninci, Piero, Forrest, Alistair RR, Hayashizaki, Yoshihide, Lassmann, Timo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4029093/
https://www.ncbi.nlm.nih.gov/pubmed/24779366
http://dx.doi.org/10.1186/1471-2164-15-269
_version_ 1782317159306756096
author Ohmiya, Hiroko
Vitezic, Morana
Frith, Martin C
Itoh, Masayoshi
Carninci, Piero
Forrest, Alistair RR
Hayashizaki, Yoshihide
Lassmann, Timo
author_facet Ohmiya, Hiroko
Vitezic, Morana
Frith, Martin C
Itoh, Masayoshi
Carninci, Piero
Forrest, Alistair RR
Hayashizaki, Yoshihide
Lassmann, Timo
author_sort Ohmiya, Hiroko
collection PubMed
description BACKGROUND: Next generation sequencing based technologies are being extensively used to study transcriptomes. Among these, cap analysis of gene expression (CAGE) is specialized in detecting the most 5’ ends of RNA molecules. After mapping the sequenced reads back to a reference genome CAGE data highlights the transcriptional start sites (TSSs) and their usage at a single nucleotide resolution. RESULTS: We propose a pipeline to group the single nucleotide TSS into larger reproducible peaks and compare their usage across biological states. Importantly, our pipeline discovers broad peaks as well as the fine structure of individual transcriptional start sites embedded within them. We assess the performance of our approach on a large CAGE datasets including 156 primary cell types and two cell lines with biological replicas. We demonstrate that genes have complicated structures of transcription initiation events. In particular, we discover that narrow peaks embedded in broader regions of transcriptional activity can be differentially used even if the larger region is not. CONCLUSIONS: By examining the reproducible fine scaled organization of TSS we can detect many differentially regulated peaks undetected by previous approaches.
format Online
Article
Text
id pubmed-4029093
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-40290932014-06-04 RECLU: a pipeline to discover reproducible transcriptional start sites and their alternative regulation using capped analysis of gene expression (CAGE) Ohmiya, Hiroko Vitezic, Morana Frith, Martin C Itoh, Masayoshi Carninci, Piero Forrest, Alistair RR Hayashizaki, Yoshihide Lassmann, Timo BMC Genomics Software BACKGROUND: Next generation sequencing based technologies are being extensively used to study transcriptomes. Among these, cap analysis of gene expression (CAGE) is specialized in detecting the most 5’ ends of RNA molecules. After mapping the sequenced reads back to a reference genome CAGE data highlights the transcriptional start sites (TSSs) and their usage at a single nucleotide resolution. RESULTS: We propose a pipeline to group the single nucleotide TSS into larger reproducible peaks and compare their usage across biological states. Importantly, our pipeline discovers broad peaks as well as the fine structure of individual transcriptional start sites embedded within them. We assess the performance of our approach on a large CAGE datasets including 156 primary cell types and two cell lines with biological replicas. We demonstrate that genes have complicated structures of transcription initiation events. In particular, we discover that narrow peaks embedded in broader regions of transcriptional activity can be differentially used even if the larger region is not. CONCLUSIONS: By examining the reproducible fine scaled organization of TSS we can detect many differentially regulated peaks undetected by previous approaches. BioMed Central 2014-04-25 /pmc/articles/PMC4029093/ /pubmed/24779366 http://dx.doi.org/10.1186/1471-2164-15-269 Text en Copyright © 2014 Ohmiya et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Ohmiya, Hiroko
Vitezic, Morana
Frith, Martin C
Itoh, Masayoshi
Carninci, Piero
Forrest, Alistair RR
Hayashizaki, Yoshihide
Lassmann, Timo
RECLU: a pipeline to discover reproducible transcriptional start sites and their alternative regulation using capped analysis of gene expression (CAGE)
title RECLU: a pipeline to discover reproducible transcriptional start sites and their alternative regulation using capped analysis of gene expression (CAGE)
title_full RECLU: a pipeline to discover reproducible transcriptional start sites and their alternative regulation using capped analysis of gene expression (CAGE)
title_fullStr RECLU: a pipeline to discover reproducible transcriptional start sites and their alternative regulation using capped analysis of gene expression (CAGE)
title_full_unstemmed RECLU: a pipeline to discover reproducible transcriptional start sites and their alternative regulation using capped analysis of gene expression (CAGE)
title_short RECLU: a pipeline to discover reproducible transcriptional start sites and their alternative regulation using capped analysis of gene expression (CAGE)
title_sort reclu: a pipeline to discover reproducible transcriptional start sites and their alternative regulation using capped analysis of gene expression (cage)
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4029093/
https://www.ncbi.nlm.nih.gov/pubmed/24779366
http://dx.doi.org/10.1186/1471-2164-15-269
work_keys_str_mv AT ohmiyahiroko recluapipelinetodiscoverreproducibletranscriptionalstartsitesandtheiralternativeregulationusingcappedanalysisofgeneexpressioncage
AT vitezicmorana recluapipelinetodiscoverreproducibletranscriptionalstartsitesandtheiralternativeregulationusingcappedanalysisofgeneexpressioncage
AT frithmartinc recluapipelinetodiscoverreproducibletranscriptionalstartsitesandtheiralternativeregulationusingcappedanalysisofgeneexpressioncage
AT itohmasayoshi recluapipelinetodiscoverreproducibletranscriptionalstartsitesandtheiralternativeregulationusingcappedanalysisofgeneexpressioncage
AT carnincipiero recluapipelinetodiscoverreproducibletranscriptionalstartsitesandtheiralternativeregulationusingcappedanalysisofgeneexpressioncage
AT forrestalistairrr recluapipelinetodiscoverreproducibletranscriptionalstartsitesandtheiralternativeregulationusingcappedanalysisofgeneexpressioncage
AT hayashizakiyoshihide recluapipelinetodiscoverreproducibletranscriptionalstartsitesandtheiralternativeregulationusingcappedanalysisofgeneexpressioncage
AT lassmanntimo recluapipelinetodiscoverreproducibletranscriptionalstartsitesandtheiralternativeregulationusingcappedanalysisofgeneexpressioncage