Cargando…

NanoCAGE-XL and CapFilter: an approach to genome wide identification of high confidence transcription start sites

BACKGROUND: Identifying the transcription start sites (TSS) of genes is essential for characterizing promoter regions. Several protocols have been developed to capture the 5′ end of transcripts via Cap Analysis of Gene Expression (CAGE) or linker-ligation strategies such as Paired-End Analysis of Tr...

Descripción completa

Detalles Bibliográficos
Autores principales: Cumbie, Jason S., Ivanchenko, Maria G., Megraw, Molly
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4534009/
https://www.ncbi.nlm.nih.gov/pubmed/26268438
http://dx.doi.org/10.1186/s12864-015-1670-6
_version_ 1782385394192482304
author Cumbie, Jason S.
Ivanchenko, Maria G.
Megraw, Molly
author_facet Cumbie, Jason S.
Ivanchenko, Maria G.
Megraw, Molly
author_sort Cumbie, Jason S.
collection PubMed
description BACKGROUND: Identifying the transcription start sites (TSS) of genes is essential for characterizing promoter regions. Several protocols have been developed to capture the 5′ end of transcripts via Cap Analysis of Gene Expression (CAGE) or linker-ligation strategies such as Paired-End Analysis of Transcription Start Sites (PEAT), but often require large amounts of tissue. More recently, nanoCAGE was developed for sequencing on the Illumina GAIIx to overcome these difficulties. RESULTS: Here we present the first publicly available adaptation of nanoCAGE for sequencing on recent ultra-high throughput platforms such as Illumina HiSeq-2000, and CapFilter, a computational pipeline that greatly increases confidence in TSS identification. We report excellent gene coverage, reproducibility, and precision in transcription start site discovery for samples from Arabidopsis thaliana roots. CONCLUSION: nanoCAGE-XL together with CapFilter allows for genome wide identification of high confidence transcription start sites in large eukaryotic genomes. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-1670-6) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4534009
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-45340092015-08-13 NanoCAGE-XL and CapFilter: an approach to genome wide identification of high confidence transcription start sites Cumbie, Jason S. Ivanchenko, Maria G. Megraw, Molly BMC Genomics Methodology Article BACKGROUND: Identifying the transcription start sites (TSS) of genes is essential for characterizing promoter regions. Several protocols have been developed to capture the 5′ end of transcripts via Cap Analysis of Gene Expression (CAGE) or linker-ligation strategies such as Paired-End Analysis of Transcription Start Sites (PEAT), but often require large amounts of tissue. More recently, nanoCAGE was developed for sequencing on the Illumina GAIIx to overcome these difficulties. RESULTS: Here we present the first publicly available adaptation of nanoCAGE for sequencing on recent ultra-high throughput platforms such as Illumina HiSeq-2000, and CapFilter, a computational pipeline that greatly increases confidence in TSS identification. We report excellent gene coverage, reproducibility, and precision in transcription start site discovery for samples from Arabidopsis thaliana roots. CONCLUSION: nanoCAGE-XL together with CapFilter allows for genome wide identification of high confidence transcription start sites in large eukaryotic genomes. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-1670-6) contains supplementary material, which is available to authorized users. BioMed Central 2015-08-13 /pmc/articles/PMC4534009/ /pubmed/26268438 http://dx.doi.org/10.1186/s12864-015-1670-6 Text en © Cumbie et al. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Cumbie, Jason S.
Ivanchenko, Maria G.
Megraw, Molly
NanoCAGE-XL and CapFilter: an approach to genome wide identification of high confidence transcription start sites
title NanoCAGE-XL and CapFilter: an approach to genome wide identification of high confidence transcription start sites
title_full NanoCAGE-XL and CapFilter: an approach to genome wide identification of high confidence transcription start sites
title_fullStr NanoCAGE-XL and CapFilter: an approach to genome wide identification of high confidence transcription start sites
title_full_unstemmed NanoCAGE-XL and CapFilter: an approach to genome wide identification of high confidence transcription start sites
title_short NanoCAGE-XL and CapFilter: an approach to genome wide identification of high confidence transcription start sites
title_sort nanocage-xl and capfilter: an approach to genome wide identification of high confidence transcription start sites
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4534009/
https://www.ncbi.nlm.nih.gov/pubmed/26268438
http://dx.doi.org/10.1186/s12864-015-1670-6
work_keys_str_mv AT cumbiejasons nanocagexlandcapfilteranapproachtogenomewideidentificationofhighconfidencetranscriptionstartsites
AT ivanchenkomariag nanocagexlandcapfilteranapproachtogenomewideidentificationofhighconfidencetranscriptionstartsites
AT megrawmolly nanocagexlandcapfilteranapproachtogenomewideidentificationofhighconfidencetranscriptionstartsites