Cargando…

SICTIN: Rapid footprinting of massively parallel sequencing data

BACKGROUND: Massively parallel sequencing allows for genome-wide hypothesis-free investigation of for instance transcription factor binding sites or histone modifications. Although nucleotide resolution detailed information can easily be generated, biological insight often requires a more general vi...

Descripción completa

Detalles Bibliográficos
Autores principales: Enroth, Stefan, Andersson, Robin, Wadelius, Claes, Komorowski, Jan
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2928217/
https://www.ncbi.nlm.nih.gov/pubmed/20707885
http://dx.doi.org/10.1186/1756-0381-3-4
_version_ 1782185839859597312
author Enroth, Stefan
Andersson, Robin
Wadelius, Claes
Komorowski, Jan
author_facet Enroth, Stefan
Andersson, Robin
Wadelius, Claes
Komorowski, Jan
author_sort Enroth, Stefan
collection PubMed
description BACKGROUND: Massively parallel sequencing allows for genome-wide hypothesis-free investigation of for instance transcription factor binding sites or histone modifications. Although nucleotide resolution detailed information can easily be generated, biological insight often requires a more general view of patterns (footprints) over distinct genomic features such as transcription start sites, exons or repetitive regions. The construction of these footprints is however a time consuming task. METHODS: The presented software generates a binary representation of the signals enabling fast and scalable lookup. This representation allows for footprint generation in mere minutes on a desktop computer. Several different input formats are accepted, e.g. the SAM format, bed-files and the UCSC wiggle track. CONCLUSIONS: Hypothesis-free investigation of genome wide interactions allows for biological data mining at a scale never before seen. Until recently, the main focus of analysis of sequencing data has been targeted on signal patterns around transcriptional start sites which are in manageable numbers. Today, focus is shifting to a wider perspective and numerous genomic features are being studied. To this end, we provide a system allowing for fast querying in the order of hundreds of thousands of features.
format Text
id pubmed-2928217
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-29282172010-08-26 SICTIN: Rapid footprinting of massively parallel sequencing data Enroth, Stefan Andersson, Robin Wadelius, Claes Komorowski, Jan BioData Min Software Article BACKGROUND: Massively parallel sequencing allows for genome-wide hypothesis-free investigation of for instance transcription factor binding sites or histone modifications. Although nucleotide resolution detailed information can easily be generated, biological insight often requires a more general view of patterns (footprints) over distinct genomic features such as transcription start sites, exons or repetitive regions. The construction of these footprints is however a time consuming task. METHODS: The presented software generates a binary representation of the signals enabling fast and scalable lookup. This representation allows for footprint generation in mere minutes on a desktop computer. Several different input formats are accepted, e.g. the SAM format, bed-files and the UCSC wiggle track. CONCLUSIONS: Hypothesis-free investigation of genome wide interactions allows for biological data mining at a scale never before seen. Until recently, the main focus of analysis of sequencing data has been targeted on signal patterns around transcriptional start sites which are in manageable numbers. Today, focus is shifting to a wider perspective and numerous genomic features are being studied. To this end, we provide a system allowing for fast querying in the order of hundreds of thousands of features. BioMed Central 2010-08-13 /pmc/articles/PMC2928217/ /pubmed/20707885 http://dx.doi.org/10.1186/1756-0381-3-4 Text en Copyright ©2010 Enroth et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software Article
Enroth, Stefan
Andersson, Robin
Wadelius, Claes
Komorowski, Jan
SICTIN: Rapid footprinting of massively parallel sequencing data
title SICTIN: Rapid footprinting of massively parallel sequencing data
title_full SICTIN: Rapid footprinting of massively parallel sequencing data
title_fullStr SICTIN: Rapid footprinting of massively parallel sequencing data
title_full_unstemmed SICTIN: Rapid footprinting of massively parallel sequencing data
title_short SICTIN: Rapid footprinting of massively parallel sequencing data
title_sort sictin: rapid footprinting of massively parallel sequencing data
topic Software Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2928217/
https://www.ncbi.nlm.nih.gov/pubmed/20707885
http://dx.doi.org/10.1186/1756-0381-3-4
work_keys_str_mv AT enrothstefan sictinrapidfootprintingofmassivelyparallelsequencingdata
AT anderssonrobin sictinrapidfootprintingofmassivelyparallelsequencingdata
AT wadeliusclaes sictinrapidfootprintingofmassivelyparallelsequencingdata
AT komorowskijan sictinrapidfootprintingofmassivelyparallelsequencingdata