
Slim-Filter: an interactive Windows-based application for Illumina Genome Analyzer data assessment and manipulation

BACKGROUND: The emergence of Next Generation Sequencing technologies has made it possible for individual investigators to generate gigabases of sequencing data per week. Effective analysis and manipulation of these data are limited by large file sizes, so even simple tasks such as data filtration...


Bibliographic Details
Main Authors: Golovko, Georgiy, Khanipov, Kamil, Rojas, Mark, Martinez-Alcántara, Antonio, Howard, Jesse J, Ballesteros, Efren, Gupta, Sharu, Widger, William, Fofanov, Yuriy
Format: Online Article Text
Language: English
Published: BioMed Central 2012
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3505481/
https://www.ncbi.nlm.nih.gov/pubmed/22800377
http://dx.doi.org/10.1186/1471-2105-13-166
_version_ 1782250763573002240
author Golovko, Georgiy
Khanipov, Kamil
Rojas, Mark
Martinez-Alcántara, Antonio
Howard, Jesse J
Ballesteros, Efren
Gupta, Sharu
Widger, William
Fofanov, Yuriy
author_facet Golovko, Georgiy
Khanipov, Kamil
Rojas, Mark
Martinez-Alcántara, Antonio
Howard, Jesse J
Ballesteros, Efren
Gupta, Sharu
Widger, William
Fofanov, Yuriy
author_sort Golovko, Georgiy
collection PubMed
description BACKGROUND: The emergence of Next Generation Sequencing technologies has made it possible for individual investigators to generate gigabases of sequencing data per week. Effective analysis and manipulation of these data are limited by large file sizes, so even simple tasks such as data filtration and quality assessment have to be performed in several steps. This requires (potentially problematic) interaction between the investigator and a bioinformatics/computational service provider. Furthermore, such services are often performed using specialized computational facilities. RESULTS: We present a Windows-based application, Slim-Filter, designed to interactively examine the statistical properties of sequencing reads produced by the Illumina Genome Analyzer and to perform a broad spectrum of data manipulation tasks, including: filtration of low-quality and low-complexity reads; filtration of reads containing undesired subsequences (such as parts of adapters and PCR primers used during the sample and sequencing library preparation steps); exclusion of duplicated reads (while keeping each read’s copy number information in a specialized data format); and sorting of reads by copy number, allowing for easy access and manual editing of the resulting files. Slim-Filter is organized as a sequence of windows summarizing the statistical properties of the reads. Each data manipulation step has roll-back abilities, allowing for return to previous steps of the data analysis process. Slim-Filter is written in C++ and is compatible with fasta, fastq, and specialized AS file formats presented in this manuscript. Setup files and a user’s manual are available for download at the supplementary web site (https://www.bioinfo.uh.edu/Slim_Filter/). CONCLUSION: The presented Windows-based application has been developed with the goal of providing individual investigators with integrated sequencing read analysis, curation, and manipulation capabilities.
format Online
Article
Text
id pubmed-3505481
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-3505481 2012-11-29 Slim-Filter: an interactive Windows-based application for Illumina Genome Analyzer data assessment and manipulation Golovko, Georgiy Khanipov, Kamil Rojas, Mark Martinez-Alcántara, Antonio Howard, Jesse J Ballesteros, Efren Gupta, Sharu Widger, William Fofanov, Yuriy BMC Bioinformatics Software BACKGROUND: The emergence of Next Generation Sequencing technologies has made it possible for individual investigators to generate gigabases of sequencing data per week. Effective analysis and manipulation of these data are limited by large file sizes, so even simple tasks such as data filtration and quality assessment have to be performed in several steps. This requires (potentially problematic) interaction between the investigator and a bioinformatics/computational service provider. Furthermore, such services are often performed using specialized computational facilities. RESULTS: We present a Windows-based application, Slim-Filter, designed to interactively examine the statistical properties of sequencing reads produced by the Illumina Genome Analyzer and to perform a broad spectrum of data manipulation tasks, including: filtration of low-quality and low-complexity reads; filtration of reads containing undesired subsequences (such as parts of adapters and PCR primers used during the sample and sequencing library preparation steps); exclusion of duplicated reads (while keeping each read’s copy number information in a specialized data format); and sorting of reads by copy number, allowing for easy access and manual editing of the resulting files. Slim-Filter is organized as a sequence of windows summarizing the statistical properties of the reads. Each data manipulation step has roll-back abilities, allowing for return to previous steps of the data analysis process. Slim-Filter is written in C++ and is compatible with fasta, fastq, and specialized AS file formats presented in this manuscript. Setup files and a user’s manual are available for download at the supplementary web site (https://www.bioinfo.uh.edu/Slim_Filter/). CONCLUSION: The presented Windows-based application has been developed with the goal of providing individual investigators with integrated sequencing read analysis, curation, and manipulation capabilities. BioMed Central 2012-07-16 /pmc/articles/PMC3505481/ /pubmed/22800377 http://dx.doi.org/10.1186/1471-2105-13-166 Text en Copyright ©2012 Golovko et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Golovko, Georgiy
Khanipov, Kamil
Rojas, Mark
Martinez-Alcántara, Antonio
Howard, Jesse J
Ballesteros, Efren
Gupta, Sharu
Widger, William
Fofanov, Yuriy
Slim-Filter: an interactive Windows-based application for Illumina Genome Analyzer data assessment and manipulation
title Slim-Filter: an interactive Windows-based application for Illumina Genome Analyzer data assessment and manipulation
title_full Slim-Filter: an interactive Windows-based application for Illumina Genome Analyzer data assessment and manipulation
title_fullStr Slim-Filter: an interactive Windows-based application for Illumina Genome Analyzer data assessment and manipulation
title_full_unstemmed Slim-Filter: an interactive Windows-based application for Illumina Genome Analyzer data assessment and manipulation
title_short Slim-Filter: an interactive Windows-based application for Illumina Genome Analyzer data assessment and manipulation
title_sort slim-filter: an interactive windows-based application for illumina genome analyzer data assessment and manipulation
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3505481/
https://www.ncbi.nlm.nih.gov/pubmed/22800377
http://dx.doi.org/10.1186/1471-2105-13-166
work_keys_str_mv AT golovkogeorgiy slimfilteraninteractivewindowsbasedapplicationforilluminagenomeanalyzerdataassessmentandmanipulation
AT khanipovkamil slimfilteraninteractivewindowsbasedapplicationforilluminagenomeanalyzerdataassessmentandmanipulation
AT rojasmark slimfilteraninteractivewindowsbasedapplicationforilluminagenomeanalyzerdataassessmentandmanipulation
AT martinezalcantaraantonio slimfilteraninteractivewindowsbasedapplicationforilluminagenomeanalyzerdataassessmentandmanipulation
AT howardjessej slimfilteraninteractivewindowsbasedapplicationforilluminagenomeanalyzerdataassessmentandmanipulation
AT ballesterosefren slimfilteraninteractivewindowsbasedapplicationforilluminagenomeanalyzerdataassessmentandmanipulation
AT guptasharu slimfilteraninteractivewindowsbasedapplicationforilluminagenomeanalyzerdataassessmentandmanipulation
AT widgerwilliam slimfilteraninteractivewindowsbasedapplicationforilluminagenomeanalyzerdataassessmentandmanipulation
AT fofanovyuriy slimfilteraninteractivewindowsbasedapplicationforilluminagenomeanalyzerdataassessmentandmanipulation