Cargando…

POSA: Perl Objects for DNA Sequencing Data Analysis

BACKGROUND: Capillary DNA sequencing machines allow the generation of vast amounts of data with little hands-on time. With this expansion of data generation, there is a growing need for automated data processing. Most available software solutions, however, still require user intervention or provide...

Descripción completa

Detalles Bibliográficos
Autores principales: Aerts, Jan A, Jungerius, Bart J, Groenen, Martien AM
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2004
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC516770/
https://www.ncbi.nlm.nih.gov/pubmed/15333141
http://dx.doi.org/10.1186/1471-2164-5-60
_version_ 1782121770036232192
author Aerts, Jan A
Jungerius, Bart J
Groenen, Martien AM
author_facet Aerts, Jan A
Jungerius, Bart J
Groenen, Martien AM
author_sort Aerts, Jan A
collection PubMed
description BACKGROUND: Capillary DNA sequencing machines allow the generation of vast amounts of data with little hands-on time. With this expansion of data generation, there is a growing need for automated data processing. Most available software solutions, however, still require user intervention or provide modules that need advanced informatics skills to allow implementation in pipelines. RESULTS: Here we present POSA, a pair of new perl objects that describe DNA sequence traces and Phrap contig assemblies in detail. Methods included in POSA include basecalling with quality scores (by Phred), contig assembly (by Phrap), generation of primer3 input and automated SNP annotation (by PolyPhred). Although easily implemented by users with only limited programming experience, these objects considerabily reduce hands-on analysis time compared to using the Staden package for extracting sequence information from raw sequencing files and for SNP discovery. CONCLUSIONS: The POSA objects allow a flexible and easy design, implementation and usage of perl-based pipelines to handle and analyze DNA sequencing data, while requiring only minor programming skills.
format Text
id pubmed-516770
institution National Center for Biotechnology Information
language English
publishDate 2004
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-5167702004-09-12 POSA: Perl Objects for DNA Sequencing Data Analysis Aerts, Jan A Jungerius, Bart J Groenen, Martien AM BMC Genomics Software BACKGROUND: Capillary DNA sequencing machines allow the generation of vast amounts of data with little hands-on time. With this expansion of data generation, there is a growing need for automated data processing. Most available software solutions, however, still require user intervention or provide modules that need advanced informatics skills to allow implementation in pipelines. RESULTS: Here we present POSA, a pair of new perl objects that describe DNA sequence traces and Phrap contig assemblies in detail. Methods included in POSA include basecalling with quality scores (by Phred), contig assembly (by Phrap), generation of primer3 input and automated SNP annotation (by PolyPhred). Although easily implemented by users with only limited programming experience, these objects considerabily reduce hands-on analysis time compared to using the Staden package for extracting sequence information from raw sequencing files and for SNP discovery. CONCLUSIONS: The POSA objects allow a flexible and easy design, implementation and usage of perl-based pipelines to handle and analyze DNA sequencing data, while requiring only minor programming skills. BioMed Central 2004-08-27 /pmc/articles/PMC516770/ /pubmed/15333141 http://dx.doi.org/10.1186/1471-2164-5-60 Text en Copyright © 2004 Aerts et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open-access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Aerts, Jan A
Jungerius, Bart J
Groenen, Martien AM
POSA: Perl Objects for DNA Sequencing Data Analysis
title POSA: Perl Objects for DNA Sequencing Data Analysis
title_full POSA: Perl Objects for DNA Sequencing Data Analysis
title_fullStr POSA: Perl Objects for DNA Sequencing Data Analysis
title_full_unstemmed POSA: Perl Objects for DNA Sequencing Data Analysis
title_short POSA: Perl Objects for DNA Sequencing Data Analysis
title_sort posa: perl objects for dna sequencing data analysis
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC516770/
https://www.ncbi.nlm.nih.gov/pubmed/15333141
http://dx.doi.org/10.1186/1471-2164-5-60
work_keys_str_mv AT aertsjana posaperlobjectsfordnasequencingdataanalysis
AT jungeriusbartj posaperlobjectsfordnasequencingdataanalysis
AT groenenmartienam posaperlobjectsfordnasequencingdataanalysis