Cargando…
POSA: Perl Objects for DNA Sequencing Data Analysis
BACKGROUND: Capillary DNA sequencing machines allow the generation of vast amounts of data with little hands-on time. With this expansion of data generation, there is a growing need for automated data processing. Most available software solutions, however, still require user intervention or provide...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2004
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC516770/ https://www.ncbi.nlm.nih.gov/pubmed/15333141 http://dx.doi.org/10.1186/1471-2164-5-60 |
_version_ | 1782121770036232192 |
---|---|
author | Aerts, Jan A Jungerius, Bart J Groenen, Martien AM |
author_facet | Aerts, Jan A Jungerius, Bart J Groenen, Martien AM |
author_sort | Aerts, Jan A |
collection | PubMed |
description | BACKGROUND: Capillary DNA sequencing machines allow the generation of vast amounts of data with little hands-on time. With this expansion of data generation, there is a growing need for automated data processing. Most available software solutions, however, still require user intervention or provide modules that need advanced informatics skills to allow implementation in pipelines. RESULTS: Here we present POSA, a pair of new perl objects that describe DNA sequence traces and Phrap contig assemblies in detail. Methods included in POSA include basecalling with quality scores (by Phred), contig assembly (by Phrap), generation of primer3 input and automated SNP annotation (by PolyPhred). Although easily implemented by users with only limited programming experience, these objects considerabily reduce hands-on analysis time compared to using the Staden package for extracting sequence information from raw sequencing files and for SNP discovery. CONCLUSIONS: The POSA objects allow a flexible and easy design, implementation and usage of perl-based pipelines to handle and analyze DNA sequencing data, while requiring only minor programming skills. |
format | Text |
id | pubmed-516770 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2004 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-5167702004-09-12 POSA: Perl Objects for DNA Sequencing Data Analysis Aerts, Jan A Jungerius, Bart J Groenen, Martien AM BMC Genomics Software BACKGROUND: Capillary DNA sequencing machines allow the generation of vast amounts of data with little hands-on time. With this expansion of data generation, there is a growing need for automated data processing. Most available software solutions, however, still require user intervention or provide modules that need advanced informatics skills to allow implementation in pipelines. RESULTS: Here we present POSA, a pair of new perl objects that describe DNA sequence traces and Phrap contig assemblies in detail. Methods included in POSA include basecalling with quality scores (by Phred), contig assembly (by Phrap), generation of primer3 input and automated SNP annotation (by PolyPhred). Although easily implemented by users with only limited programming experience, these objects considerabily reduce hands-on analysis time compared to using the Staden package for extracting sequence information from raw sequencing files and for SNP discovery. CONCLUSIONS: The POSA objects allow a flexible and easy design, implementation and usage of perl-based pipelines to handle and analyze DNA sequencing data, while requiring only minor programming skills. BioMed Central 2004-08-27 /pmc/articles/PMC516770/ /pubmed/15333141 http://dx.doi.org/10.1186/1471-2164-5-60 Text en Copyright © 2004 Aerts et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open-access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Software Aerts, Jan A Jungerius, Bart J Groenen, Martien AM POSA: Perl Objects for DNA Sequencing Data Analysis |
title | POSA: Perl Objects for DNA Sequencing Data Analysis |
title_full | POSA: Perl Objects for DNA Sequencing Data Analysis |
title_fullStr | POSA: Perl Objects for DNA Sequencing Data Analysis |
title_full_unstemmed | POSA: Perl Objects for DNA Sequencing Data Analysis |
title_short | POSA: Perl Objects for DNA Sequencing Data Analysis |
title_sort | posa: perl objects for dna sequencing data analysis |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC516770/ https://www.ncbi.nlm.nih.gov/pubmed/15333141 http://dx.doi.org/10.1186/1471-2164-5-60 |
work_keys_str_mv | AT aertsjana posaperlobjectsfordnasequencingdataanalysis AT jungeriusbartj posaperlobjectsfordnasequencingdataanalysis AT groenenmartienam posaperlobjectsfordnasequencingdataanalysis |