Cargando…

MAGIC-SPP: a database-driven DNA sequence processing package with associated management tools

BACKGROUND: Processing raw DNA sequence data is an especially challenging task for relatively small laboratories and core facilities that produce as many as 5000 or more DNA sequences per week from multiple projects in widely differing species. To meet this challenge, we have developed the flexible,...

Descripción completa

Detalles Bibliográficos
Autores principales: Liang, Chun, Sun, Feng, Wang, Haiming, Qu, Junfeng, Freeman, Robert M, Pratt, Lee H, Cordonnier-Pratt, Marie-Michèle
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1421442/
https://www.ncbi.nlm.nih.gov/pubmed/16522212
http://dx.doi.org/10.1186/1471-2105-7-115
_version_ 1782127179615698944
author Liang, Chun
Sun, Feng
Wang, Haiming
Qu, Junfeng
Freeman, Robert M
Pratt, Lee H
Cordonnier-Pratt, Marie-Michèle
author_facet Liang, Chun
Sun, Feng
Wang, Haiming
Qu, Junfeng
Freeman, Robert M
Pratt, Lee H
Cordonnier-Pratt, Marie-Michèle
author_sort Liang, Chun
collection PubMed
description BACKGROUND: Processing raw DNA sequence data is an especially challenging task for relatively small laboratories and core facilities that produce as many as 5000 or more DNA sequences per week from multiple projects in widely differing species. To meet this challenge, we have developed the flexible, scalable, and automated sequence processing package described here. RESULTS: MAGIC-SPP is a DNA sequence processing package consisting of an Oracle 9i relational database, a Perl pipeline, and user interfaces implemented either as JavaServer Pages (JSP) or as a Java graphical user interface (GUI). The database not only serves as a data repository, but also controls processing of trace files. MAGIC-SPP includes an administrative interface, a laboratory information management system, and interfaces for exploring sequences, monitoring quality control, and troubleshooting problems related to sequencing activities. In the sequence trimming algorithm it employs new features designed to improve performance with respect to concerns such as concatenated linkers, identification of the expected start position of a vector insert, and extending the useful length of trimmed sequences by bridging short regions of low quality when the following high quality segment is sufficiently long to justify doing so. CONCLUSION: MAGIC-SPP has been designed to minimize human error, while simultaneously being robust, versatile, flexible and automated. It offers a unique combination of features that permit administration by a biologist with little or no informatics background. It is well suited to both individual research programs and core facilities.
format Text
id pubmed-1421442
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-14214422006-04-01 MAGIC-SPP: a database-driven DNA sequence processing package with associated management tools Liang, Chun Sun, Feng Wang, Haiming Qu, Junfeng Freeman, Robert M Pratt, Lee H Cordonnier-Pratt, Marie-Michèle BMC Bioinformatics Software BACKGROUND: Processing raw DNA sequence data is an especially challenging task for relatively small laboratories and core facilities that produce as many as 5000 or more DNA sequences per week from multiple projects in widely differing species. To meet this challenge, we have developed the flexible, scalable, and automated sequence processing package described here. RESULTS: MAGIC-SPP is a DNA sequence processing package consisting of an Oracle 9i relational database, a Perl pipeline, and user interfaces implemented either as JavaServer Pages (JSP) or as a Java graphical user interface (GUI). The database not only serves as a data repository, but also controls processing of trace files. MAGIC-SPP includes an administrative interface, a laboratory information management system, and interfaces for exploring sequences, monitoring quality control, and troubleshooting problems related to sequencing activities. In the sequence trimming algorithm it employs new features designed to improve performance with respect to concerns such as concatenated linkers, identification of the expected start position of a vector insert, and extending the useful length of trimmed sequences by bridging short regions of low quality when the following high quality segment is sufficiently long to justify doing so. CONCLUSION: MAGIC-SPP has been designed to minimize human error, while simultaneously being robust, versatile, flexible and automated. It offers a unique combination of features that permit administration by a biologist with little or no informatics background. It is well suited to both individual research programs and core facilities. BioMed Central 2006-03-07 /pmc/articles/PMC1421442/ /pubmed/16522212 http://dx.doi.org/10.1186/1471-2105-7-115 Text en Copyright © 2006 Liang et al; licensee BioMed Central Ltd.
spellingShingle Software
Liang, Chun
Sun, Feng
Wang, Haiming
Qu, Junfeng
Freeman, Robert M
Pratt, Lee H
Cordonnier-Pratt, Marie-Michèle
MAGIC-SPP: a database-driven DNA sequence processing package with associated management tools
title MAGIC-SPP: a database-driven DNA sequence processing package with associated management tools
title_full MAGIC-SPP: a database-driven DNA sequence processing package with associated management tools
title_fullStr MAGIC-SPP: a database-driven DNA sequence processing package with associated management tools
title_full_unstemmed MAGIC-SPP: a database-driven DNA sequence processing package with associated management tools
title_short MAGIC-SPP: a database-driven DNA sequence processing package with associated management tools
title_sort magic-spp: a database-driven dna sequence processing package with associated management tools
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1421442/
https://www.ncbi.nlm.nih.gov/pubmed/16522212
http://dx.doi.org/10.1186/1471-2105-7-115
work_keys_str_mv AT liangchun magicsppadatabasedrivendnasequenceprocessingpackagewithassociatedmanagementtools
AT sunfeng magicsppadatabasedrivendnasequenceprocessingpackagewithassociatedmanagementtools
AT wanghaiming magicsppadatabasedrivendnasequenceprocessingpackagewithassociatedmanagementtools
AT qujunfeng magicsppadatabasedrivendnasequenceprocessingpackagewithassociatedmanagementtools
AT freemanrobertm magicsppadatabasedrivendnasequenceprocessingpackagewithassociatedmanagementtools
AT prattleeh magicsppadatabasedrivendnasequenceprocessingpackagewithassociatedmanagementtools
AT cordonnierprattmariemichele magicsppadatabasedrivendnasequenceprocessingpackagewithassociatedmanagementtools