Cargando…

Mauve Assembly Metrics

Summary: High-throughput DNA sequencing technologies have spurred the development of numerous novel methods for genome assembly. With few exceptions, these algorithms are heuristic and require one or more parameters to be manually set by the user. One approach to parameter tuning involves assembling...

Descripción completa

Detalles Bibliográficos
Autores principales: Darling, Aaron E., Tritt, Andrew, Eisen, Jonathan A., Facciotti, Marc T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3179657/
https://www.ncbi.nlm.nih.gov/pubmed/21810901
http://dx.doi.org/10.1093/bioinformatics/btr451
_version_ 1782212538724777984
author Darling, Aaron E.
Tritt, Andrew
Eisen, Jonathan A.
Facciotti, Marc T.
author_facet Darling, Aaron E.
Tritt, Andrew
Eisen, Jonathan A.
Facciotti, Marc T.
author_sort Darling, Aaron E.
collection PubMed
description Summary: High-throughput DNA sequencing technologies have spurred the development of numerous novel methods for genome assembly. With few exceptions, these algorithms are heuristic and require one or more parameters to be manually set by the user. One approach to parameter tuning involves assembling data from an organism with an available high-quality reference genome, and measuring assembly accuracy using some metrics. We developed a system to measure assembly quality under several scoring metrics, and to compare assembly quality across a variety of assemblers, sequence data types, and parameter choices. When used in conjunction with training data such as a high-quality reference genome and sequence reads from the same organism, our program can be used to manually identify an optimal sequencing and assembly strategy for de novo sequencing of related organisms. Availability: GPL source code and a usage tutorial is at http://ngopt.googlecode.com Contact: aarondarling@ucdavis.edu Supplementary information: Supplementary data is available at Bioinformatics online.
format Online
Article
Text
id pubmed-3179657
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-31796572011-09-26 Mauve Assembly Metrics Darling, Aaron E. Tritt, Andrew Eisen, Jonathan A. Facciotti, Marc T. Bioinformatics Applications Note Summary: High-throughput DNA sequencing technologies have spurred the development of numerous novel methods for genome assembly. With few exceptions, these algorithms are heuristic and require one or more parameters to be manually set by the user. One approach to parameter tuning involves assembling data from an organism with an available high-quality reference genome, and measuring assembly accuracy using some metrics. We developed a system to measure assembly quality under several scoring metrics, and to compare assembly quality across a variety of assemblers, sequence data types, and parameter choices. When used in conjunction with training data such as a high-quality reference genome and sequence reads from the same organism, our program can be used to manually identify an optimal sequencing and assembly strategy for de novo sequencing of related organisms. Availability: GPL source code and a usage tutorial is at http://ngopt.googlecode.com Contact: aarondarling@ucdavis.edu Supplementary information: Supplementary data is available at Bioinformatics online. Oxford University Press 2011-10-01 2011-08-02 /pmc/articles/PMC3179657/ /pubmed/21810901 http://dx.doi.org/10.1093/bioinformatics/btr451 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Note
Darling, Aaron E.
Tritt, Andrew
Eisen, Jonathan A.
Facciotti, Marc T.
Mauve Assembly Metrics
title Mauve Assembly Metrics
title_full Mauve Assembly Metrics
title_fullStr Mauve Assembly Metrics
title_full_unstemmed Mauve Assembly Metrics
title_short Mauve Assembly Metrics
title_sort mauve assembly metrics
topic Applications Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3179657/
https://www.ncbi.nlm.nih.gov/pubmed/21810901
http://dx.doi.org/10.1093/bioinformatics/btr451
work_keys_str_mv AT darlingaarone mauveassemblymetrics
AT trittandrew mauveassemblymetrics
AT eisenjonathana mauveassemblymetrics
AT facciottimarct mauveassemblymetrics