Cargando…

LOCAS – A Low Coverage Assembly Tool for Resequencing Projects

MOTIVATION: Next Generation Sequencing (NGS) is a frequently applied approach to detect sequence variations between highly related genomes. Recent large-scale re-sequencing studies as the Human 1000 Genomes Project utilize NGS data of low coverage to afford sequencing of hundreds of individuals. Her...

Descripción completa

Detalles Bibliográficos
Autores principales: Klein, Juliane D., Ossowski, Stephan, Schneeberger, Korbinian, Weigel, Detlef, Huson, Daniel H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3156226/
https://www.ncbi.nlm.nih.gov/pubmed/21858125
http://dx.doi.org/10.1371/journal.pone.0023455
_version_ 1782210181954797568
author Klein, Juliane D.
Ossowski, Stephan
Schneeberger, Korbinian
Weigel, Detlef
Huson, Daniel H.
author_facet Klein, Juliane D.
Ossowski, Stephan
Schneeberger, Korbinian
Weigel, Detlef
Huson, Daniel H.
author_sort Klein, Juliane D.
collection PubMed
description MOTIVATION: Next Generation Sequencing (NGS) is a frequently applied approach to detect sequence variations between highly related genomes. Recent large-scale re-sequencing studies as the Human 1000 Genomes Project utilize NGS data of low coverage to afford sequencing of hundreds of individuals. Here, SNPs and micro-indels can be detected by applying an alignment-consensus approach. However, computational methods capable of discovering other variations such as novel insertions or highly diverged sequence from low coverage NGS data are still lacking. RESULTS: We present LOCAS, a new NGS assembler particularly designed for low coverage assembly of eukaryotic genomes using a mismatch sensitive overlap-layout-consensus approach. LOCAS assembles homologous regions in a homology-guided manner while it performs de novo assemblies of insertions and highly polymorphic target regions subsequently to an alignment-consensus approach. LOCAS has been evaluated in homology-guided assembly scenarios with low sequence coverage of Arabidopsis thaliana strains sequenced as part of the Arabidopsis 1001 Genomes Project. While assembling the same amount of long insertions as state-of-the-art NGS assemblers, LOCAS showed best results regarding contig size, error rate and runtime. CONCLUSION: LOCAS produces excellent results for homology-guided assembly of eukaryotic genomes with short reads and low sequencing depth, and therefore appears to be the assembly tool of choice for the detection of novel sequence variations in this scenario.
format Online
Article
Text
id pubmed-3156226
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-31562262011-08-19 LOCAS – A Low Coverage Assembly Tool for Resequencing Projects Klein, Juliane D. Ossowski, Stephan Schneeberger, Korbinian Weigel, Detlef Huson, Daniel H. PLoS One Research Article MOTIVATION: Next Generation Sequencing (NGS) is a frequently applied approach to detect sequence variations between highly related genomes. Recent large-scale re-sequencing studies as the Human 1000 Genomes Project utilize NGS data of low coverage to afford sequencing of hundreds of individuals. Here, SNPs and micro-indels can be detected by applying an alignment-consensus approach. However, computational methods capable of discovering other variations such as novel insertions or highly diverged sequence from low coverage NGS data are still lacking. RESULTS: We present LOCAS, a new NGS assembler particularly designed for low coverage assembly of eukaryotic genomes using a mismatch sensitive overlap-layout-consensus approach. LOCAS assembles homologous regions in a homology-guided manner while it performs de novo assemblies of insertions and highly polymorphic target regions subsequently to an alignment-consensus approach. LOCAS has been evaluated in homology-guided assembly scenarios with low sequence coverage of Arabidopsis thaliana strains sequenced as part of the Arabidopsis 1001 Genomes Project. While assembling the same amount of long insertions as state-of-the-art NGS assemblers, LOCAS showed best results regarding contig size, error rate and runtime. CONCLUSION: LOCAS produces excellent results for homology-guided assembly of eukaryotic genomes with short reads and low sequencing depth, and therefore appears to be the assembly tool of choice for the detection of novel sequence variations in this scenario. Public Library of Science 2011-08-15 /pmc/articles/PMC3156226/ /pubmed/21858125 http://dx.doi.org/10.1371/journal.pone.0023455 Text en Klein et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Klein, Juliane D.
Ossowski, Stephan
Schneeberger, Korbinian
Weigel, Detlef
Huson, Daniel H.
LOCAS – A Low Coverage Assembly Tool for Resequencing Projects
title LOCAS – A Low Coverage Assembly Tool for Resequencing Projects
title_full LOCAS – A Low Coverage Assembly Tool for Resequencing Projects
title_fullStr LOCAS – A Low Coverage Assembly Tool for Resequencing Projects
title_full_unstemmed LOCAS – A Low Coverage Assembly Tool for Resequencing Projects
title_short LOCAS – A Low Coverage Assembly Tool for Resequencing Projects
title_sort locas – a low coverage assembly tool for resequencing projects
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3156226/
https://www.ncbi.nlm.nih.gov/pubmed/21858125
http://dx.doi.org/10.1371/journal.pone.0023455
work_keys_str_mv AT kleinjulianed locasalowcoverageassemblytoolforresequencingprojects
AT ossowskistephan locasalowcoverageassemblytoolforresequencingprojects
AT schneebergerkorbinian locasalowcoverageassemblytoolforresequencingprojects
AT weigeldetlef locasalowcoverageassemblytoolforresequencingprojects
AT husondanielh locasalowcoverageassemblytoolforresequencingprojects