Cargando…
LOCAS – A Low Coverage Assembly Tool for Resequencing Projects
MOTIVATION: Next Generation Sequencing (NGS) is a frequently applied approach to detect sequence variations between highly related genomes. Recent large-scale re-sequencing studies as the Human 1000 Genomes Project utilize NGS data of low coverage to afford sequencing of hundreds of individuals. Her...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3156226/ https://www.ncbi.nlm.nih.gov/pubmed/21858125 http://dx.doi.org/10.1371/journal.pone.0023455 |
_version_ | 1782210181954797568 |
---|---|
author | Klein, Juliane D. Ossowski, Stephan Schneeberger, Korbinian Weigel, Detlef Huson, Daniel H. |
author_facet | Klein, Juliane D. Ossowski, Stephan Schneeberger, Korbinian Weigel, Detlef Huson, Daniel H. |
author_sort | Klein, Juliane D. |
collection | PubMed |
description | MOTIVATION: Next Generation Sequencing (NGS) is a frequently applied approach to detect sequence variations between highly related genomes. Recent large-scale re-sequencing studies as the Human 1000 Genomes Project utilize NGS data of low coverage to afford sequencing of hundreds of individuals. Here, SNPs and micro-indels can be detected by applying an alignment-consensus approach. However, computational methods capable of discovering other variations such as novel insertions or highly diverged sequence from low coverage NGS data are still lacking. RESULTS: We present LOCAS, a new NGS assembler particularly designed for low coverage assembly of eukaryotic genomes using a mismatch sensitive overlap-layout-consensus approach. LOCAS assembles homologous regions in a homology-guided manner while it performs de novo assemblies of insertions and highly polymorphic target regions subsequently to an alignment-consensus approach. LOCAS has been evaluated in homology-guided assembly scenarios with low sequence coverage of Arabidopsis thaliana strains sequenced as part of the Arabidopsis 1001 Genomes Project. While assembling the same amount of long insertions as state-of-the-art NGS assemblers, LOCAS showed best results regarding contig size, error rate and runtime. CONCLUSION: LOCAS produces excellent results for homology-guided assembly of eukaryotic genomes with short reads and low sequencing depth, and therefore appears to be the assembly tool of choice for the detection of novel sequence variations in this scenario. |
format | Online Article Text |
id | pubmed-3156226 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-31562262011-08-19 LOCAS – A Low Coverage Assembly Tool for Resequencing Projects Klein, Juliane D. Ossowski, Stephan Schneeberger, Korbinian Weigel, Detlef Huson, Daniel H. PLoS One Research Article MOTIVATION: Next Generation Sequencing (NGS) is a frequently applied approach to detect sequence variations between highly related genomes. Recent large-scale re-sequencing studies as the Human 1000 Genomes Project utilize NGS data of low coverage to afford sequencing of hundreds of individuals. Here, SNPs and micro-indels can be detected by applying an alignment-consensus approach. However, computational methods capable of discovering other variations such as novel insertions or highly diverged sequence from low coverage NGS data are still lacking. RESULTS: We present LOCAS, a new NGS assembler particularly designed for low coverage assembly of eukaryotic genomes using a mismatch sensitive overlap-layout-consensus approach. LOCAS assembles homologous regions in a homology-guided manner while it performs de novo assemblies of insertions and highly polymorphic target regions subsequently to an alignment-consensus approach. LOCAS has been evaluated in homology-guided assembly scenarios with low sequence coverage of Arabidopsis thaliana strains sequenced as part of the Arabidopsis 1001 Genomes Project. While assembling the same amount of long insertions as state-of-the-art NGS assemblers, LOCAS showed best results regarding contig size, error rate and runtime. CONCLUSION: LOCAS produces excellent results for homology-guided assembly of eukaryotic genomes with short reads and low sequencing depth, and therefore appears to be the assembly tool of choice for the detection of novel sequence variations in this scenario. Public Library of Science 2011-08-15 /pmc/articles/PMC3156226/ /pubmed/21858125 http://dx.doi.org/10.1371/journal.pone.0023455 Text en Klein et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Klein, Juliane D. Ossowski, Stephan Schneeberger, Korbinian Weigel, Detlef Huson, Daniel H. LOCAS – A Low Coverage Assembly Tool for Resequencing Projects |
title | LOCAS – A Low Coverage Assembly Tool for Resequencing Projects |
title_full | LOCAS – A Low Coverage Assembly Tool for Resequencing Projects |
title_fullStr | LOCAS – A Low Coverage Assembly Tool for Resequencing Projects |
title_full_unstemmed | LOCAS – A Low Coverage Assembly Tool for Resequencing Projects |
title_short | LOCAS – A Low Coverage Assembly Tool for Resequencing Projects |
title_sort | locas – a low coverage assembly tool for resequencing projects |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3156226/ https://www.ncbi.nlm.nih.gov/pubmed/21858125 http://dx.doi.org/10.1371/journal.pone.0023455 |
work_keys_str_mv | AT kleinjulianed locasalowcoverageassemblytoolforresequencingprojects AT ossowskistephan locasalowcoverageassemblytoolforresequencingprojects AT schneebergerkorbinian locasalowcoverageassemblytoolforresequencingprojects AT weigeldetlef locasalowcoverageassemblytoolforresequencingprojects AT husondanielh locasalowcoverageassemblytoolforresequencingprojects |