Cargando…

A new strategy for better genome assembly from very short reads

BACKGROUND: With the rapid development of the next generation sequencing (NGS) technology, large quantities of genome sequencing data have been generated. Because of repetitive regions of genomes and some other factors, assembly of very short reads is still a challenging issue. RESULTS: A novel stra...

Descripción completa

Detalles Bibliográficos
Autores principales: Ji, Yan, Shi, Yixiang, Ding, Guohui, Li, Yixue
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3268122/
https://www.ncbi.nlm.nih.gov/pubmed/22208765
http://dx.doi.org/10.1186/1471-2105-12-493
_version_ 1782222354210881536
author Ji, Yan
Shi, Yixiang
Ding, Guohui
Li, Yixue
author_facet Ji, Yan
Shi, Yixiang
Ding, Guohui
Li, Yixue
author_sort Ji, Yan
collection PubMed
description BACKGROUND: With the rapid development of the next generation sequencing (NGS) technology, large quantities of genome sequencing data have been generated. Because of repetitive regions of genomes and some other factors, assembly of very short reads is still a challenging issue. RESULTS: A novel strategy for improving genome assembly from very short reads is proposed. It can increase accuracies of assemblies by integrating de novo contigs, and produce comparative contigs by allowing multiple references without limiting to genomes of closely related strains. Comparative contigs are used to scaffold de novo contigs. Using simulated and real datasets, it is shown that our strategy can effectively improve qualities of assemblies of isolated microbial genomes and metagenomes. CONCLUSIONS: With more and more reference genomes available, our strategy will be useful to improve qualities of genome assemblies from very short reads. Some scripts are provided to make our strategy applicable at http://code.google.com/p/cd-hybrid/.
format Online
Article
Text
id pubmed-3268122
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32681222012-01-30 A new strategy for better genome assembly from very short reads Ji, Yan Shi, Yixiang Ding, Guohui Li, Yixue BMC Bioinformatics Methodology Article BACKGROUND: With the rapid development of the next generation sequencing (NGS) technology, large quantities of genome sequencing data have been generated. Because of repetitive regions of genomes and some other factors, assembly of very short reads is still a challenging issue. RESULTS: A novel strategy for improving genome assembly from very short reads is proposed. It can increase accuracies of assemblies by integrating de novo contigs, and produce comparative contigs by allowing multiple references without limiting to genomes of closely related strains. Comparative contigs are used to scaffold de novo contigs. Using simulated and real datasets, it is shown that our strategy can effectively improve qualities of assemblies of isolated microbial genomes and metagenomes. CONCLUSIONS: With more and more reference genomes available, our strategy will be useful to improve qualities of genome assemblies from very short reads. Some scripts are provided to make our strategy applicable at http://code.google.com/p/cd-hybrid/. BioMed Central 2011-12-30 /pmc/articles/PMC3268122/ /pubmed/22208765 http://dx.doi.org/10.1186/1471-2105-12-493 Text en Copyright ©2011 Ji et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Ji, Yan
Shi, Yixiang
Ding, Guohui
Li, Yixue
A new strategy for better genome assembly from very short reads
title A new strategy for better genome assembly from very short reads
title_full A new strategy for better genome assembly from very short reads
title_fullStr A new strategy for better genome assembly from very short reads
title_full_unstemmed A new strategy for better genome assembly from very short reads
title_short A new strategy for better genome assembly from very short reads
title_sort new strategy for better genome assembly from very short reads
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3268122/
https://www.ncbi.nlm.nih.gov/pubmed/22208765
http://dx.doi.org/10.1186/1471-2105-12-493
work_keys_str_mv AT jiyan anewstrategyforbettergenomeassemblyfromveryshortreads
AT shiyixiang anewstrategyforbettergenomeassemblyfromveryshortreads
AT dingguohui anewstrategyforbettergenomeassemblyfromveryshortreads
AT liyixue anewstrategyforbettergenomeassemblyfromveryshortreads
AT jiyan newstrategyforbettergenomeassemblyfromveryshortreads
AT shiyixiang newstrategyforbettergenomeassemblyfromveryshortreads
AT dingguohui newstrategyforbettergenomeassemblyfromveryshortreads
AT liyixue newstrategyforbettergenomeassemblyfromveryshortreads