Cargando…

Near-optimal assembly for shotgun sequencing with noisy reads

Recent work identified the fundamental limits on the information requirements in terms of read length and coverage depth required for successful de novo genome reconstruction from shotgun sequencing data, based on the idealistic assumption of no errors in the reads (noiseless reads). In this work, w...

Descripción completa

Detalles Bibliográficos
Autores principales: Lam, Ka-Kit, Khalak, Asif, Tse, David
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4168708/
https://www.ncbi.nlm.nih.gov/pubmed/25252708
http://dx.doi.org/10.1186/1471-2105-15-S9-S4
_version_ 1782335603457654784
author Lam, Ka-Kit
Khalak, Asif
Tse, David
author_facet Lam, Ka-Kit
Khalak, Asif
Tse, David
author_sort Lam, Ka-Kit
collection PubMed
description Recent work identified the fundamental limits on the information requirements in terms of read length and coverage depth required for successful de novo genome reconstruction from shotgun sequencing data, based on the idealistic assumption of no errors in the reads (noiseless reads). In this work, we show that even when there is noise in the reads, one can successfully reconstruct with information requirements close to the noiseless fundamental limit. A new assembly algorithm, X-phased Multibridging, is designed based on a probabilistic model of the genome. It is shown through analysis to perform well on the model, and through simulations to perform well on real genomes.
format Online
Article
Text
id pubmed-4168708
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-41687082014-10-02 Near-optimal assembly for shotgun sequencing with noisy reads Lam, Ka-Kit Khalak, Asif Tse, David BMC Bioinformatics Proceedings Recent work identified the fundamental limits on the information requirements in terms of read length and coverage depth required for successful de novo genome reconstruction from shotgun sequencing data, based on the idealistic assumption of no errors in the reads (noiseless reads). In this work, we show that even when there is noise in the reads, one can successfully reconstruct with information requirements close to the noiseless fundamental limit. A new assembly algorithm, X-phased Multibridging, is designed based on a probabilistic model of the genome. It is shown through analysis to perform well on the model, and through simulations to perform well on real genomes. BioMed Central 2014-09-10 /pmc/articles/PMC4168708/ /pubmed/25252708 http://dx.doi.org/10.1186/1471-2105-15-S9-S4 Text en Copyright © 2014 Lam et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Proceedings
Lam, Ka-Kit
Khalak, Asif
Tse, David
Near-optimal assembly for shotgun sequencing with noisy reads
title Near-optimal assembly for shotgun sequencing with noisy reads
title_full Near-optimal assembly for shotgun sequencing with noisy reads
title_fullStr Near-optimal assembly for shotgun sequencing with noisy reads
title_full_unstemmed Near-optimal assembly for shotgun sequencing with noisy reads
title_short Near-optimal assembly for shotgun sequencing with noisy reads
title_sort near-optimal assembly for shotgun sequencing with noisy reads
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4168708/
https://www.ncbi.nlm.nih.gov/pubmed/25252708
http://dx.doi.org/10.1186/1471-2105-15-S9-S4
work_keys_str_mv AT lamkakit nearoptimalassemblyforshotgunsequencingwithnoisyreads
AT khalakasif nearoptimalassemblyforshotgunsequencingwithnoisyreads
AT tsedavid nearoptimalassemblyforshotgunsequencingwithnoisyreads