Cargando…
Near-optimal assembly for shotgun sequencing with noisy reads
Recent work identified the fundamental limits on the information requirements in terms of read length and coverage depth required for successful de novo genome reconstruction from shotgun sequencing data, based on the idealistic assumption of no errors in the reads (noiseless reads). In this work, w...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4168708/ https://www.ncbi.nlm.nih.gov/pubmed/25252708 http://dx.doi.org/10.1186/1471-2105-15-S9-S4 |
_version_ | 1782335603457654784 |
---|---|
author | Lam, Ka-Kit Khalak, Asif Tse, David |
author_facet | Lam, Ka-Kit Khalak, Asif Tse, David |
author_sort | Lam, Ka-Kit |
collection | PubMed |
description | Recent work identified the fundamental limits on the information requirements in terms of read length and coverage depth required for successful de novo genome reconstruction from shotgun sequencing data, based on the idealistic assumption of no errors in the reads (noiseless reads). In this work, we show that even when there is noise in the reads, one can successfully reconstruct with information requirements close to the noiseless fundamental limit. A new assembly algorithm, X-phased Multibridging, is designed based on a probabilistic model of the genome. It is shown through analysis to perform well on the model, and through simulations to perform well on real genomes. |
format | Online Article Text |
id | pubmed-4168708 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-41687082014-10-02 Near-optimal assembly for shotgun sequencing with noisy reads Lam, Ka-Kit Khalak, Asif Tse, David BMC Bioinformatics Proceedings Recent work identified the fundamental limits on the information requirements in terms of read length and coverage depth required for successful de novo genome reconstruction from shotgun sequencing data, based on the idealistic assumption of no errors in the reads (noiseless reads). In this work, we show that even when there is noise in the reads, one can successfully reconstruct with information requirements close to the noiseless fundamental limit. A new assembly algorithm, X-phased Multibridging, is designed based on a probabilistic model of the genome. It is shown through analysis to perform well on the model, and through simulations to perform well on real genomes. BioMed Central 2014-09-10 /pmc/articles/PMC4168708/ /pubmed/25252708 http://dx.doi.org/10.1186/1471-2105-15-S9-S4 Text en Copyright © 2014 Lam et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Proceedings Lam, Ka-Kit Khalak, Asif Tse, David Near-optimal assembly for shotgun sequencing with noisy reads |
title | Near-optimal assembly for shotgun sequencing with noisy reads |
title_full | Near-optimal assembly for shotgun sequencing with noisy reads |
title_fullStr | Near-optimal assembly for shotgun sequencing with noisy reads |
title_full_unstemmed | Near-optimal assembly for shotgun sequencing with noisy reads |
title_short | Near-optimal assembly for shotgun sequencing with noisy reads |
title_sort | near-optimal assembly for shotgun sequencing with noisy reads |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4168708/ https://www.ncbi.nlm.nih.gov/pubmed/25252708 http://dx.doi.org/10.1186/1471-2105-15-S9-S4 |
work_keys_str_mv | AT lamkakit nearoptimalassemblyforshotgunsequencingwithnoisyreads AT khalakasif nearoptimalassemblyforshotgunsequencingwithnoisyreads AT tsedavid nearoptimalassemblyforshotgunsequencingwithnoisyreads |