Cargando…

Efficient Synergistic Single-Cell Genome Assembly

As the vast majority of all microbes are unculturable, single-cell sequencing has become a significant method to gain insight into microbial physiology. Single-cell sequencing methods, currently powered by multiple displacement genome amplification (MDA), have passed important milestones such as fin...

Descripción completa

Detalles Bibliográficos
Autores principales: Movahedi, Narjes S., Embree, Mallory, Nagarajan, Harish, Zengler, Karsten, Chitsaz, Hamidreza
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4876485/
https://www.ncbi.nlm.nih.gov/pubmed/27243002
http://dx.doi.org/10.3389/fbioe.2016.00042
_version_ 1782433247994576896
author Movahedi, Narjes S.
Embree, Mallory
Nagarajan, Harish
Zengler, Karsten
Chitsaz, Hamidreza
author_facet Movahedi, Narjes S.
Embree, Mallory
Nagarajan, Harish
Zengler, Karsten
Chitsaz, Hamidreza
author_sort Movahedi, Narjes S.
collection PubMed
description As the vast majority of all microbes are unculturable, single-cell sequencing has become a significant method to gain insight into microbial physiology. Single-cell sequencing methods, currently powered by multiple displacement genome amplification (MDA), have passed important milestones such as finishing and closing the genome of a prokaryote. However, the quality and reliability of genome assemblies from single cells are still unsatisfactory due to uneven coverage depth and the absence of scattered chunks of the genome in the final collection of reads caused by MDA bias. In this work, our new algorithm Hybrid De novo Assembler (HyDA) demonstrates the power of coassembly of multiple single-cell genomic data sets through significant improvement of the assembly quality in terms of predicted functional elements and length statistics. Coassemblies contain significantly more base pairs and protein coding genes, cover more subsystems, and consist of longer contigs compared to individual assemblies by the same algorithm as well as state-of-the-art single-cell assemblers SPAdes and IDBA-UD. Hybrid De novo Assembler (HyDA) is also able to avoid chimeric assemblies by detecting and separating shared and exclusive pieces of sequence for input data sets. By replacing one deep single-cell sequencing experiment with a few single-cell sequencing experiments of lower depth, the coassembly method can hedge against the risk of failure and loss of the sample, without significantly increasing sequencing cost. Application of the single-cell coassembler HyDA to the study of three uncultured members of an alkane-degrading methanogenic community validated the usefulness of the coassembly concept. HyDA is open source and publicly available at http://chitsazlab.org/software.html, and the raw reads are available at http://chitsazlab.org/research.html.
format Online
Article
Text
id pubmed-4876485
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-48764852016-05-30 Efficient Synergistic Single-Cell Genome Assembly Movahedi, Narjes S. Embree, Mallory Nagarajan, Harish Zengler, Karsten Chitsaz, Hamidreza Front Bioeng Biotechnol Bioengineering and Biotechnology As the vast majority of all microbes are unculturable, single-cell sequencing has become a significant method to gain insight into microbial physiology. Single-cell sequencing methods, currently powered by multiple displacement genome amplification (MDA), have passed important milestones such as finishing and closing the genome of a prokaryote. However, the quality and reliability of genome assemblies from single cells are still unsatisfactory due to uneven coverage depth and the absence of scattered chunks of the genome in the final collection of reads caused by MDA bias. In this work, our new algorithm Hybrid De novo Assembler (HyDA) demonstrates the power of coassembly of multiple single-cell genomic data sets through significant improvement of the assembly quality in terms of predicted functional elements and length statistics. Coassemblies contain significantly more base pairs and protein coding genes, cover more subsystems, and consist of longer contigs compared to individual assemblies by the same algorithm as well as state-of-the-art single-cell assemblers SPAdes and IDBA-UD. Hybrid De novo Assembler (HyDA) is also able to avoid chimeric assemblies by detecting and separating shared and exclusive pieces of sequence for input data sets. By replacing one deep single-cell sequencing experiment with a few single-cell sequencing experiments of lower depth, the coassembly method can hedge against the risk of failure and loss of the sample, without significantly increasing sequencing cost. Application of the single-cell coassembler HyDA to the study of three uncultured members of an alkane-degrading methanogenic community validated the usefulness of the coassembly concept. HyDA is open source and publicly available at http://chitsazlab.org/software.html, and the raw reads are available at http://chitsazlab.org/research.html. Frontiers Media S.A. 2016-05-23 /pmc/articles/PMC4876485/ /pubmed/27243002 http://dx.doi.org/10.3389/fbioe.2016.00042 Text en Copyright © 2016 Movahedi, Embree, Nagarajan, Zengler and Chitsaz. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Bioengineering and Biotechnology
Movahedi, Narjes S.
Embree, Mallory
Nagarajan, Harish
Zengler, Karsten
Chitsaz, Hamidreza
Efficient Synergistic Single-Cell Genome Assembly
title Efficient Synergistic Single-Cell Genome Assembly
title_full Efficient Synergistic Single-Cell Genome Assembly
title_fullStr Efficient Synergistic Single-Cell Genome Assembly
title_full_unstemmed Efficient Synergistic Single-Cell Genome Assembly
title_short Efficient Synergistic Single-Cell Genome Assembly
title_sort efficient synergistic single-cell genome assembly
topic Bioengineering and Biotechnology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4876485/
https://www.ncbi.nlm.nih.gov/pubmed/27243002
http://dx.doi.org/10.3389/fbioe.2016.00042
work_keys_str_mv AT movahedinarjess efficientsynergisticsinglecellgenomeassembly
AT embreemallory efficientsynergisticsinglecellgenomeassembly
AT nagarajanharish efficientsynergisticsinglecellgenomeassembly
AT zenglerkarsten efficientsynergisticsinglecellgenomeassembly
AT chitsazhamidreza efficientsynergisticsinglecellgenomeassembly