Cargando…
Increasing the coverage of a metapopulation consensus genome by iterative read mapping and assembly
Motivation: Most microbial species can not be cultured in the laboratory. Metagenomic sequencing may still yield a complete genome if the sequenced community is enriched and the sequencing coverage is high. However, the complexity in a natural population may cause the enrichment culture to contain m...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2781756/ https://www.ncbi.nlm.nih.gov/pubmed/19542148 http://dx.doi.org/10.1093/bioinformatics/btp377 |
_version_ | 1782174583354294272 |
---|---|
author | Dutilh, Bas E. Huynen, Martijn A. Strous, Marc |
author_facet | Dutilh, Bas E. Huynen, Martijn A. Strous, Marc |
author_sort | Dutilh, Bas E. |
collection | PubMed |
description | Motivation: Most microbial species can not be cultured in the laboratory. Metagenomic sequencing may still yield a complete genome if the sequenced community is enriched and the sequencing coverage is high. However, the complexity in a natural population may cause the enrichment culture to contain multiple related strains. This diversity can confound existing strict assembly programs and lead to a fragmented assembly, which is unnecessary if we have a related reference genome available that can function as a scaffold. Results: Here, we map short metagenomic sequencing reads from a population of strains to a related reference genome, and compose a genome that captures the consensus of the population's sequences. We show that by iteration of the mapping and assembly procedure, the coverage increases while the similarity with the reference genome decreases. This indicates that the assembly becomes less dependent on the reference genome and approaches the consensus genome of the multi-strain population. Contact: dutilh@cmbi.ru.nl Supplementary Information: Supplementary data are available at Bioinformatics online. |
format | Text |
id | pubmed-2781756 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-27817562009-11-25 Increasing the coverage of a metapopulation consensus genome by iterative read mapping and assembly Dutilh, Bas E. Huynen, Martijn A. Strous, Marc Bioinformatics Ismb/Eccb 2009 Special Interest Group on Short Read Sequencing Motivation: Most microbial species can not be cultured in the laboratory. Metagenomic sequencing may still yield a complete genome if the sequenced community is enriched and the sequencing coverage is high. However, the complexity in a natural population may cause the enrichment culture to contain multiple related strains. This diversity can confound existing strict assembly programs and lead to a fragmented assembly, which is unnecessary if we have a related reference genome available that can function as a scaffold. Results: Here, we map short metagenomic sequencing reads from a population of strains to a related reference genome, and compose a genome that captures the consensus of the population's sequences. We show that by iteration of the mapping and assembly procedure, the coverage increases while the similarity with the reference genome decreases. This indicates that the assembly becomes less dependent on the reference genome and approaches the consensus genome of the multi-strain population. Contact: dutilh@cmbi.ru.nl Supplementary Information: Supplementary data are available at Bioinformatics online. Oxford University Press 2009-11-01 2009-06-19 /pmc/articles/PMC2781756/ /pubmed/19542148 http://dx.doi.org/10.1093/bioinformatics/btp377 Text en © The Author(s) 2009. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Ismb/Eccb 2009 Special Interest Group on Short Read Sequencing Dutilh, Bas E. Huynen, Martijn A. Strous, Marc Increasing the coverage of a metapopulation consensus genome by iterative read mapping and assembly |
title | Increasing the coverage of a metapopulation consensus genome by iterative read mapping and assembly |
title_full | Increasing the coverage of a metapopulation consensus genome by iterative read mapping and assembly |
title_fullStr | Increasing the coverage of a metapopulation consensus genome by iterative read mapping and assembly |
title_full_unstemmed | Increasing the coverage of a metapopulation consensus genome by iterative read mapping and assembly |
title_short | Increasing the coverage of a metapopulation consensus genome by iterative read mapping and assembly |
title_sort | increasing the coverage of a metapopulation consensus genome by iterative read mapping and assembly |
topic | Ismb/Eccb 2009 Special Interest Group on Short Read Sequencing |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2781756/ https://www.ncbi.nlm.nih.gov/pubmed/19542148 http://dx.doi.org/10.1093/bioinformatics/btp377 |
work_keys_str_mv | AT dutilhbase increasingthecoverageofametapopulationconsensusgenomebyiterativereadmappingandassembly AT huynenmartijna increasingthecoverageofametapopulationconsensusgenomebyiterativereadmappingandassembly AT strousmarc increasingthecoverageofametapopulationconsensusgenomebyiterativereadmappingandassembly |