Cargando…

Composition-based classification of short metagenomic sequences elucidates the landscapes of taxonomic and functional enrichment of microorganisms

Compared with traditional algorithms for long metagenomic sequence classification, characterizing microorganisms’ taxonomic and functional abundance based on tens of millions of very short reads are much more challenging. We describe an efficient composition and phylogeny-based algorithm [Metagenome...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Jiemeng, Wang, Haifeng, Yang, Hongxing, Zhang, Yizhe, Wang, Jinfeng, Zhao, Fangqing, Qi, Ji
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3592448/
https://www.ncbi.nlm.nih.gov/pubmed/22941634
http://dx.doi.org/10.1093/nar/gks828
_version_ 1782262118300516352
author Liu, Jiemeng
Wang, Haifeng
Yang, Hongxing
Zhang, Yizhe
Wang, Jinfeng
Zhao, Fangqing
Qi, Ji
author_facet Liu, Jiemeng
Wang, Haifeng
Yang, Hongxing
Zhang, Yizhe
Wang, Jinfeng
Zhao, Fangqing
Qi, Ji
author_sort Liu, Jiemeng
collection PubMed
description Compared with traditional algorithms for long metagenomic sequence classification, characterizing microorganisms’ taxonomic and functional abundance based on tens of millions of very short reads are much more challenging. We describe an efficient composition and phylogeny-based algorithm [Metagenome Composition Vector (MetaCV)] to classify very short metagenomic reads (75–100 bp) into specific taxonomic and functional groups. We applied MetaCV to the Meta-HIT data (371-Gb 75-bp reads of 109 human gut metagenomes), and this single-read-based, instead of assembly-based, classification has a high resolution to characterize the composition and structure of human gut microbiota, especially for low abundance species. Most strikingly, it only took MetaCV 10 days to do all the computation work on a server with five 24-core nodes. To our knowledge, MetaCV, benefited from the strategy of composition comparison, is the first algorithm that can classify millions of very short reads within affordable time.
format Online
Article
Text
id pubmed-3592448
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-35924482013-03-08 Composition-based classification of short metagenomic sequences elucidates the landscapes of taxonomic and functional enrichment of microorganisms Liu, Jiemeng Wang, Haifeng Yang, Hongxing Zhang, Yizhe Wang, Jinfeng Zhao, Fangqing Qi, Ji Nucleic Acids Res Methods Online Compared with traditional algorithms for long metagenomic sequence classification, characterizing microorganisms’ taxonomic and functional abundance based on tens of millions of very short reads are much more challenging. We describe an efficient composition and phylogeny-based algorithm [Metagenome Composition Vector (MetaCV)] to classify very short metagenomic reads (75–100 bp) into specific taxonomic and functional groups. We applied MetaCV to the Meta-HIT data (371-Gb 75-bp reads of 109 human gut metagenomes), and this single-read-based, instead of assembly-based, classification has a high resolution to characterize the composition and structure of human gut microbiota, especially for low abundance species. Most strikingly, it only took MetaCV 10 days to do all the computation work on a server with five 24-core nodes. To our knowledge, MetaCV, benefited from the strategy of composition comparison, is the first algorithm that can classify millions of very short reads within affordable time. Oxford University Press 2013-01 2012-08-31 /pmc/articles/PMC3592448/ /pubmed/22941634 http://dx.doi.org/10.1093/nar/gks828 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Liu, Jiemeng
Wang, Haifeng
Yang, Hongxing
Zhang, Yizhe
Wang, Jinfeng
Zhao, Fangqing
Qi, Ji
Composition-based classification of short metagenomic sequences elucidates the landscapes of taxonomic and functional enrichment of microorganisms
title Composition-based classification of short metagenomic sequences elucidates the landscapes of taxonomic and functional enrichment of microorganisms
title_full Composition-based classification of short metagenomic sequences elucidates the landscapes of taxonomic and functional enrichment of microorganisms
title_fullStr Composition-based classification of short metagenomic sequences elucidates the landscapes of taxonomic and functional enrichment of microorganisms
title_full_unstemmed Composition-based classification of short metagenomic sequences elucidates the landscapes of taxonomic and functional enrichment of microorganisms
title_short Composition-based classification of short metagenomic sequences elucidates the landscapes of taxonomic and functional enrichment of microorganisms
title_sort composition-based classification of short metagenomic sequences elucidates the landscapes of taxonomic and functional enrichment of microorganisms
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3592448/
https://www.ncbi.nlm.nih.gov/pubmed/22941634
http://dx.doi.org/10.1093/nar/gks828
work_keys_str_mv AT liujiemeng compositionbasedclassificationofshortmetagenomicsequenceselucidatesthelandscapesoftaxonomicandfunctionalenrichmentofmicroorganisms
AT wanghaifeng compositionbasedclassificationofshortmetagenomicsequenceselucidatesthelandscapesoftaxonomicandfunctionalenrichmentofmicroorganisms
AT yanghongxing compositionbasedclassificationofshortmetagenomicsequenceselucidatesthelandscapesoftaxonomicandfunctionalenrichmentofmicroorganisms
AT zhangyizhe compositionbasedclassificationofshortmetagenomicsequenceselucidatesthelandscapesoftaxonomicandfunctionalenrichmentofmicroorganisms
AT wangjinfeng compositionbasedclassificationofshortmetagenomicsequenceselucidatesthelandscapesoftaxonomicandfunctionalenrichmentofmicroorganisms
AT zhaofangqing compositionbasedclassificationofshortmetagenomicsequenceselucidatesthelandscapesoftaxonomicandfunctionalenrichmentofmicroorganisms
AT qiji compositionbasedclassificationofshortmetagenomicsequenceselucidatesthelandscapesoftaxonomicandfunctionalenrichmentofmicroorganisms