Cargando…

Phylogenetic classification of short environmental DNA fragments

Metagenomics is providing striking insights into the ecology of microbial communities. The recently developed massively parallel 454 pyrosequencing technique gives the opportunity to rapidly obtain metagenomic sequences at a low cost and without cloning bias. However, the phylogenetic analysis of th...

Descripción completa

Detalles Bibliográficos
Autores principales: Krause, Lutz, Diaz, Naryttza N., Goesmann, Alexander, Kelley, Scott, Nattkemper, Tim W., Rohwer, Forest, Edwards, Robert A., Stoye, Jens
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2367736/
https://www.ncbi.nlm.nih.gov/pubmed/18285365
http://dx.doi.org/10.1093/nar/gkn038
_version_ 1782154360924405760
author Krause, Lutz
Diaz, Naryttza N.
Goesmann, Alexander
Kelley, Scott
Nattkemper, Tim W.
Rohwer, Forest
Edwards, Robert A.
Stoye, Jens
author_facet Krause, Lutz
Diaz, Naryttza N.
Goesmann, Alexander
Kelley, Scott
Nattkemper, Tim W.
Rohwer, Forest
Edwards, Robert A.
Stoye, Jens
author_sort Krause, Lutz
collection PubMed
description Metagenomics is providing striking insights into the ecology of microbial communities. The recently developed massively parallel 454 pyrosequencing technique gives the opportunity to rapidly obtain metagenomic sequences at a low cost and without cloning bias. However, the phylogenetic analysis of the short reads produced represents a significant computational challenge. The phylogenetic algorithm CARMA for predicting the source organisms of environmental 454 reads is described. The algorithm searches for conserved Pfam domain and protein families in the unassembled reads of a sample. These gene fragments (environmental gene tags, EGTs), are classified into a higher-order taxonomy based on the reconstruction of a phylogenetic tree of each matching Pfam family. The method exhibits high accuracy for a wide range of taxonomic groups, and EGTs as short as 27 amino acids can be phylogenetically classified up to the rank of genus. The algorithm was applied in a comparative study of three aquatic microbial samples obtained by 454 pyrosequencing. Profound differences in the taxonomic composition of these samples could be clearly revealed.
format Text
id pubmed-2367736
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-23677362008-05-07 Phylogenetic classification of short environmental DNA fragments Krause, Lutz Diaz, Naryttza N. Goesmann, Alexander Kelley, Scott Nattkemper, Tim W. Rohwer, Forest Edwards, Robert A. Stoye, Jens Nucleic Acids Res Computational Biology Metagenomics is providing striking insights into the ecology of microbial communities. The recently developed massively parallel 454 pyrosequencing technique gives the opportunity to rapidly obtain metagenomic sequences at a low cost and without cloning bias. However, the phylogenetic analysis of the short reads produced represents a significant computational challenge. The phylogenetic algorithm CARMA for predicting the source organisms of environmental 454 reads is described. The algorithm searches for conserved Pfam domain and protein families in the unassembled reads of a sample. These gene fragments (environmental gene tags, EGTs), are classified into a higher-order taxonomy based on the reconstruction of a phylogenetic tree of each matching Pfam family. The method exhibits high accuracy for a wide range of taxonomic groups, and EGTs as short as 27 amino acids can be phylogenetically classified up to the rank of genus. The algorithm was applied in a comparative study of three aquatic microbial samples obtained by 454 pyrosequencing. Profound differences in the taxonomic composition of these samples could be clearly revealed. Oxford University Press 2008-04 2008-02-19 /pmc/articles/PMC2367736/ /pubmed/18285365 http://dx.doi.org/10.1093/nar/gkn038 Text en © 2008 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Computational Biology
Krause, Lutz
Diaz, Naryttza N.
Goesmann, Alexander
Kelley, Scott
Nattkemper, Tim W.
Rohwer, Forest
Edwards, Robert A.
Stoye, Jens
Phylogenetic classification of short environmental DNA fragments
title Phylogenetic classification of short environmental DNA fragments
title_full Phylogenetic classification of short environmental DNA fragments
title_fullStr Phylogenetic classification of short environmental DNA fragments
title_full_unstemmed Phylogenetic classification of short environmental DNA fragments
title_short Phylogenetic classification of short environmental DNA fragments
title_sort phylogenetic classification of short environmental dna fragments
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2367736/
https://www.ncbi.nlm.nih.gov/pubmed/18285365
http://dx.doi.org/10.1093/nar/gkn038
work_keys_str_mv AT krauselutz phylogeneticclassificationofshortenvironmentaldnafragments
AT diaznaryttzan phylogeneticclassificationofshortenvironmentaldnafragments
AT goesmannalexander phylogeneticclassificationofshortenvironmentaldnafragments
AT kelleyscott phylogeneticclassificationofshortenvironmentaldnafragments
AT nattkempertimw phylogeneticclassificationofshortenvironmentaldnafragments
AT rohwerforest phylogeneticclassificationofshortenvironmentaldnafragments
AT edwardsroberta phylogeneticclassificationofshortenvironmentaldnafragments
AT stoyejens phylogeneticclassificationofshortenvironmentaldnafragments