Cargando…

GSTaxClassifier: a genomic signature based taxonomic classifier for metagenomic data analysis

: 1. a simple but effective algorithm, a modification of the Bayesian method, to predict the most probable genomic origins of sequences at different taxonomical ranks, on the basis of genome databases; 2. a function to generate genomic profiles of reference sequences with tri-, tetra-, penta-, and h...

Descripción completa

Detalles Bibliográficos
Autores principales: Yu, Fahong, Sun, Yijun, Liu, Li, Farmerie, William
Formato: Texto
Lenguaje:English
Publicado: Biomedical Informatics 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2770370/
https://www.ncbi.nlm.nih.gov/pubmed/20011152
Descripción
Sumario:: 1. a simple but effective algorithm, a modification of the Bayesian method, to predict the most probable genomic origins of sequences at different taxonomical ranks, on the basis of genome databases; 2. a function to generate genomic profiles of reference sequences with tri-, tetra-, penta-, and hexa-nucleotide motifs for setting a user-defined database; ; 3. two different formats (tabular- and tree-based summaries) to display taxonomic predictions with improved analytical methods; and 4. effective ways to retrieve, search, and summarize results by integrating the predictions into the NCBI tree-based taxonomic information. GSTaxClassifier takes input nucleotide sequences and using a modified Bayesian model evaluates the genomic signatures between metagenomic query sequences and reference genome databases. The simulation studies of a numerical data sets showed that GSTaxClassifier could serve as a useful program for metagenomics studies, which is freely available at http://helix2.biotech.ufl.edu:26878/metagenomics/.