Cargando…

Identification of discriminative characteristics for clusters from biologic data with InforBIO software

BACKGROUND: There are a number of different methods for generation of trees and algorithms for phylogenetic analysis in the study of bacterial taxonomy. Genotypic information, such as SSU rRNA gene sequences, now plays a more prominent role in microbial systematics than does phenotypic information....

Descripción completa

Detalles Bibliográficos
Autores principales: Tanaka, Naoto, Uchino, Masataka, Miyazaki, Satoru, Sugawara, Hideaki
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1973088/
https://www.ncbi.nlm.nih.gov/pubmed/17683520
http://dx.doi.org/10.1186/1471-2105-8-281
_version_ 1782135036925968384
author Tanaka, Naoto
Uchino, Masataka
Miyazaki, Satoru
Sugawara, Hideaki
author_facet Tanaka, Naoto
Uchino, Masataka
Miyazaki, Satoru
Sugawara, Hideaki
author_sort Tanaka, Naoto
collection PubMed
description BACKGROUND: There are a number of different methods for generation of trees and algorithms for phylogenetic analysis in the study of bacterial taxonomy. Genotypic information, such as SSU rRNA gene sequences, now plays a more prominent role in microbial systematics than does phenotypic information. However, the integration of genotypic and phenotypic information for polyphasic studies is necessary for the classification and identification of microbes. Thus, we devised an algorithm that objectively identifies discriminative characteristics for focused clusters on generated trees from a dataset composed of coded data, such as phenotypic information. Moreover, this algorithm has been integrated into the polyphasic analysis software, InforBIO. RESULTS: We developed a differential-character-finding algorithm based on information measures and used this algorithm to identify the characteristic that best discriminates operational taxonomic unit clusters. For all characteristics in a dataset, the algorithm estimates commonality in focused clusters and diversity among clusters by scoring based on Shannon's and relative entropies. All the characteristics selected for scoring are equally weighted. Thresholds for the scores are defined to identify discriminative characteristics for clusters efficiently from a database. The unique feature of the algorithm, which is implemented in the InforBIO software, is that it can identify the phenotypic characteristics that discriminate and are associated with the clusters of a phylogenetic tree. We successfully applied this algorithm to the study of phylogenetic clusters of Pseudomonas species. CONCLUSION: The algorithm in the InforBIO software is a novel and useful approach for microbial polyphasic studies. The algorithm can also be applied to diverse cluster analyses. The InforBIO software is available from the download site . This software is free for personal but not commercial use.
format Text
id pubmed-1973088
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-19730882007-09-08 Identification of discriminative characteristics for clusters from biologic data with InforBIO software Tanaka, Naoto Uchino, Masataka Miyazaki, Satoru Sugawara, Hideaki BMC Bioinformatics Software BACKGROUND: There are a number of different methods for generation of trees and algorithms for phylogenetic analysis in the study of bacterial taxonomy. Genotypic information, such as SSU rRNA gene sequences, now plays a more prominent role in microbial systematics than does phenotypic information. However, the integration of genotypic and phenotypic information for polyphasic studies is necessary for the classification and identification of microbes. Thus, we devised an algorithm that objectively identifies discriminative characteristics for focused clusters on generated trees from a dataset composed of coded data, such as phenotypic information. Moreover, this algorithm has been integrated into the polyphasic analysis software, InforBIO. RESULTS: We developed a differential-character-finding algorithm based on information measures and used this algorithm to identify the characteristic that best discriminates operational taxonomic unit clusters. For all characteristics in a dataset, the algorithm estimates commonality in focused clusters and diversity among clusters by scoring based on Shannon's and relative entropies. All the characteristics selected for scoring are equally weighted. Thresholds for the scores are defined to identify discriminative characteristics for clusters efficiently from a database. The unique feature of the algorithm, which is implemented in the InforBIO software, is that it can identify the phenotypic characteristics that discriminate and are associated with the clusters of a phylogenetic tree. We successfully applied this algorithm to the study of phylogenetic clusters of Pseudomonas species. CONCLUSION: The algorithm in the InforBIO software is a novel and useful approach for microbial polyphasic studies. The algorithm can also be applied to diverse cluster analyses. The InforBIO software is available from the download site . This software is free for personal but not commercial use. BioMed Central 2007-08-02 /pmc/articles/PMC1973088/ /pubmed/17683520 http://dx.doi.org/10.1186/1471-2105-8-281 Text en Copyright © 2007 Tanaka et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Tanaka, Naoto
Uchino, Masataka
Miyazaki, Satoru
Sugawara, Hideaki
Identification of discriminative characteristics for clusters from biologic data with InforBIO software
title Identification of discriminative characteristics for clusters from biologic data with InforBIO software
title_full Identification of discriminative characteristics for clusters from biologic data with InforBIO software
title_fullStr Identification of discriminative characteristics for clusters from biologic data with InforBIO software
title_full_unstemmed Identification of discriminative characteristics for clusters from biologic data with InforBIO software
title_short Identification of discriminative characteristics for clusters from biologic data with InforBIO software
title_sort identification of discriminative characteristics for clusters from biologic data with inforbio software
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1973088/
https://www.ncbi.nlm.nih.gov/pubmed/17683520
http://dx.doi.org/10.1186/1471-2105-8-281
work_keys_str_mv AT tanakanaoto identificationofdiscriminativecharacteristicsforclustersfrombiologicdatawithinforbiosoftware
AT uchinomasataka identificationofdiscriminativecharacteristicsforclustersfrombiologicdatawithinforbiosoftware
AT miyazakisatoru identificationofdiscriminativecharacteristicsforclustersfrombiologicdatawithinforbiosoftware
AT sugawarahideaki identificationofdiscriminativecharacteristicsforclustersfrombiologicdatawithinforbiosoftware