Cargando…
Identification of discriminative characteristics for clusters from biologic data with InforBIO software
BACKGROUND: There are a number of different methods for generation of trees and algorithms for phylogenetic analysis in the study of bacterial taxonomy. Genotypic information, such as SSU rRNA gene sequences, now plays a more prominent role in microbial systematics than does phenotypic information....
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2007
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1973088/ https://www.ncbi.nlm.nih.gov/pubmed/17683520 http://dx.doi.org/10.1186/1471-2105-8-281 |
_version_ | 1782135036925968384 |
---|---|
author | Tanaka, Naoto Uchino, Masataka Miyazaki, Satoru Sugawara, Hideaki |
author_facet | Tanaka, Naoto Uchino, Masataka Miyazaki, Satoru Sugawara, Hideaki |
author_sort | Tanaka, Naoto |
collection | PubMed |
description | BACKGROUND: There are a number of different methods for generation of trees and algorithms for phylogenetic analysis in the study of bacterial taxonomy. Genotypic information, such as SSU rRNA gene sequences, now plays a more prominent role in microbial systematics than does phenotypic information. However, the integration of genotypic and phenotypic information for polyphasic studies is necessary for the classification and identification of microbes. Thus, we devised an algorithm that objectively identifies discriminative characteristics for focused clusters on generated trees from a dataset composed of coded data, such as phenotypic information. Moreover, this algorithm has been integrated into the polyphasic analysis software, InforBIO. RESULTS: We developed a differential-character-finding algorithm based on information measures and used this algorithm to identify the characteristic that best discriminates operational taxonomic unit clusters. For all characteristics in a dataset, the algorithm estimates commonality in focused clusters and diversity among clusters by scoring based on Shannon's and relative entropies. All the characteristics selected for scoring are equally weighted. Thresholds for the scores are defined to identify discriminative characteristics for clusters efficiently from a database. The unique feature of the algorithm, which is implemented in the InforBIO software, is that it can identify the phenotypic characteristics that discriminate and are associated with the clusters of a phylogenetic tree. We successfully applied this algorithm to the study of phylogenetic clusters of Pseudomonas species. CONCLUSION: The algorithm in the InforBIO software is a novel and useful approach for microbial polyphasic studies. The algorithm can also be applied to diverse cluster analyses. The InforBIO software is available from the download site . This software is free for personal but not commercial use. |
format | Text |
id | pubmed-1973088 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-19730882007-09-08 Identification of discriminative characteristics for clusters from biologic data with InforBIO software Tanaka, Naoto Uchino, Masataka Miyazaki, Satoru Sugawara, Hideaki BMC Bioinformatics Software BACKGROUND: There are a number of different methods for generation of trees and algorithms for phylogenetic analysis in the study of bacterial taxonomy. Genotypic information, such as SSU rRNA gene sequences, now plays a more prominent role in microbial systematics than does phenotypic information. However, the integration of genotypic and phenotypic information for polyphasic studies is necessary for the classification and identification of microbes. Thus, we devised an algorithm that objectively identifies discriminative characteristics for focused clusters on generated trees from a dataset composed of coded data, such as phenotypic information. Moreover, this algorithm has been integrated into the polyphasic analysis software, InforBIO. RESULTS: We developed a differential-character-finding algorithm based on information measures and used this algorithm to identify the characteristic that best discriminates operational taxonomic unit clusters. For all characteristics in a dataset, the algorithm estimates commonality in focused clusters and diversity among clusters by scoring based on Shannon's and relative entropies. All the characteristics selected for scoring are equally weighted. Thresholds for the scores are defined to identify discriminative characteristics for clusters efficiently from a database. The unique feature of the algorithm, which is implemented in the InforBIO software, is that it can identify the phenotypic characteristics that discriminate and are associated with the clusters of a phylogenetic tree. We successfully applied this algorithm to the study of phylogenetic clusters of Pseudomonas species. CONCLUSION: The algorithm in the InforBIO software is a novel and useful approach for microbial polyphasic studies. The algorithm can also be applied to diverse cluster analyses. The InforBIO software is available from the download site . This software is free for personal but not commercial use. BioMed Central 2007-08-02 /pmc/articles/PMC1973088/ /pubmed/17683520 http://dx.doi.org/10.1186/1471-2105-8-281 Text en Copyright © 2007 Tanaka et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Software Tanaka, Naoto Uchino, Masataka Miyazaki, Satoru Sugawara, Hideaki Identification of discriminative characteristics for clusters from biologic data with InforBIO software |
title | Identification of discriminative characteristics for clusters from biologic data with InforBIO software |
title_full | Identification of discriminative characteristics for clusters from biologic data with InforBIO software |
title_fullStr | Identification of discriminative characteristics for clusters from biologic data with InforBIO software |
title_full_unstemmed | Identification of discriminative characteristics for clusters from biologic data with InforBIO software |
title_short | Identification of discriminative characteristics for clusters from biologic data with InforBIO software |
title_sort | identification of discriminative characteristics for clusters from biologic data with inforbio software |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1973088/ https://www.ncbi.nlm.nih.gov/pubmed/17683520 http://dx.doi.org/10.1186/1471-2105-8-281 |
work_keys_str_mv | AT tanakanaoto identificationofdiscriminativecharacteristicsforclustersfrombiologicdatawithinforbiosoftware AT uchinomasataka identificationofdiscriminativecharacteristicsforclustersfrombiologicdatawithinforbiosoftware AT miyazakisatoru identificationofdiscriminativecharacteristicsforclustersfrombiologicdatawithinforbiosoftware AT sugawarahideaki identificationofdiscriminativecharacteristicsforclustersfrombiologicdatawithinforbiosoftware |