Cargando…

Hierarchical classification of functionally equivalent genes in prokaryotes

Functional classification of genes represents a fundamental problem to many biological studies. Most of the existing classification schemes are based on the concepts of homology and orthology, which were originally introduced to study gene evolution but might not be the most appropriate for gene fun...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Hongwei, Mao, Fenglou, Olman, Victor, Xu, Ying
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1874638/
https://www.ncbi.nlm.nih.gov/pubmed/17353185
http://dx.doi.org/10.1093/nar/gkl1114
_version_ 1782133493285781504
author Wu, Hongwei
Mao, Fenglou
Olman, Victor
Xu, Ying
author_facet Wu, Hongwei
Mao, Fenglou
Olman, Victor
Xu, Ying
author_sort Wu, Hongwei
collection PubMed
description Functional classification of genes represents a fundamental problem to many biological studies. Most of the existing classification schemes are based on the concepts of homology and orthology, which were originally introduced to study gene evolution but might not be the most appropriate for gene function prediction, particularly at high resolution level. We have recently developed a scheme for hierarchical classification of genes (HCGs) in prokaryotes. In the HCG scheme, the functional equivalence relationships among genes are first assessed through a careful application of both sequence similarity and genomic neighborhood information; and genes are then classified into a hierarchical structure of clusters, where genes in each cluster are functionally equivalent at some resolution level, and the level of resolution goes higher as the clusters become increasingly smaller traveling down the hierarchy. The HCG scheme is validated through comparisons with the taxonomy of the prokaryotic genomes, Clusters of Orthologous Groups (COGs) of genes and the Pfam system. We have applied the HCG scheme to 224 complete prokaryotic genomes, and constructed a HCG database consisting of a forest of 5339 multi-level and 15 770 single-level trees of gene clusters covering ∼93% of the genes of these 224 genomes. The validation results indicate that the HCG scheme not only captures the key features of the existing classification schemes but also provides a much richer organization of genes which can be used for functional prediction of genes at higher resolution and to help reveal evolutionary trace of the genes.
format Text
id pubmed-1874638
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-18746382007-05-25 Hierarchical classification of functionally equivalent genes in prokaryotes Wu, Hongwei Mao, Fenglou Olman, Victor Xu, Ying Nucleic Acids Res Computational Biology Functional classification of genes represents a fundamental problem to many biological studies. Most of the existing classification schemes are based on the concepts of homology and orthology, which were originally introduced to study gene evolution but might not be the most appropriate for gene function prediction, particularly at high resolution level. We have recently developed a scheme for hierarchical classification of genes (HCGs) in prokaryotes. In the HCG scheme, the functional equivalence relationships among genes are first assessed through a careful application of both sequence similarity and genomic neighborhood information; and genes are then classified into a hierarchical structure of clusters, where genes in each cluster are functionally equivalent at some resolution level, and the level of resolution goes higher as the clusters become increasingly smaller traveling down the hierarchy. The HCG scheme is validated through comparisons with the taxonomy of the prokaryotic genomes, Clusters of Orthologous Groups (COGs) of genes and the Pfam system. We have applied the HCG scheme to 224 complete prokaryotic genomes, and constructed a HCG database consisting of a forest of 5339 multi-level and 15 770 single-level trees of gene clusters covering ∼93% of the genes of these 224 genomes. The validation results indicate that the HCG scheme not only captures the key features of the existing classification schemes but also provides a much richer organization of genes which can be used for functional prediction of genes at higher resolution and to help reveal evolutionary trace of the genes. Oxford University Press 2007-04 2007-03-11 /pmc/articles/PMC1874638/ /pubmed/17353185 http://dx.doi.org/10.1093/nar/gkl1114 Text en © 2007 The Author(s) This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Computational Biology
Wu, Hongwei
Mao, Fenglou
Olman, Victor
Xu, Ying
Hierarchical classification of functionally equivalent genes in prokaryotes
title Hierarchical classification of functionally equivalent genes in prokaryotes
title_full Hierarchical classification of functionally equivalent genes in prokaryotes
title_fullStr Hierarchical classification of functionally equivalent genes in prokaryotes
title_full_unstemmed Hierarchical classification of functionally equivalent genes in prokaryotes
title_short Hierarchical classification of functionally equivalent genes in prokaryotes
title_sort hierarchical classification of functionally equivalent genes in prokaryotes
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1874638/
https://www.ncbi.nlm.nih.gov/pubmed/17353185
http://dx.doi.org/10.1093/nar/gkl1114
work_keys_str_mv AT wuhongwei hierarchicalclassificationoffunctionallyequivalentgenesinprokaryotes
AT maofenglou hierarchicalclassificationoffunctionallyequivalentgenesinprokaryotes
AT olmanvictor hierarchicalclassificationoffunctionallyequivalentgenesinprokaryotes
AT xuying hierarchicalclassificationoffunctionallyequivalentgenesinprokaryotes