Cargando…

Gene Cluster Profile Vectors: a method to infer functionally related gene sets by grouping proximity-based gene clusters

BACKGROUND: Proximity-based methods and co-evolution-based phylogenetic profiles methods have been successfully used for the identification of functionally related genes. Proximity-based methods are effective for physically clustered genes while the phylogenetic profiles method is effective for co-o...

Descripción completa

Detalles Bibliográficos
Autores principales: Pejaver, Vikas Rao, Kim, Sun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3194233/
https://www.ncbi.nlm.nih.gov/pubmed/21989079
http://dx.doi.org/10.1186/1471-2164-12-S2-S2
_version_ 1782213934159233024
author Pejaver, Vikas Rao
Kim, Sun
author_facet Pejaver, Vikas Rao
Kim, Sun
author_sort Pejaver, Vikas Rao
collection PubMed
description BACKGROUND: Proximity-based methods and co-evolution-based phylogenetic profiles methods have been successfully used for the identification of functionally related genes. Proximity-based methods are effective for physically clustered genes while the phylogenetic profiles method is effective for co-occurring gene sets. However, both methods predict many false positives and false negatives. In this paper, we propose the Gene Cluster Profile Vector (GCPV) method, which combines these two methods by using phylogenetic profiles of whole gene clusters. The GCPV method is, currently, the only genome comparison based method that allows for the characterization of relationships between gene clusters based profiles of individual genes in clusters. RESULTS: The GCPV method groups together reasonably related operons in E. coli about 60% of the time. The method is not sensitive to the choice of a reference genome set used and it outperforms the conventional phylogenetic profiles method. Finally, we show that the method works well for predicted gene clusters from C. crescentus and can serve as an important tool not only for understanding gene function, but also for elucidating mechanisms of general biological processes. CONCLUSIONS: The GCPV method has shown to be an effective and robust approach to the prediction of functionally related gene sets from proximity-based gene clusters or operons.
format Online
Article
Text
id pubmed-3194233
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-31942332011-10-17 Gene Cluster Profile Vectors: a method to infer functionally related gene sets by grouping proximity-based gene clusters Pejaver, Vikas Rao Kim, Sun BMC Genomics Proceedings BACKGROUND: Proximity-based methods and co-evolution-based phylogenetic profiles methods have been successfully used for the identification of functionally related genes. Proximity-based methods are effective for physically clustered genes while the phylogenetic profiles method is effective for co-occurring gene sets. However, both methods predict many false positives and false negatives. In this paper, we propose the Gene Cluster Profile Vector (GCPV) method, which combines these two methods by using phylogenetic profiles of whole gene clusters. The GCPV method is, currently, the only genome comparison based method that allows for the characterization of relationships between gene clusters based profiles of individual genes in clusters. RESULTS: The GCPV method groups together reasonably related operons in E. coli about 60% of the time. The method is not sensitive to the choice of a reference genome set used and it outperforms the conventional phylogenetic profiles method. Finally, we show that the method works well for predicted gene clusters from C. crescentus and can serve as an important tool not only for understanding gene function, but also for elucidating mechanisms of general biological processes. CONCLUSIONS: The GCPV method has shown to be an effective and robust approach to the prediction of functionally related gene sets from proximity-based gene clusters or operons. BioMed Central 2011-07-27 /pmc/articles/PMC3194233/ /pubmed/21989079 http://dx.doi.org/10.1186/1471-2164-12-S2-S2 Text en Copyright ©2011 Pejaver and Kim; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Pejaver, Vikas Rao
Kim, Sun
Gene Cluster Profile Vectors: a method to infer functionally related gene sets by grouping proximity-based gene clusters
title Gene Cluster Profile Vectors: a method to infer functionally related gene sets by grouping proximity-based gene clusters
title_full Gene Cluster Profile Vectors: a method to infer functionally related gene sets by grouping proximity-based gene clusters
title_fullStr Gene Cluster Profile Vectors: a method to infer functionally related gene sets by grouping proximity-based gene clusters
title_full_unstemmed Gene Cluster Profile Vectors: a method to infer functionally related gene sets by grouping proximity-based gene clusters
title_short Gene Cluster Profile Vectors: a method to infer functionally related gene sets by grouping proximity-based gene clusters
title_sort gene cluster profile vectors: a method to infer functionally related gene sets by grouping proximity-based gene clusters
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3194233/
https://www.ncbi.nlm.nih.gov/pubmed/21989079
http://dx.doi.org/10.1186/1471-2164-12-S2-S2
work_keys_str_mv AT pejavervikasrao geneclusterprofilevectorsamethodtoinferfunctionallyrelatedgenesetsbygroupingproximitybasedgeneclusters
AT kimsun geneclusterprofilevectorsamethodtoinferfunctionallyrelatedgenesetsbygroupingproximitybasedgeneclusters