Cargando…
Gene Cluster Profile Vectors: a method to infer functionally related gene sets by grouping proximity-based gene clusters
BACKGROUND: Proximity-based methods and co-evolution-based phylogenetic profiles methods have been successfully used for the identification of functionally related genes. Proximity-based methods are effective for physically clustered genes while the phylogenetic profiles method is effective for co-o...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3194233/ https://www.ncbi.nlm.nih.gov/pubmed/21989079 http://dx.doi.org/10.1186/1471-2164-12-S2-S2 |
_version_ | 1782213934159233024 |
---|---|
author | Pejaver, Vikas Rao Kim, Sun |
author_facet | Pejaver, Vikas Rao Kim, Sun |
author_sort | Pejaver, Vikas Rao |
collection | PubMed |
description | BACKGROUND: Proximity-based methods and co-evolution-based phylogenetic profiles methods have been successfully used for the identification of functionally related genes. Proximity-based methods are effective for physically clustered genes while the phylogenetic profiles method is effective for co-occurring gene sets. However, both methods predict many false positives and false negatives. In this paper, we propose the Gene Cluster Profile Vector (GCPV) method, which combines these two methods by using phylogenetic profiles of whole gene clusters. The GCPV method is, currently, the only genome comparison based method that allows for the characterization of relationships between gene clusters based profiles of individual genes in clusters. RESULTS: The GCPV method groups together reasonably related operons in E. coli about 60% of the time. The method is not sensitive to the choice of a reference genome set used and it outperforms the conventional phylogenetic profiles method. Finally, we show that the method works well for predicted gene clusters from C. crescentus and can serve as an important tool not only for understanding gene function, but also for elucidating mechanisms of general biological processes. CONCLUSIONS: The GCPV method has shown to be an effective and robust approach to the prediction of functionally related gene sets from proximity-based gene clusters or operons. |
format | Online Article Text |
id | pubmed-3194233 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-31942332011-10-17 Gene Cluster Profile Vectors: a method to infer functionally related gene sets by grouping proximity-based gene clusters Pejaver, Vikas Rao Kim, Sun BMC Genomics Proceedings BACKGROUND: Proximity-based methods and co-evolution-based phylogenetic profiles methods have been successfully used for the identification of functionally related genes. Proximity-based methods are effective for physically clustered genes while the phylogenetic profiles method is effective for co-occurring gene sets. However, both methods predict many false positives and false negatives. In this paper, we propose the Gene Cluster Profile Vector (GCPV) method, which combines these two methods by using phylogenetic profiles of whole gene clusters. The GCPV method is, currently, the only genome comparison based method that allows for the characterization of relationships between gene clusters based profiles of individual genes in clusters. RESULTS: The GCPV method groups together reasonably related operons in E. coli about 60% of the time. The method is not sensitive to the choice of a reference genome set used and it outperforms the conventional phylogenetic profiles method. Finally, we show that the method works well for predicted gene clusters from C. crescentus and can serve as an important tool not only for understanding gene function, but also for elucidating mechanisms of general biological processes. CONCLUSIONS: The GCPV method has shown to be an effective and robust approach to the prediction of functionally related gene sets from proximity-based gene clusters or operons. BioMed Central 2011-07-27 /pmc/articles/PMC3194233/ /pubmed/21989079 http://dx.doi.org/10.1186/1471-2164-12-S2-S2 Text en Copyright ©2011 Pejaver and Kim; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Pejaver, Vikas Rao Kim, Sun Gene Cluster Profile Vectors: a method to infer functionally related gene sets by grouping proximity-based gene clusters |
title | Gene Cluster Profile Vectors: a method to infer functionally related gene sets by grouping proximity-based gene clusters |
title_full | Gene Cluster Profile Vectors: a method to infer functionally related gene sets by grouping proximity-based gene clusters |
title_fullStr | Gene Cluster Profile Vectors: a method to infer functionally related gene sets by grouping proximity-based gene clusters |
title_full_unstemmed | Gene Cluster Profile Vectors: a method to infer functionally related gene sets by grouping proximity-based gene clusters |
title_short | Gene Cluster Profile Vectors: a method to infer functionally related gene sets by grouping proximity-based gene clusters |
title_sort | gene cluster profile vectors: a method to infer functionally related gene sets by grouping proximity-based gene clusters |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3194233/ https://www.ncbi.nlm.nih.gov/pubmed/21989079 http://dx.doi.org/10.1186/1471-2164-12-S2-S2 |
work_keys_str_mv | AT pejavervikasrao geneclusterprofilevectorsamethodtoinferfunctionallyrelatedgenesetsbygroupingproximitybasedgeneclusters AT kimsun geneclusterprofilevectorsamethodtoinferfunctionallyrelatedgenesetsbygroupingproximitybasedgeneclusters |