Cargando…
Inferring functional modules of protein families with probabilistic topic models
BACKGROUND: Genome and metagenome studies have identified thousands of protein families whose functions are poorly understood and for which techniques for functional characterization provide only partial information. For such proteins, the genome context can give further information about their func...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3098182/ https://www.ncbi.nlm.nih.gov/pubmed/21554720 http://dx.doi.org/10.1186/1471-2105-12-141 |
_version_ | 1782203930816544768 |
---|---|
author | Konietzny, Sebastian GA Dietz, Laura McHardy, Alice C |
author_facet | Konietzny, Sebastian GA Dietz, Laura McHardy, Alice C |
author_sort | Konietzny, Sebastian GA |
collection | PubMed |
description | BACKGROUND: Genome and metagenome studies have identified thousands of protein families whose functions are poorly understood and for which techniques for functional characterization provide only partial information. For such proteins, the genome context can give further information about their functional context. RESULTS: We describe a Bayesian method, based on a probabilistic topic model, which directly identifies functional modules of protein families. The method explores the co-occurrence patterns of protein families across a collection of sequence samples to infer a probabilistic model of arbitrarily-sized functional modules. CONCLUSIONS: We show that our method identifies protein modules - some of which correspond to well-known biological processes - that are tightly interconnected with known functional interactions and are different from the interactions identified by pairwise co-occurrence. The modules are not specific to any given organism and may combine different realizations of a protein complex or pathway within different taxa. |
format | Text |
id | pubmed-3098182 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-30981822011-05-20 Inferring functional modules of protein families with probabilistic topic models Konietzny, Sebastian GA Dietz, Laura McHardy, Alice C BMC Bioinformatics Methodology Article BACKGROUND: Genome and metagenome studies have identified thousands of protein families whose functions are poorly understood and for which techniques for functional characterization provide only partial information. For such proteins, the genome context can give further information about their functional context. RESULTS: We describe a Bayesian method, based on a probabilistic topic model, which directly identifies functional modules of protein families. The method explores the co-occurrence patterns of protein families across a collection of sequence samples to infer a probabilistic model of arbitrarily-sized functional modules. CONCLUSIONS: We show that our method identifies protein modules - some of which correspond to well-known biological processes - that are tightly interconnected with known functional interactions and are different from the interactions identified by pairwise co-occurrence. The modules are not specific to any given organism and may combine different realizations of a protein complex or pathway within different taxa. BioMed Central 2011-05-09 /pmc/articles/PMC3098182/ /pubmed/21554720 http://dx.doi.org/10.1186/1471-2105-12-141 Text en Copyright ©2011 Konietzny et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Article Konietzny, Sebastian GA Dietz, Laura McHardy, Alice C Inferring functional modules of protein families with probabilistic topic models |
title | Inferring functional modules of protein families with probabilistic topic models |
title_full | Inferring functional modules of protein families with probabilistic topic models |
title_fullStr | Inferring functional modules of protein families with probabilistic topic models |
title_full_unstemmed | Inferring functional modules of protein families with probabilistic topic models |
title_short | Inferring functional modules of protein families with probabilistic topic models |
title_sort | inferring functional modules of protein families with probabilistic topic models |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3098182/ https://www.ncbi.nlm.nih.gov/pubmed/21554720 http://dx.doi.org/10.1186/1471-2105-12-141 |
work_keys_str_mv | AT konietznysebastianga inferringfunctionalmodulesofproteinfamilieswithprobabilistictopicmodels AT dietzlaura inferringfunctionalmodulesofproteinfamilieswithprobabilistictopicmodels AT mchardyalicec inferringfunctionalmodulesofproteinfamilieswithprobabilistictopicmodels |