Cargando…

Inferring functional modules of protein families with probabilistic topic models

BACKGROUND: Genome and metagenome studies have identified thousands of protein families whose functions are poorly understood and for which techniques for functional characterization provide only partial information. For such proteins, the genome context can give further information about their func...

Descripción completa

Detalles Bibliográficos
Autores principales: Konietzny, Sebastian GA, Dietz, Laura, McHardy, Alice C
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3098182/
https://www.ncbi.nlm.nih.gov/pubmed/21554720
http://dx.doi.org/10.1186/1471-2105-12-141
_version_ 1782203930816544768
author Konietzny, Sebastian GA
Dietz, Laura
McHardy, Alice C
author_facet Konietzny, Sebastian GA
Dietz, Laura
McHardy, Alice C
author_sort Konietzny, Sebastian GA
collection PubMed
description BACKGROUND: Genome and metagenome studies have identified thousands of protein families whose functions are poorly understood and for which techniques for functional characterization provide only partial information. For such proteins, the genome context can give further information about their functional context. RESULTS: We describe a Bayesian method, based on a probabilistic topic model, which directly identifies functional modules of protein families. The method explores the co-occurrence patterns of protein families across a collection of sequence samples to infer a probabilistic model of arbitrarily-sized functional modules. CONCLUSIONS: We show that our method identifies protein modules - some of which correspond to well-known biological processes - that are tightly interconnected with known functional interactions and are different from the interactions identified by pairwise co-occurrence. The modules are not specific to any given organism and may combine different realizations of a protein complex or pathway within different taxa.
format Text
id pubmed-3098182
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-30981822011-05-20 Inferring functional modules of protein families with probabilistic topic models Konietzny, Sebastian GA Dietz, Laura McHardy, Alice C BMC Bioinformatics Methodology Article BACKGROUND: Genome and metagenome studies have identified thousands of protein families whose functions are poorly understood and for which techniques for functional characterization provide only partial information. For such proteins, the genome context can give further information about their functional context. RESULTS: We describe a Bayesian method, based on a probabilistic topic model, which directly identifies functional modules of protein families. The method explores the co-occurrence patterns of protein families across a collection of sequence samples to infer a probabilistic model of arbitrarily-sized functional modules. CONCLUSIONS: We show that our method identifies protein modules - some of which correspond to well-known biological processes - that are tightly interconnected with known functional interactions and are different from the interactions identified by pairwise co-occurrence. The modules are not specific to any given organism and may combine different realizations of a protein complex or pathway within different taxa. BioMed Central 2011-05-09 /pmc/articles/PMC3098182/ /pubmed/21554720 http://dx.doi.org/10.1186/1471-2105-12-141 Text en Copyright ©2011 Konietzny et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Konietzny, Sebastian GA
Dietz, Laura
McHardy, Alice C
Inferring functional modules of protein families with probabilistic topic models
title Inferring functional modules of protein families with probabilistic topic models
title_full Inferring functional modules of protein families with probabilistic topic models
title_fullStr Inferring functional modules of protein families with probabilistic topic models
title_full_unstemmed Inferring functional modules of protein families with probabilistic topic models
title_short Inferring functional modules of protein families with probabilistic topic models
title_sort inferring functional modules of protein families with probabilistic topic models
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3098182/
https://www.ncbi.nlm.nih.gov/pubmed/21554720
http://dx.doi.org/10.1186/1471-2105-12-141
work_keys_str_mv AT konietznysebastianga inferringfunctionalmodulesofproteinfamilieswithprobabilistictopicmodels
AT dietzlaura inferringfunctionalmodulesofproteinfamilieswithprobabilistictopicmodels
AT mchardyalicec inferringfunctionalmodulesofproteinfamilieswithprobabilistictopicmodels