Cargando…

Discovering transcriptional modules by Bayesian data integration

Motivation: We present a method for directly inferring transcriptional modules (TMs) by integrating gene expression and transcription factor binding (ChIP-chip) data. Our model extends a hierarchical Dirichlet process mixture model to allow data fusion on a gene-by-gene basis. This encodes the intui...

Descripción completa

Detalles Bibliográficos
Autores principales:	Savage, Richard S., Ghahramani, Zoubin, Griffin, Jim E., de la Cruz, Bernard J., Wild, David L.
Formato:	Texto
Lenguaje:	English
Publicado:	Oxford University Press 2010
Materias:	Ismb 2010 Conference Proceedings July 11 to July 13, 2010, Boston, Ma, Usa
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2881394/ https://www.ncbi.nlm.nih.gov/pubmed/20529901 http://dx.doi.org/10.1093/bioinformatics/btq210

_version_	1782182114195668992
author	Savage, Richard S. Ghahramani, Zoubin Griffin, Jim E. de la Cruz, Bernard J. Wild, David L.
author_facet	Savage, Richard S. Ghahramani, Zoubin Griffin, Jim E. de la Cruz, Bernard J. Wild, David L.
author_sort	Savage, Richard S.
collection	PubMed
description	Motivation: We present a method for directly inferring transcriptional modules (TMs) by integrating gene expression and transcription factor binding (ChIP-chip) data. Our model extends a hierarchical Dirichlet process mixture model to allow data fusion on a gene-by-gene basis. This encodes the intuition that co-expression and co-regulation are not necessarily equivalent and hence we do not expect all genes to group similarly in both datasets. In particular, it allows us to identify the subset of genes that share the same structure of transcriptional modules in both datasets. Results: We find that by working on a gene-by-gene basis, our model is able to extract clusters with greater functional coherence than existing methods. By combining gene expression and transcription factor binding (ChIP-chip) data in this way, we are better able to determine the groups of genes that are most likely to represent underlying TMs. Availability: If interested in the code for the work presented in this article, please contact the authors. Contact: d.l.wild@warwick.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online.
format	Text
id	pubmed-2881394
institution	National Center for Biotechnology Information
language	English
publishDate	2010
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-28813942010-06-08 Discovering transcriptional modules by Bayesian data integration Savage, Richard S. Ghahramani, Zoubin Griffin, Jim E. de la Cruz, Bernard J. Wild, David L. Bioinformatics Ismb 2010 Conference Proceedings July 11 to July 13, 2010, Boston, Ma, Usa Motivation: We present a method for directly inferring transcriptional modules (TMs) by integrating gene expression and transcription factor binding (ChIP-chip) data. Our model extends a hierarchical Dirichlet process mixture model to allow data fusion on a gene-by-gene basis. This encodes the intuition that co-expression and co-regulation are not necessarily equivalent and hence we do not expect all genes to group similarly in both datasets. In particular, it allows us to identify the subset of genes that share the same structure of transcriptional modules in both datasets. Results: We find that by working on a gene-by-gene basis, our model is able to extract clusters with greater functional coherence than existing methods. By combining gene expression and transcription factor binding (ChIP-chip) data in this way, we are better able to determine the groups of genes that are most likely to represent underlying TMs. Availability: If interested in the code for the work presented in this article, please contact the authors. Contact: d.l.wild@warwick.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2010-06-15 2010-06-01 /pmc/articles/PMC2881394/ /pubmed/20529901 http://dx.doi.org/10.1093/bioinformatics/btq210 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Ismb 2010 Conference Proceedings July 11 to July 13, 2010, Boston, Ma, Usa Savage, Richard S. Ghahramani, Zoubin Griffin, Jim E. de la Cruz, Bernard J. Wild, David L. Discovering transcriptional modules by Bayesian data integration
title	Discovering transcriptional modules by Bayesian data integration
title_full	Discovering transcriptional modules by Bayesian data integration
title_fullStr	Discovering transcriptional modules by Bayesian data integration
title_full_unstemmed	Discovering transcriptional modules by Bayesian data integration
title_short	Discovering transcriptional modules by Bayesian data integration
title_sort	discovering transcriptional modules by bayesian data integration
topic	Ismb 2010 Conference Proceedings July 11 to July 13, 2010, Boston, Ma, Usa
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2881394/ https://www.ncbi.nlm.nih.gov/pubmed/20529901 http://dx.doi.org/10.1093/bioinformatics/btq210
work_keys_str_mv	AT savagerichards discoveringtranscriptionalmodulesbybayesiandataintegration AT ghahramanizoubin discoveringtranscriptionalmodulesbybayesiandataintegration AT griffinjime discoveringtranscriptionalmodulesbybayesiandataintegration AT delacruzbernardj discoveringtranscriptionalmodulesbybayesiandataintegration AT wilddavidl discoveringtranscriptionalmodulesbybayesiandataintegration

Discovering transcriptional modules by Bayesian data integration

Ejemplares similares