Cargando…

Measuring similarity between gene expression profiles: a Bayesian approach

BACKGROUND: Grouping genes into clusters on the basis of similarity between their expression profiles has been the main approach to predict functional modules, from which important inference or further investigation decision could be made. While the univocal determination of similarity metric is imp...

Descripción completa

Detalles Bibliográficos
Autores principales: Nguyen, Viet-Anh, Lió, Pietro
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2788366/
https://www.ncbi.nlm.nih.gov/pubmed/19958477
http://dx.doi.org/10.1186/1471-2164-10-S3-S14
_version_ 1782174965043298304
author Nguyen, Viet-Anh
Lió, Pietro
author_facet Nguyen, Viet-Anh
Lió, Pietro
author_sort Nguyen, Viet-Anh
collection PubMed
description BACKGROUND: Grouping genes into clusters on the basis of similarity between their expression profiles has been the main approach to predict functional modules, from which important inference or further investigation decision could be made. While the univocal determination of similarity metric is important, current practices are normally involved with Euclidean distance and Pearson correlation, of which assumptions are not likely the case for high-throughput microarray data. RESULTS: We advocate the use of a novel metric - BayesGen - to measure similarity between gene expression profiles, and demonstrate its performance on two important applications: constructing genome-wide co-expression network, and clustering cancer human tissues into subtypes. BayesGen is formulated as the evidence ratio between two alternative hypotheses about the generating mechanism of a given pair of genes, and incorporates as prior knowledge the global characteristics of the whole dataset. Through the joint modelling of expected intensity levels and noise variances, it addresses the inherent nonlinearity and the association of noise levels across different microarray value ranges. The full Bayesian formulation also facilitates the possibility of meta-analysis. CONCLUSION: BayesGen allows more effective extraction of similarity information between genes from microarray expression data, which has significant effect on various inference tasks. It also provides a robust choice for other object-feature data, as illustrated through the results of the test on synthetic data.
format Text
id pubmed-2788366
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27883662009-12-04 Measuring similarity between gene expression profiles: a Bayesian approach Nguyen, Viet-Anh Lió, Pietro BMC Genomics Proceedings BACKGROUND: Grouping genes into clusters on the basis of similarity between their expression profiles has been the main approach to predict functional modules, from which important inference or further investigation decision could be made. While the univocal determination of similarity metric is important, current practices are normally involved with Euclidean distance and Pearson correlation, of which assumptions are not likely the case for high-throughput microarray data. RESULTS: We advocate the use of a novel metric - BayesGen - to measure similarity between gene expression profiles, and demonstrate its performance on two important applications: constructing genome-wide co-expression network, and clustering cancer human tissues into subtypes. BayesGen is formulated as the evidence ratio between two alternative hypotheses about the generating mechanism of a given pair of genes, and incorporates as prior knowledge the global characteristics of the whole dataset. Through the joint modelling of expected intensity levels and noise variances, it addresses the inherent nonlinearity and the association of noise levels across different microarray value ranges. The full Bayesian formulation also facilitates the possibility of meta-analysis. CONCLUSION: BayesGen allows more effective extraction of similarity information between genes from microarray expression data, which has significant effect on various inference tasks. It also provides a robust choice for other object-feature data, as illustrated through the results of the test on synthetic data. BioMed Central 2009-12-03 /pmc/articles/PMC2788366/ /pubmed/19958477 http://dx.doi.org/10.1186/1471-2164-10-S3-S14 Text en Copyright ©2009 Nguyen and Lió; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Nguyen, Viet-Anh
Lió, Pietro
Measuring similarity between gene expression profiles: a Bayesian approach
title Measuring similarity between gene expression profiles: a Bayesian approach
title_full Measuring similarity between gene expression profiles: a Bayesian approach
title_fullStr Measuring similarity between gene expression profiles: a Bayesian approach
title_full_unstemmed Measuring similarity between gene expression profiles: a Bayesian approach
title_short Measuring similarity between gene expression profiles: a Bayesian approach
title_sort measuring similarity between gene expression profiles: a bayesian approach
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2788366/
https://www.ncbi.nlm.nih.gov/pubmed/19958477
http://dx.doi.org/10.1186/1471-2164-10-S3-S14
work_keys_str_mv AT nguyenvietanh measuringsimilaritybetweengeneexpressionprofilesabayesianapproach
AT liopietro measuringsimilaritybetweengeneexpressionprofilesabayesianapproach