Cargando…

In silico microdissection of microarray data from heterogeneous cell populations

BACKGROUND: Very few analytical approaches have been reported to resolve the variability in microarray measurements stemming from sample heterogeneity. For example, tissue samples used in cancer studies are usually contaminated with the surrounding or infiltrating cell types. This heterogeneity in t...

Descripción completa

Detalles Bibliográficos
Autores principales: Lähdesmäki, Harri, Shmulevich, llya, Dunmire, Valerie, Yli-Harja, Olli, Zhang, Wei
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1274251/
https://www.ncbi.nlm.nih.gov/pubmed/15766384
http://dx.doi.org/10.1186/1471-2105-6-54
_version_ 1782125973177630720
author Lähdesmäki, Harri
Shmulevich, llya
Dunmire, Valerie
Yli-Harja, Olli
Zhang, Wei
author_facet Lähdesmäki, Harri
Shmulevich, llya
Dunmire, Valerie
Yli-Harja, Olli
Zhang, Wei
author_sort Lähdesmäki, Harri
collection PubMed
description BACKGROUND: Very few analytical approaches have been reported to resolve the variability in microarray measurements stemming from sample heterogeneity. For example, tissue samples used in cancer studies are usually contaminated with the surrounding or infiltrating cell types. This heterogeneity in the sample preparation hinders further statistical analysis, significantly so if different samples contain different proportions of these cell types. Thus, sample heterogeneity can result in the identification of differentially expressed genes that may be unrelated to the biological question being studied. Similarly, irrelevant gene combinations can be discovered in the case of gene expression based classification. RESULTS: We propose a computational framework for removing the effects of sample heterogeneity by "microdissecting" microarray data in silico. The computational method provides estimates of the expression values of the pure (non-heterogeneous) cell samples. The inversion of the sample heterogeneity can be facilitated by providing accurate estimates of the mixing percentages of different cell types in each measurement. For those cases where no such information is available, we develop an optimization-based method for joint estimation of the mixing percentages and the expression values of the pure cell samples. We also consider the problem of selecting the correct number of cell types. CONCLUSION: The efficiency of the proposed methods is illustrated by applying them to a carefully controlled cDNA microarray data obtained from heterogeneous samples. The results demonstrate that the methods are capable of reconstructing both the sample and cell type specific expression values from heterogeneous mixtures and that the mixing percentages of different cell types can also be estimated. Furthermore, a general purpose model selection method can be used to select the correct number of cell types.
format Text
id pubmed-1274251
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-12742512005-10-29 In silico microdissection of microarray data from heterogeneous cell populations Lähdesmäki, Harri Shmulevich, llya Dunmire, Valerie Yli-Harja, Olli Zhang, Wei BMC Bioinformatics Methodology Article BACKGROUND: Very few analytical approaches have been reported to resolve the variability in microarray measurements stemming from sample heterogeneity. For example, tissue samples used in cancer studies are usually contaminated with the surrounding or infiltrating cell types. This heterogeneity in the sample preparation hinders further statistical analysis, significantly so if different samples contain different proportions of these cell types. Thus, sample heterogeneity can result in the identification of differentially expressed genes that may be unrelated to the biological question being studied. Similarly, irrelevant gene combinations can be discovered in the case of gene expression based classification. RESULTS: We propose a computational framework for removing the effects of sample heterogeneity by "microdissecting" microarray data in silico. The computational method provides estimates of the expression values of the pure (non-heterogeneous) cell samples. The inversion of the sample heterogeneity can be facilitated by providing accurate estimates of the mixing percentages of different cell types in each measurement. For those cases where no such information is available, we develop an optimization-based method for joint estimation of the mixing percentages and the expression values of the pure cell samples. We also consider the problem of selecting the correct number of cell types. CONCLUSION: The efficiency of the proposed methods is illustrated by applying them to a carefully controlled cDNA microarray data obtained from heterogeneous samples. The results demonstrate that the methods are capable of reconstructing both the sample and cell type specific expression values from heterogeneous mixtures and that the mixing percentages of different cell types can also be estimated. Furthermore, a general purpose model selection method can be used to select the correct number of cell types. BioMed Central 2005-03-14 /pmc/articles/PMC1274251/ /pubmed/15766384 http://dx.doi.org/10.1186/1471-2105-6-54 Text en Copyright © 2005 Lähdesmäki et al; licensee BioMed Central Ltd.
spellingShingle Methodology Article
Lähdesmäki, Harri
Shmulevich, llya
Dunmire, Valerie
Yli-Harja, Olli
Zhang, Wei
In silico microdissection of microarray data from heterogeneous cell populations
title In silico microdissection of microarray data from heterogeneous cell populations
title_full In silico microdissection of microarray data from heterogeneous cell populations
title_fullStr In silico microdissection of microarray data from heterogeneous cell populations
title_full_unstemmed In silico microdissection of microarray data from heterogeneous cell populations
title_short In silico microdissection of microarray data from heterogeneous cell populations
title_sort in silico microdissection of microarray data from heterogeneous cell populations
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1274251/
https://www.ncbi.nlm.nih.gov/pubmed/15766384
http://dx.doi.org/10.1186/1471-2105-6-54
work_keys_str_mv AT lahdesmakiharri insilicomicrodissectionofmicroarraydatafromheterogeneouscellpopulations
AT shmulevichllya insilicomicrodissectionofmicroarraydatafromheterogeneouscellpopulations
AT dunmirevalerie insilicomicrodissectionofmicroarraydatafromheterogeneouscellpopulations
AT yliharjaolli insilicomicrodissectionofmicroarraydatafromheterogeneouscellpopulations
AT zhangwei insilicomicrodissectionofmicroarraydatafromheterogeneouscellpopulations