Cargando…

Mixture modeling of microarray gene expression data

About 28% of genes appear to have an expression pattern that follows a mixture distribution. We use first- and second-order partial correlation coefficients to identify trios and quartets of non-sex-linked genes that are highly associated and that are also mixtures. We identified 18 trio and 35 quar...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Yang, Tashman, Adam P, Lee, Jung Yeon, Yoon, Seungtai, Mao, Wenyang, Ahn, Kwangmi, Kim, Wonkuk, Mendell, Nancy R, Gordon, Derek, Finch, Stephen J
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2367561/
https://www.ncbi.nlm.nih.gov/pubmed/18466550
_version_ 1782154321461248000
author Yang, Yang
Tashman, Adam P
Lee, Jung Yeon
Yoon, Seungtai
Mao, Wenyang
Ahn, Kwangmi
Kim, Wonkuk
Mendell, Nancy R
Gordon, Derek
Finch, Stephen J
author_facet Yang, Yang
Tashman, Adam P
Lee, Jung Yeon
Yoon, Seungtai
Mao, Wenyang
Ahn, Kwangmi
Kim, Wonkuk
Mendell, Nancy R
Gordon, Derek
Finch, Stephen J
author_sort Yang, Yang
collection PubMed
description About 28% of genes appear to have an expression pattern that follows a mixture distribution. We use first- and second-order partial correlation coefficients to identify trios and quartets of non-sex-linked genes that are highly associated and that are also mixtures. We identified 18 trio and 35 quartet mixtures and evaluated their mixture distribution concordance. Concordance was defined as the proportion of observations that simultaneously fall in the component with the higher mean or simultaneously in the component with the lower mean based on their Bayesian posterior probabilities. These trios and quartets have a concordance rate greater than 80%. There are 33 genes involved in these trios and quartets. A factor analysis with varimax rotation identifies three gene groups based on their factor loadings. One group of 18 genes has a concordance rate of 56.7%, another group of 8 genes has a concordance rate of 60.8%, and a third group of 7 genes has a concordance rate of 69.6%. Each of these rates is highly significant, suggesting that there may be strong biological underpinnings for the mixture mechanisms of these genes. Bayesian factor screening confirms this hypothesis by identifying six single-nucleotide polymorphisms that are significantly associated with the expression phenotypes of the five most concordant genes in the first group.
format Text
id pubmed-2367561
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-23675612008-05-06 Mixture modeling of microarray gene expression data Yang, Yang Tashman, Adam P Lee, Jung Yeon Yoon, Seungtai Mao, Wenyang Ahn, Kwangmi Kim, Wonkuk Mendell, Nancy R Gordon, Derek Finch, Stephen J BMC Proc Proceedings About 28% of genes appear to have an expression pattern that follows a mixture distribution. We use first- and second-order partial correlation coefficients to identify trios and quartets of non-sex-linked genes that are highly associated and that are also mixtures. We identified 18 trio and 35 quartet mixtures and evaluated their mixture distribution concordance. Concordance was defined as the proportion of observations that simultaneously fall in the component with the higher mean or simultaneously in the component with the lower mean based on their Bayesian posterior probabilities. These trios and quartets have a concordance rate greater than 80%. There are 33 genes involved in these trios and quartets. A factor analysis with varimax rotation identifies three gene groups based on their factor loadings. One group of 18 genes has a concordance rate of 56.7%, another group of 8 genes has a concordance rate of 60.8%, and a third group of 7 genes has a concordance rate of 69.6%. Each of these rates is highly significant, suggesting that there may be strong biological underpinnings for the mixture mechanisms of these genes. Bayesian factor screening confirms this hypothesis by identifying six single-nucleotide polymorphisms that are significantly associated with the expression phenotypes of the five most concordant genes in the first group. BioMed Central 2007-12-18 /pmc/articles/PMC2367561/ /pubmed/18466550 Text en Copyright © 2007 Yang et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Yang, Yang
Tashman, Adam P
Lee, Jung Yeon
Yoon, Seungtai
Mao, Wenyang
Ahn, Kwangmi
Kim, Wonkuk
Mendell, Nancy R
Gordon, Derek
Finch, Stephen J
Mixture modeling of microarray gene expression data
title Mixture modeling of microarray gene expression data
title_full Mixture modeling of microarray gene expression data
title_fullStr Mixture modeling of microarray gene expression data
title_full_unstemmed Mixture modeling of microarray gene expression data
title_short Mixture modeling of microarray gene expression data
title_sort mixture modeling of microarray gene expression data
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2367561/
https://www.ncbi.nlm.nih.gov/pubmed/18466550
work_keys_str_mv AT yangyang mixturemodelingofmicroarraygeneexpressiondata
AT tashmanadamp mixturemodelingofmicroarraygeneexpressiondata
AT leejungyeon mixturemodelingofmicroarraygeneexpressiondata
AT yoonseungtai mixturemodelingofmicroarraygeneexpressiondata
AT maowenyang mixturemodelingofmicroarraygeneexpressiondata
AT ahnkwangmi mixturemodelingofmicroarraygeneexpressiondata
AT kimwonkuk mixturemodelingofmicroarraygeneexpressiondata
AT mendellnancyr mixturemodelingofmicroarraygeneexpressiondata
AT gordonderek mixturemodelingofmicroarraygeneexpressiondata
AT finchstephenj mixturemodelingofmicroarraygeneexpressiondata