Cargando…

Variable Selection and Joint Estimation of Mean and Covariance Models with an Application to eQTL Data

In genomic data analysis, it is commonplace that underlying regulatory relationship over multiple genes is hardly ascertained due to unknown genetic complexity and epigenetic regulations. In this paper, we consider a joint mean and constant covariance model (JMCCM) that elucidates conditional depend...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, JungJun, Kim, SungHwan, Jhong, Jae-Hwan, Koo, Ja-Yong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6036858/
https://www.ncbi.nlm.nih.gov/pubmed/30046352
http://dx.doi.org/10.1155/2018/4626307
_version_ 1783338238471045120
author Lee, JungJun
Kim, SungHwan
Jhong, Jae-Hwan
Koo, Ja-Yong
author_facet Lee, JungJun
Kim, SungHwan
Jhong, Jae-Hwan
Koo, Ja-Yong
author_sort Lee, JungJun
collection PubMed
description In genomic data analysis, it is commonplace that underlying regulatory relationship over multiple genes is hardly ascertained due to unknown genetic complexity and epigenetic regulations. In this paper, we consider a joint mean and constant covariance model (JMCCM) that elucidates conditional dependent structures of genes with controlling for potential genotype perturbations. To this end, the modified Cholesky decomposition is utilized to parametrize entries of a precision matrix. The JMCCM maximizes the likelihood function to estimate parameters involved in the model. We also develop a variable selection algorithm that selects explanatory variables and Cholesky factors by exploiting the combination of the GCV and BIC as benchmarks, together with Rao and Wald statistics. Importantly, we notice that sparse estimation of a precision matrix (or equivalently gene network) is effectively achieved via the proposed variable selection scheme and contributes to exploring significant hub genes shown to be concordant to a priori biological evidence. In simulation studies, we confirm that our model selection efficiently identifies the true underlying networks. With an application to miRNA and SNPs data from yeast (a.k.a. eQTL data), we demonstrate that constructed gene networks reproduce validated biological and clinical knowledge with regard to various pathways including the cell cycle pathway.
format Online
Article
Text
id pubmed-6036858
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-60368582018-07-25 Variable Selection and Joint Estimation of Mean and Covariance Models with an Application to eQTL Data Lee, JungJun Kim, SungHwan Jhong, Jae-Hwan Koo, Ja-Yong Comput Math Methods Med Research Article In genomic data analysis, it is commonplace that underlying regulatory relationship over multiple genes is hardly ascertained due to unknown genetic complexity and epigenetic regulations. In this paper, we consider a joint mean and constant covariance model (JMCCM) that elucidates conditional dependent structures of genes with controlling for potential genotype perturbations. To this end, the modified Cholesky decomposition is utilized to parametrize entries of a precision matrix. The JMCCM maximizes the likelihood function to estimate parameters involved in the model. We also develop a variable selection algorithm that selects explanatory variables and Cholesky factors by exploiting the combination of the GCV and BIC as benchmarks, together with Rao and Wald statistics. Importantly, we notice that sparse estimation of a precision matrix (or equivalently gene network) is effectively achieved via the proposed variable selection scheme and contributes to exploring significant hub genes shown to be concordant to a priori biological evidence. In simulation studies, we confirm that our model selection efficiently identifies the true underlying networks. With an application to miRNA and SNPs data from yeast (a.k.a. eQTL data), we demonstrate that constructed gene networks reproduce validated biological and clinical knowledge with regard to various pathways including the cell cycle pathway. Hindawi 2018-06-25 /pmc/articles/PMC6036858/ /pubmed/30046352 http://dx.doi.org/10.1155/2018/4626307 Text en Copyright © 2018 JungJun Lee et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Lee, JungJun
Kim, SungHwan
Jhong, Jae-Hwan
Koo, Ja-Yong
Variable Selection and Joint Estimation of Mean and Covariance Models with an Application to eQTL Data
title Variable Selection and Joint Estimation of Mean and Covariance Models with an Application to eQTL Data
title_full Variable Selection and Joint Estimation of Mean and Covariance Models with an Application to eQTL Data
title_fullStr Variable Selection and Joint Estimation of Mean and Covariance Models with an Application to eQTL Data
title_full_unstemmed Variable Selection and Joint Estimation of Mean and Covariance Models with an Application to eQTL Data
title_short Variable Selection and Joint Estimation of Mean and Covariance Models with an Application to eQTL Data
title_sort variable selection and joint estimation of mean and covariance models with an application to eqtl data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6036858/
https://www.ncbi.nlm.nih.gov/pubmed/30046352
http://dx.doi.org/10.1155/2018/4626307
work_keys_str_mv AT leejungjun variableselectionandjointestimationofmeanandcovariancemodelswithanapplicationtoeqtldata
AT kimsunghwan variableselectionandjointestimationofmeanandcovariancemodelswithanapplicationtoeqtldata
AT jhongjaehwan variableselectionandjointestimationofmeanandcovariancemodelswithanapplicationtoeqtldata
AT koojayong variableselectionandjointestimationofmeanandcovariancemodelswithanapplicationtoeqtldata