Cargando…
Variable Selection and Joint Estimation of Mean and Covariance Models with an Application to eQTL Data
In genomic data analysis, it is commonplace that underlying regulatory relationship over multiple genes is hardly ascertained due to unknown genetic complexity and epigenetic regulations. In this paper, we consider a joint mean and constant covariance model (JMCCM) that elucidates conditional depend...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6036858/ https://www.ncbi.nlm.nih.gov/pubmed/30046352 http://dx.doi.org/10.1155/2018/4626307 |
_version_ | 1783338238471045120 |
---|---|
author | Lee, JungJun Kim, SungHwan Jhong, Jae-Hwan Koo, Ja-Yong |
author_facet | Lee, JungJun Kim, SungHwan Jhong, Jae-Hwan Koo, Ja-Yong |
author_sort | Lee, JungJun |
collection | PubMed |
description | In genomic data analysis, it is commonplace that underlying regulatory relationship over multiple genes is hardly ascertained due to unknown genetic complexity and epigenetic regulations. In this paper, we consider a joint mean and constant covariance model (JMCCM) that elucidates conditional dependent structures of genes with controlling for potential genotype perturbations. To this end, the modified Cholesky decomposition is utilized to parametrize entries of a precision matrix. The JMCCM maximizes the likelihood function to estimate parameters involved in the model. We also develop a variable selection algorithm that selects explanatory variables and Cholesky factors by exploiting the combination of the GCV and BIC as benchmarks, together with Rao and Wald statistics. Importantly, we notice that sparse estimation of a precision matrix (or equivalently gene network) is effectively achieved via the proposed variable selection scheme and contributes to exploring significant hub genes shown to be concordant to a priori biological evidence. In simulation studies, we confirm that our model selection efficiently identifies the true underlying networks. With an application to miRNA and SNPs data from yeast (a.k.a. eQTL data), we demonstrate that constructed gene networks reproduce validated biological and clinical knowledge with regard to various pathways including the cell cycle pathway. |
format | Online Article Text |
id | pubmed-6036858 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-60368582018-07-25 Variable Selection and Joint Estimation of Mean and Covariance Models with an Application to eQTL Data Lee, JungJun Kim, SungHwan Jhong, Jae-Hwan Koo, Ja-Yong Comput Math Methods Med Research Article In genomic data analysis, it is commonplace that underlying regulatory relationship over multiple genes is hardly ascertained due to unknown genetic complexity and epigenetic regulations. In this paper, we consider a joint mean and constant covariance model (JMCCM) that elucidates conditional dependent structures of genes with controlling for potential genotype perturbations. To this end, the modified Cholesky decomposition is utilized to parametrize entries of a precision matrix. The JMCCM maximizes the likelihood function to estimate parameters involved in the model. We also develop a variable selection algorithm that selects explanatory variables and Cholesky factors by exploiting the combination of the GCV and BIC as benchmarks, together with Rao and Wald statistics. Importantly, we notice that sparse estimation of a precision matrix (or equivalently gene network) is effectively achieved via the proposed variable selection scheme and contributes to exploring significant hub genes shown to be concordant to a priori biological evidence. In simulation studies, we confirm that our model selection efficiently identifies the true underlying networks. With an application to miRNA and SNPs data from yeast (a.k.a. eQTL data), we demonstrate that constructed gene networks reproduce validated biological and clinical knowledge with regard to various pathways including the cell cycle pathway. Hindawi 2018-06-25 /pmc/articles/PMC6036858/ /pubmed/30046352 http://dx.doi.org/10.1155/2018/4626307 Text en Copyright © 2018 JungJun Lee et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Lee, JungJun Kim, SungHwan Jhong, Jae-Hwan Koo, Ja-Yong Variable Selection and Joint Estimation of Mean and Covariance Models with an Application to eQTL Data |
title | Variable Selection and Joint Estimation of Mean and Covariance Models with an Application to eQTL Data |
title_full | Variable Selection and Joint Estimation of Mean and Covariance Models with an Application to eQTL Data |
title_fullStr | Variable Selection and Joint Estimation of Mean and Covariance Models with an Application to eQTL Data |
title_full_unstemmed | Variable Selection and Joint Estimation of Mean and Covariance Models with an Application to eQTL Data |
title_short | Variable Selection and Joint Estimation of Mean and Covariance Models with an Application to eQTL Data |
title_sort | variable selection and joint estimation of mean and covariance models with an application to eqtl data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6036858/ https://www.ncbi.nlm.nih.gov/pubmed/30046352 http://dx.doi.org/10.1155/2018/4626307 |
work_keys_str_mv | AT leejungjun variableselectionandjointestimationofmeanandcovariancemodelswithanapplicationtoeqtldata AT kimsunghwan variableselectionandjointestimationofmeanandcovariancemodelswithanapplicationtoeqtldata AT jhongjaehwan variableselectionandjointestimationofmeanandcovariancemodelswithanapplicationtoeqtldata AT koojayong variableselectionandjointestimationofmeanandcovariancemodelswithanapplicationtoeqtldata |