Cargando…

Structured sparse CCA for brain imaging genetics via graph OSCAR

BACKGROUND: Recently, structured sparse canonical correlation analysis (SCCA) has received increased attention in brain imaging genetics studies. It can identify bi-multivariate imaging genetic associations as well as select relevant features with desired structure information. These SCCA methods ei...

Descripción completa

Detalles Bibliográficos
Autores principales: Du, Lei, Huang, Heng, Yan, Jingwen, Kim, Sungeun, Risacher, Shannon, Inlow, Mark, Moore, Jason, Saykin, Andrew, Shen, Li
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5009827/
https://www.ncbi.nlm.nih.gov/pubmed/27585988
http://dx.doi.org/10.1186/s12918-016-0312-1
_version_ 1782451584302579712
author Du, Lei
Huang, Heng
Yan, Jingwen
Kim, Sungeun
Risacher, Shannon
Inlow, Mark
Moore, Jason
Saykin, Andrew
Shen, Li
author_facet Du, Lei
Huang, Heng
Yan, Jingwen
Kim, Sungeun
Risacher, Shannon
Inlow, Mark
Moore, Jason
Saykin, Andrew
Shen, Li
author_sort Du, Lei
collection PubMed
description BACKGROUND: Recently, structured sparse canonical correlation analysis (SCCA) has received increased attention in brain imaging genetics studies. It can identify bi-multivariate imaging genetic associations as well as select relevant features with desired structure information. These SCCA methods either use the fused lasso regularizer to induce the smoothness between ordered features, or use the signed pairwise difference which is dependent on the estimated sign of sample correlation. Besides, several other structured SCCA models use the group lasso or graph fused lasso to encourage group structure, but they require the structure/group information provided in advance which sometimes is not available. RESULTS: We propose a new structured SCCA model, which employs the graph OSCAR (GOSCAR) regularizer to encourage those highly correlated features to have similar or equal canonical weights. Our GOSCAR based SCCA has two advantages: 1) It does not require to pre-define the sign of the sample correlation, and thus could reduce the estimation bias. 2) It could pull those highly correlated features together no matter whether they are positively or negatively correlated. We evaluate our method using both synthetic data and real data. Using the 191 ROI measurements of amyloid imaging data, and 58 genetic markers within the APOE gene, our method identifies a strong association between APOE SNP rs429358 and the amyloid burden measure in the frontal region. In addition, the estimated canonical weights present a clear pattern which is preferable for further investigation. CONCLUSIONS: Our proposed method shows better or comparable performance on the synthetic data in terms of the estimated correlations and canonical loadings. It has successfully identified an important association between an Alzheimer’s disease risk SNP rs429358 and the amyloid burden measure in the frontal region.
format Online
Article
Text
id pubmed-5009827
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-50098272016-09-09 Structured sparse CCA for brain imaging genetics via graph OSCAR Du, Lei Huang, Heng Yan, Jingwen Kim, Sungeun Risacher, Shannon Inlow, Mark Moore, Jason Saykin, Andrew Shen, Li BMC Syst Biol Research BACKGROUND: Recently, structured sparse canonical correlation analysis (SCCA) has received increased attention in brain imaging genetics studies. It can identify bi-multivariate imaging genetic associations as well as select relevant features with desired structure information. These SCCA methods either use the fused lasso regularizer to induce the smoothness between ordered features, or use the signed pairwise difference which is dependent on the estimated sign of sample correlation. Besides, several other structured SCCA models use the group lasso or graph fused lasso to encourage group structure, but they require the structure/group information provided in advance which sometimes is not available. RESULTS: We propose a new structured SCCA model, which employs the graph OSCAR (GOSCAR) regularizer to encourage those highly correlated features to have similar or equal canonical weights. Our GOSCAR based SCCA has two advantages: 1) It does not require to pre-define the sign of the sample correlation, and thus could reduce the estimation bias. 2) It could pull those highly correlated features together no matter whether they are positively or negatively correlated. We evaluate our method using both synthetic data and real data. Using the 191 ROI measurements of amyloid imaging data, and 58 genetic markers within the APOE gene, our method identifies a strong association between APOE SNP rs429358 and the amyloid burden measure in the frontal region. In addition, the estimated canonical weights present a clear pattern which is preferable for further investigation. CONCLUSIONS: Our proposed method shows better or comparable performance on the synthetic data in terms of the estimated correlations and canonical loadings. It has successfully identified an important association between an Alzheimer’s disease risk SNP rs429358 and the amyloid burden measure in the frontal region. BioMed Central 2016-08-26 /pmc/articles/PMC5009827/ /pubmed/27585988 http://dx.doi.org/10.1186/s12918-016-0312-1 Text en © The Author(s) 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Du, Lei
Huang, Heng
Yan, Jingwen
Kim, Sungeun
Risacher, Shannon
Inlow, Mark
Moore, Jason
Saykin, Andrew
Shen, Li
Structured sparse CCA for brain imaging genetics via graph OSCAR
title Structured sparse CCA for brain imaging genetics via graph OSCAR
title_full Structured sparse CCA for brain imaging genetics via graph OSCAR
title_fullStr Structured sparse CCA for brain imaging genetics via graph OSCAR
title_full_unstemmed Structured sparse CCA for brain imaging genetics via graph OSCAR
title_short Structured sparse CCA for brain imaging genetics via graph OSCAR
title_sort structured sparse cca for brain imaging genetics via graph oscar
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5009827/
https://www.ncbi.nlm.nih.gov/pubmed/27585988
http://dx.doi.org/10.1186/s12918-016-0312-1
work_keys_str_mv AT dulei structuredsparseccaforbrainimaginggeneticsviagraphoscar
AT huangheng structuredsparseccaforbrainimaginggeneticsviagraphoscar
AT yanjingwen structuredsparseccaforbrainimaginggeneticsviagraphoscar
AT kimsungeun structuredsparseccaforbrainimaginggeneticsviagraphoscar
AT risachershannon structuredsparseccaforbrainimaginggeneticsviagraphoscar
AT inlowmark structuredsparseccaforbrainimaginggeneticsviagraphoscar
AT moorejason structuredsparseccaforbrainimaginggeneticsviagraphoscar
AT saykinandrew structuredsparseccaforbrainimaginggeneticsviagraphoscar
AT shenli structuredsparseccaforbrainimaginggeneticsviagraphoscar
AT structuredsparseccaforbrainimaginggeneticsviagraphoscar