Cargando…
Structured sparse CCA for brain imaging genetics via graph OSCAR
BACKGROUND: Recently, structured sparse canonical correlation analysis (SCCA) has received increased attention in brain imaging genetics studies. It can identify bi-multivariate imaging genetic associations as well as select relevant features with desired structure information. These SCCA methods ei...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5009827/ https://www.ncbi.nlm.nih.gov/pubmed/27585988 http://dx.doi.org/10.1186/s12918-016-0312-1 |
_version_ | 1782451584302579712 |
---|---|
author | Du, Lei Huang, Heng Yan, Jingwen Kim, Sungeun Risacher, Shannon Inlow, Mark Moore, Jason Saykin, Andrew Shen, Li |
author_facet | Du, Lei Huang, Heng Yan, Jingwen Kim, Sungeun Risacher, Shannon Inlow, Mark Moore, Jason Saykin, Andrew Shen, Li |
author_sort | Du, Lei |
collection | PubMed |
description | BACKGROUND: Recently, structured sparse canonical correlation analysis (SCCA) has received increased attention in brain imaging genetics studies. It can identify bi-multivariate imaging genetic associations as well as select relevant features with desired structure information. These SCCA methods either use the fused lasso regularizer to induce the smoothness between ordered features, or use the signed pairwise difference which is dependent on the estimated sign of sample correlation. Besides, several other structured SCCA models use the group lasso or graph fused lasso to encourage group structure, but they require the structure/group information provided in advance which sometimes is not available. RESULTS: We propose a new structured SCCA model, which employs the graph OSCAR (GOSCAR) regularizer to encourage those highly correlated features to have similar or equal canonical weights. Our GOSCAR based SCCA has two advantages: 1) It does not require to pre-define the sign of the sample correlation, and thus could reduce the estimation bias. 2) It could pull those highly correlated features together no matter whether they are positively or negatively correlated. We evaluate our method using both synthetic data and real data. Using the 191 ROI measurements of amyloid imaging data, and 58 genetic markers within the APOE gene, our method identifies a strong association between APOE SNP rs429358 and the amyloid burden measure in the frontal region. In addition, the estimated canonical weights present a clear pattern which is preferable for further investigation. CONCLUSIONS: Our proposed method shows better or comparable performance on the synthetic data in terms of the estimated correlations and canonical loadings. It has successfully identified an important association between an Alzheimer’s disease risk SNP rs429358 and the amyloid burden measure in the frontal region. |
format | Online Article Text |
id | pubmed-5009827 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-50098272016-09-09 Structured sparse CCA for brain imaging genetics via graph OSCAR Du, Lei Huang, Heng Yan, Jingwen Kim, Sungeun Risacher, Shannon Inlow, Mark Moore, Jason Saykin, Andrew Shen, Li BMC Syst Biol Research BACKGROUND: Recently, structured sparse canonical correlation analysis (SCCA) has received increased attention in brain imaging genetics studies. It can identify bi-multivariate imaging genetic associations as well as select relevant features with desired structure information. These SCCA methods either use the fused lasso regularizer to induce the smoothness between ordered features, or use the signed pairwise difference which is dependent on the estimated sign of sample correlation. Besides, several other structured SCCA models use the group lasso or graph fused lasso to encourage group structure, but they require the structure/group information provided in advance which sometimes is not available. RESULTS: We propose a new structured SCCA model, which employs the graph OSCAR (GOSCAR) regularizer to encourage those highly correlated features to have similar or equal canonical weights. Our GOSCAR based SCCA has two advantages: 1) It does not require to pre-define the sign of the sample correlation, and thus could reduce the estimation bias. 2) It could pull those highly correlated features together no matter whether they are positively or negatively correlated. We evaluate our method using both synthetic data and real data. Using the 191 ROI measurements of amyloid imaging data, and 58 genetic markers within the APOE gene, our method identifies a strong association between APOE SNP rs429358 and the amyloid burden measure in the frontal region. In addition, the estimated canonical weights present a clear pattern which is preferable for further investigation. CONCLUSIONS: Our proposed method shows better or comparable performance on the synthetic data in terms of the estimated correlations and canonical loadings. It has successfully identified an important association between an Alzheimer’s disease risk SNP rs429358 and the amyloid burden measure in the frontal region. BioMed Central 2016-08-26 /pmc/articles/PMC5009827/ /pubmed/27585988 http://dx.doi.org/10.1186/s12918-016-0312-1 Text en © The Author(s) 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Du, Lei Huang, Heng Yan, Jingwen Kim, Sungeun Risacher, Shannon Inlow, Mark Moore, Jason Saykin, Andrew Shen, Li Structured sparse CCA for brain imaging genetics via graph OSCAR |
title | Structured sparse CCA for brain imaging genetics via graph OSCAR |
title_full | Structured sparse CCA for brain imaging genetics via graph OSCAR |
title_fullStr | Structured sparse CCA for brain imaging genetics via graph OSCAR |
title_full_unstemmed | Structured sparse CCA for brain imaging genetics via graph OSCAR |
title_short | Structured sparse CCA for brain imaging genetics via graph OSCAR |
title_sort | structured sparse cca for brain imaging genetics via graph oscar |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5009827/ https://www.ncbi.nlm.nih.gov/pubmed/27585988 http://dx.doi.org/10.1186/s12918-016-0312-1 |
work_keys_str_mv | AT dulei structuredsparseccaforbrainimaginggeneticsviagraphoscar AT huangheng structuredsparseccaforbrainimaginggeneticsviagraphoscar AT yanjingwen structuredsparseccaforbrainimaginggeneticsviagraphoscar AT kimsungeun structuredsparseccaforbrainimaginggeneticsviagraphoscar AT risachershannon structuredsparseccaforbrainimaginggeneticsviagraphoscar AT inlowmark structuredsparseccaforbrainimaginggeneticsviagraphoscar AT moorejason structuredsparseccaforbrainimaginggeneticsviagraphoscar AT saykinandrew structuredsparseccaforbrainimaginggeneticsviagraphoscar AT shenli structuredsparseccaforbrainimaginggeneticsviagraphoscar AT structuredsparseccaforbrainimaginggeneticsviagraphoscar |