Cargando…

Transcriptome-guided amyloid imaging genetic analysis via a novel structured sparse learning algorithm

Motivation: Imaging genetics is an emerging field that studies the influence of genetic variation on brain structure and function. The major task is to examine the association between genetic markers such as single-nucleotide polymorphisms (SNPs) and quantitative traits (QTs) extracted from neuroima...

Descripción completa

Detalles Bibliográficos
Autores principales: Yan, Jingwen, Du, Lei, Kim, Sungeun, Risacher, Shannon L., Huang, Heng, Moore, Jason H., Saykin, Andrew J., Shen, Li
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4147918/
https://www.ncbi.nlm.nih.gov/pubmed/25161248
http://dx.doi.org/10.1093/bioinformatics/btu465
_version_ 1782332537548308480
author Yan, Jingwen
Du, Lei
Kim, Sungeun
Risacher, Shannon L.
Huang, Heng
Moore, Jason H.
Saykin, Andrew J.
Shen, Li
author_facet Yan, Jingwen
Du, Lei
Kim, Sungeun
Risacher, Shannon L.
Huang, Heng
Moore, Jason H.
Saykin, Andrew J.
Shen, Li
author_sort Yan, Jingwen
collection PubMed
description Motivation: Imaging genetics is an emerging field that studies the influence of genetic variation on brain structure and function. The major task is to examine the association between genetic markers such as single-nucleotide polymorphisms (SNPs) and quantitative traits (QTs) extracted from neuroimaging data. The complexity of these datasets has presented critical bioinformatics challenges that require new enabling tools. Sparse canonical correlation analysis (SCCA) is a bi-multivariate technique used in imaging genetics to identify complex multi-SNP–multi-QT associations. However, most of the existing SCCA algorithms are designed using the soft thresholding method, which assumes that the input features are independent from one another. This assumption clearly does not hold for the imaging genetic data. In this article, we propose a new knowledge-guided SCCA algorithm (KG-SCCA) to overcome this limitation as well as improve learning results by incorporating valuable prior knowledge. Results: The proposed KG-SCCA method is able to model two types of prior knowledge: one as a group structure (e.g. linkage disequilibrium blocks among SNPs) and the other as a network structure (e.g. gene co-expression network among brain regions). The new model incorporates these prior structures by introducing new regularization terms to encourage weight similarity between grouped or connected features. A new algorithm is designed to solve the KG-SCCA model without imposing the independence constraint on the input features. We demonstrate the effectiveness of our algorithm with both synthetic and real data. For real data, using an Alzheimer’s disease (AD) cohort, we examine the imaging genetic associations between all SNPs in the APOE gene (i.e. top AD gene) and amyloid deposition measures among cortical regions (i.e. a major AD hallmark). In comparison with a widely used SCCA implementation, our KG-SCCA algorithm produces not only improved cross-validation performances but also biologically meaningful results. Availability: Software is freely available on request. Contact: shenli@iu.edu
format Online
Article
Text
id pubmed-4147918
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-41479182014-09-02 Transcriptome-guided amyloid imaging genetic analysis via a novel structured sparse learning algorithm Yan, Jingwen Du, Lei Kim, Sungeun Risacher, Shannon L. Huang, Heng Moore, Jason H. Saykin, Andrew J. Shen, Li Bioinformatics Eccb 2014 Proceedings Papers Committee Motivation: Imaging genetics is an emerging field that studies the influence of genetic variation on brain structure and function. The major task is to examine the association between genetic markers such as single-nucleotide polymorphisms (SNPs) and quantitative traits (QTs) extracted from neuroimaging data. The complexity of these datasets has presented critical bioinformatics challenges that require new enabling tools. Sparse canonical correlation analysis (SCCA) is a bi-multivariate technique used in imaging genetics to identify complex multi-SNP–multi-QT associations. However, most of the existing SCCA algorithms are designed using the soft thresholding method, which assumes that the input features are independent from one another. This assumption clearly does not hold for the imaging genetic data. In this article, we propose a new knowledge-guided SCCA algorithm (KG-SCCA) to overcome this limitation as well as improve learning results by incorporating valuable prior knowledge. Results: The proposed KG-SCCA method is able to model two types of prior knowledge: one as a group structure (e.g. linkage disequilibrium blocks among SNPs) and the other as a network structure (e.g. gene co-expression network among brain regions). The new model incorporates these prior structures by introducing new regularization terms to encourage weight similarity between grouped or connected features. A new algorithm is designed to solve the KG-SCCA model without imposing the independence constraint on the input features. We demonstrate the effectiveness of our algorithm with both synthetic and real data. For real data, using an Alzheimer’s disease (AD) cohort, we examine the imaging genetic associations between all SNPs in the APOE gene (i.e. top AD gene) and amyloid deposition measures among cortical regions (i.e. a major AD hallmark). In comparison with a widely used SCCA implementation, our KG-SCCA algorithm produces not only improved cross-validation performances but also biologically meaningful results. Availability: Software is freely available on request. Contact: shenli@iu.edu Oxford University Press 2014-09-01 2014-08-22 /pmc/articles/PMC4147918/ /pubmed/25161248 http://dx.doi.org/10.1093/bioinformatics/btu465 Text en © The Author 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Eccb 2014 Proceedings Papers Committee
Yan, Jingwen
Du, Lei
Kim, Sungeun
Risacher, Shannon L.
Huang, Heng
Moore, Jason H.
Saykin, Andrew J.
Shen, Li
Transcriptome-guided amyloid imaging genetic analysis via a novel structured sparse learning algorithm
title Transcriptome-guided amyloid imaging genetic analysis via a novel structured sparse learning algorithm
title_full Transcriptome-guided amyloid imaging genetic analysis via a novel structured sparse learning algorithm
title_fullStr Transcriptome-guided amyloid imaging genetic analysis via a novel structured sparse learning algorithm
title_full_unstemmed Transcriptome-guided amyloid imaging genetic analysis via a novel structured sparse learning algorithm
title_short Transcriptome-guided amyloid imaging genetic analysis via a novel structured sparse learning algorithm
title_sort transcriptome-guided amyloid imaging genetic analysis via a novel structured sparse learning algorithm
topic Eccb 2014 Proceedings Papers Committee
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4147918/
https://www.ncbi.nlm.nih.gov/pubmed/25161248
http://dx.doi.org/10.1093/bioinformatics/btu465
work_keys_str_mv AT yanjingwen transcriptomeguidedamyloidimaginggeneticanalysisviaanovelstructuredsparselearningalgorithm
AT dulei transcriptomeguidedamyloidimaginggeneticanalysisviaanovelstructuredsparselearningalgorithm
AT kimsungeun transcriptomeguidedamyloidimaginggeneticanalysisviaanovelstructuredsparselearningalgorithm
AT risachershannonl transcriptomeguidedamyloidimaginggeneticanalysisviaanovelstructuredsparselearningalgorithm
AT huangheng transcriptomeguidedamyloidimaginggeneticanalysisviaanovelstructuredsparselearningalgorithm
AT moorejasonh transcriptomeguidedamyloidimaginggeneticanalysisviaanovelstructuredsparselearningalgorithm
AT saykinandrewj transcriptomeguidedamyloidimaginggeneticanalysisviaanovelstructuredsparselearningalgorithm
AT shenli transcriptomeguidedamyloidimaginggeneticanalysisviaanovelstructuredsparselearningalgorithm
AT transcriptomeguidedamyloidimaginggeneticanalysisviaanovelstructuredsparselearningalgorithm