Cargando…

inMTSCCA: An Integrated Multi-task Sparse Canonical Correlation Analysis for Multi-omic Brain Imaging Genetics

Identifying genetic risk factors for Alzheimer’s disease (AD) is an important research topic. To date, different endophenotypes, such as imaging-derived endophenotypes and proteomic expression-derived endophenotypes, have shown the great value in uncovering risk genes compared to case–control studie...

Descripción completa

Detalles Bibliográficos
Autores principales: Du, Lei, Zhang, Jin, Zhao, Ying, Shang, Muheng, Guo, Lei, Han, Junwei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10634656/
https://www.ncbi.nlm.nih.gov/pubmed/37442417
http://dx.doi.org/10.1016/j.gpb.2023.03.005
_version_ 1785132861821550592
author Du, Lei
Zhang, Jin
Zhao, Ying
Shang, Muheng
Guo, Lei
Han, Junwei
author_facet Du, Lei
Zhang, Jin
Zhao, Ying
Shang, Muheng
Guo, Lei
Han, Junwei
author_sort Du, Lei
collection PubMed
description Identifying genetic risk factors for Alzheimer’s disease (AD) is an important research topic. To date, different endophenotypes, such as imaging-derived endophenotypes and proteomic expression-derived endophenotypes, have shown the great value in uncovering risk genes compared to case–control studies. Biologically, a co-varying pattern of different omics-derived endophenotypes could result from the shared genetic basis. However, existing methods mainly focus on the effect of endophenotypes alone; the effect of cross-endophenotype (CEP) associations remains largely unexploited. In this study, we used both endophenotypes and their CEP associations of multi-omic data to identify genetic risk factors, and proposed two integrated multi-task sparse canonical correlation analysis (inMTSCCA) methods, i.e., pairwise endophenotype correlation-guided MTSCCA (pcMTSCCA) and high-order endophenotype correlation-guided MTSCCA (hocMTSCCA). pcMTSCCA employed pairwise correlations between magnetic resonance imaging (MRI)-derived, plasma-derived, and cerebrospinal fluid (CSF)-derived endophenotypes as an additional penalty. hocMTSCCA used high-order correlations among these multi-omic data for regularization. To figure out genetic risk factors at individual and group levels, as well as altered endophenotypic markers, we introduced sparsity-inducing penalties for both models. We compared pcMTSCCA and hocMTSCCA with three related methods on both simulation and real (consisting of neuroimaging data, proteomic analytes, and genetic data) datasets. The results showed that our methods obtained better or comparable canonical correlation coefficients (CCCs) and better feature subsets than benchmarks. Most importantly, the identified genetic loci and heterogeneous endophenotypic markers showed high relevance. Therefore, jointly using multi-omic endophenotypes and their CEP associations is promising to reveal genetic risk factors. The source code and manual of inMTSCCA are available at https://ngdc.cncb.ac.cn/biocode/tools/BT007330.
format Online
Article
Text
id pubmed-10634656
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-106346562023-11-10 inMTSCCA: An Integrated Multi-task Sparse Canonical Correlation Analysis for Multi-omic Brain Imaging Genetics Du, Lei Zhang, Jin Zhao, Ying Shang, Muheng Guo, Lei Han, Junwei Genomics Proteomics Bioinformatics Method Identifying genetic risk factors for Alzheimer’s disease (AD) is an important research topic. To date, different endophenotypes, such as imaging-derived endophenotypes and proteomic expression-derived endophenotypes, have shown the great value in uncovering risk genes compared to case–control studies. Biologically, a co-varying pattern of different omics-derived endophenotypes could result from the shared genetic basis. However, existing methods mainly focus on the effect of endophenotypes alone; the effect of cross-endophenotype (CEP) associations remains largely unexploited. In this study, we used both endophenotypes and their CEP associations of multi-omic data to identify genetic risk factors, and proposed two integrated multi-task sparse canonical correlation analysis (inMTSCCA) methods, i.e., pairwise endophenotype correlation-guided MTSCCA (pcMTSCCA) and high-order endophenotype correlation-guided MTSCCA (hocMTSCCA). pcMTSCCA employed pairwise correlations between magnetic resonance imaging (MRI)-derived, plasma-derived, and cerebrospinal fluid (CSF)-derived endophenotypes as an additional penalty. hocMTSCCA used high-order correlations among these multi-omic data for regularization. To figure out genetic risk factors at individual and group levels, as well as altered endophenotypic markers, we introduced sparsity-inducing penalties for both models. We compared pcMTSCCA and hocMTSCCA with three related methods on both simulation and real (consisting of neuroimaging data, proteomic analytes, and genetic data) datasets. The results showed that our methods obtained better or comparable canonical correlation coefficients (CCCs) and better feature subsets than benchmarks. Most importantly, the identified genetic loci and heterogeneous endophenotypic markers showed high relevance. Therefore, jointly using multi-omic endophenotypes and their CEP associations is promising to reveal genetic risk factors. The source code and manual of inMTSCCA are available at https://ngdc.cncb.ac.cn/biocode/tools/BT007330. Elsevier 2023-04 2023-07-11 /pmc/articles/PMC10634656/ /pubmed/37442417 http://dx.doi.org/10.1016/j.gpb.2023.03.005 Text en © 2023 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Method
Du, Lei
Zhang, Jin
Zhao, Ying
Shang, Muheng
Guo, Lei
Han, Junwei
inMTSCCA: An Integrated Multi-task Sparse Canonical Correlation Analysis for Multi-omic Brain Imaging Genetics
title inMTSCCA: An Integrated Multi-task Sparse Canonical Correlation Analysis for Multi-omic Brain Imaging Genetics
title_full inMTSCCA: An Integrated Multi-task Sparse Canonical Correlation Analysis for Multi-omic Brain Imaging Genetics
title_fullStr inMTSCCA: An Integrated Multi-task Sparse Canonical Correlation Analysis for Multi-omic Brain Imaging Genetics
title_full_unstemmed inMTSCCA: An Integrated Multi-task Sparse Canonical Correlation Analysis for Multi-omic Brain Imaging Genetics
title_short inMTSCCA: An Integrated Multi-task Sparse Canonical Correlation Analysis for Multi-omic Brain Imaging Genetics
title_sort inmtscca: an integrated multi-task sparse canonical correlation analysis for multi-omic brain imaging genetics
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10634656/
https://www.ncbi.nlm.nih.gov/pubmed/37442417
http://dx.doi.org/10.1016/j.gpb.2023.03.005
work_keys_str_mv AT dulei inmtsccaanintegratedmultitasksparsecanonicalcorrelationanalysisformultiomicbrainimaginggenetics
AT zhangjin inmtsccaanintegratedmultitasksparsecanonicalcorrelationanalysisformultiomicbrainimaginggenetics
AT zhaoying inmtsccaanintegratedmultitasksparsecanonicalcorrelationanalysisformultiomicbrainimaginggenetics
AT shangmuheng inmtsccaanintegratedmultitasksparsecanonicalcorrelationanalysisformultiomicbrainimaginggenetics
AT guolei inmtsccaanintegratedmultitasksparsecanonicalcorrelationanalysisformultiomicbrainimaginggenetics
AT hanjunwei inmtsccaanintegratedmultitasksparsecanonicalcorrelationanalysisformultiomicbrainimaginggenetics
AT inmtsccaanintegratedmultitasksparsecanonicalcorrelationanalysisformultiomicbrainimaginggenetics