Cargando…
inMTSCCA: An Integrated Multi-task Sparse Canonical Correlation Analysis for Multi-omic Brain Imaging Genetics
Identifying genetic risk factors for Alzheimer’s disease (AD) is an important research topic. To date, different endophenotypes, such as imaging-derived endophenotypes and proteomic expression-derived endophenotypes, have shown the great value in uncovering risk genes compared to case–control studie...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10634656/ https://www.ncbi.nlm.nih.gov/pubmed/37442417 http://dx.doi.org/10.1016/j.gpb.2023.03.005 |
_version_ | 1785132861821550592 |
---|---|
author | Du, Lei Zhang, Jin Zhao, Ying Shang, Muheng Guo, Lei Han, Junwei |
author_facet | Du, Lei Zhang, Jin Zhao, Ying Shang, Muheng Guo, Lei Han, Junwei |
author_sort | Du, Lei |
collection | PubMed |
description | Identifying genetic risk factors for Alzheimer’s disease (AD) is an important research topic. To date, different endophenotypes, such as imaging-derived endophenotypes and proteomic expression-derived endophenotypes, have shown the great value in uncovering risk genes compared to case–control studies. Biologically, a co-varying pattern of different omics-derived endophenotypes could result from the shared genetic basis. However, existing methods mainly focus on the effect of endophenotypes alone; the effect of cross-endophenotype (CEP) associations remains largely unexploited. In this study, we used both endophenotypes and their CEP associations of multi-omic data to identify genetic risk factors, and proposed two integrated multi-task sparse canonical correlation analysis (inMTSCCA) methods, i.e., pairwise endophenotype correlation-guided MTSCCA (pcMTSCCA) and high-order endophenotype correlation-guided MTSCCA (hocMTSCCA). pcMTSCCA employed pairwise correlations between magnetic resonance imaging (MRI)-derived, plasma-derived, and cerebrospinal fluid (CSF)-derived endophenotypes as an additional penalty. hocMTSCCA used high-order correlations among these multi-omic data for regularization. To figure out genetic risk factors at individual and group levels, as well as altered endophenotypic markers, we introduced sparsity-inducing penalties for both models. We compared pcMTSCCA and hocMTSCCA with three related methods on both simulation and real (consisting of neuroimaging data, proteomic analytes, and genetic data) datasets. The results showed that our methods obtained better or comparable canonical correlation coefficients (CCCs) and better feature subsets than benchmarks. Most importantly, the identified genetic loci and heterogeneous endophenotypic markers showed high relevance. Therefore, jointly using multi-omic endophenotypes and their CEP associations is promising to reveal genetic risk factors. The source code and manual of inMTSCCA are available at https://ngdc.cncb.ac.cn/biocode/tools/BT007330. |
format | Online Article Text |
id | pubmed-10634656 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-106346562023-11-10 inMTSCCA: An Integrated Multi-task Sparse Canonical Correlation Analysis for Multi-omic Brain Imaging Genetics Du, Lei Zhang, Jin Zhao, Ying Shang, Muheng Guo, Lei Han, Junwei Genomics Proteomics Bioinformatics Method Identifying genetic risk factors for Alzheimer’s disease (AD) is an important research topic. To date, different endophenotypes, such as imaging-derived endophenotypes and proteomic expression-derived endophenotypes, have shown the great value in uncovering risk genes compared to case–control studies. Biologically, a co-varying pattern of different omics-derived endophenotypes could result from the shared genetic basis. However, existing methods mainly focus on the effect of endophenotypes alone; the effect of cross-endophenotype (CEP) associations remains largely unexploited. In this study, we used both endophenotypes and their CEP associations of multi-omic data to identify genetic risk factors, and proposed two integrated multi-task sparse canonical correlation analysis (inMTSCCA) methods, i.e., pairwise endophenotype correlation-guided MTSCCA (pcMTSCCA) and high-order endophenotype correlation-guided MTSCCA (hocMTSCCA). pcMTSCCA employed pairwise correlations between magnetic resonance imaging (MRI)-derived, plasma-derived, and cerebrospinal fluid (CSF)-derived endophenotypes as an additional penalty. hocMTSCCA used high-order correlations among these multi-omic data for regularization. To figure out genetic risk factors at individual and group levels, as well as altered endophenotypic markers, we introduced sparsity-inducing penalties for both models. We compared pcMTSCCA and hocMTSCCA with three related methods on both simulation and real (consisting of neuroimaging data, proteomic analytes, and genetic data) datasets. The results showed that our methods obtained better or comparable canonical correlation coefficients (CCCs) and better feature subsets than benchmarks. Most importantly, the identified genetic loci and heterogeneous endophenotypic markers showed high relevance. Therefore, jointly using multi-omic endophenotypes and their CEP associations is promising to reveal genetic risk factors. The source code and manual of inMTSCCA are available at https://ngdc.cncb.ac.cn/biocode/tools/BT007330. Elsevier 2023-04 2023-07-11 /pmc/articles/PMC10634656/ /pubmed/37442417 http://dx.doi.org/10.1016/j.gpb.2023.03.005 Text en © 2023 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Method Du, Lei Zhang, Jin Zhao, Ying Shang, Muheng Guo, Lei Han, Junwei inMTSCCA: An Integrated Multi-task Sparse Canonical Correlation Analysis for Multi-omic Brain Imaging Genetics |
title | inMTSCCA: An Integrated Multi-task Sparse Canonical Correlation Analysis for Multi-omic Brain Imaging Genetics |
title_full | inMTSCCA: An Integrated Multi-task Sparse Canonical Correlation Analysis for Multi-omic Brain Imaging Genetics |
title_fullStr | inMTSCCA: An Integrated Multi-task Sparse Canonical Correlation Analysis for Multi-omic Brain Imaging Genetics |
title_full_unstemmed | inMTSCCA: An Integrated Multi-task Sparse Canonical Correlation Analysis for Multi-omic Brain Imaging Genetics |
title_short | inMTSCCA: An Integrated Multi-task Sparse Canonical Correlation Analysis for Multi-omic Brain Imaging Genetics |
title_sort | inmtscca: an integrated multi-task sparse canonical correlation analysis for multi-omic brain imaging genetics |
topic | Method |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10634656/ https://www.ncbi.nlm.nih.gov/pubmed/37442417 http://dx.doi.org/10.1016/j.gpb.2023.03.005 |
work_keys_str_mv | AT dulei inmtsccaanintegratedmultitasksparsecanonicalcorrelationanalysisformultiomicbrainimaginggenetics AT zhangjin inmtsccaanintegratedmultitasksparsecanonicalcorrelationanalysisformultiomicbrainimaginggenetics AT zhaoying inmtsccaanintegratedmultitasksparsecanonicalcorrelationanalysisformultiomicbrainimaginggenetics AT shangmuheng inmtsccaanintegratedmultitasksparsecanonicalcorrelationanalysisformultiomicbrainimaginggenetics AT guolei inmtsccaanintegratedmultitasksparsecanonicalcorrelationanalysisformultiomicbrainimaginggenetics AT hanjunwei inmtsccaanintegratedmultitasksparsecanonicalcorrelationanalysisformultiomicbrainimaginggenetics AT inmtsccaanintegratedmultitasksparsecanonicalcorrelationanalysisformultiomicbrainimaginggenetics |