Cargando…

SR-TWAS: Leveraging Multiple Reference Panels to Improve TWAS Power by Ensemble Machine Learning

Multiple reference panels of a given tissue or multiple tissues often exist, and multiple regression methods could be used for training gene expression imputation models for TWAS. To leverage expression imputation models (i.e., base models) trained with multiple reference panels, regression methods,...

Descripción completa

Detalles Bibliográficos
Autores principales: Parrish, Randy L., Buchman, Aron S., Tasaki, Shinya, Wang, Yanling, Avey, Denis, Xu, Jishu, De Jager, Philip L., Bennett, David A., Epstein, Michael P., Yang, Jingjing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10327185/
https://www.ncbi.nlm.nih.gov/pubmed/37425698
http://dx.doi.org/10.1101/2023.06.20.23291605
Descripción
Sumario:Multiple reference panels of a given tissue or multiple tissues often exist, and multiple regression methods could be used for training gene expression imputation models for TWAS. To leverage expression imputation models (i.e., base models) trained with multiple reference panels, regression methods, and tissues, we develop a Stacked Regression based TWAS (SR-TWAS) tool which can obtain optimal linear combinations of base models for a given validation transcriptomic dataset. Both simulation and real studies showed that SR-TWAS improved power due to increased effective training sample sizes and borrowed strength across multiple regression methods and tissues. Leveraging base models across multiple reference panels, tissues, and regression methods, our studies of Alzheimer’s disease (AD) dementia and Parkinson’s disease (PD) identified respective 11 independent significant risk genes for AD (supplementary motor area tissue) and 12 independent significant risk genes for PD (substantia nigra tissue), including 6 novels for AD and 6 novels for PD.