Cargando…

Estimation and Inference for High Dimensional Generalized Linear Models: A Splitting and Smoothing Approach

The focus of modern biomedical studies has gradually shifted to explanation and estimation of joint effects of high dimensional predictors on disease risks. Quantifying uncertainty in these estimates may provide valuable insight into prevention strategies or treatment decisions for both patients and...

Descripción completa

Detalles Bibliográficos
Autores principales: Fei, Zhe, Li, Yi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8442657/
https://www.ncbi.nlm.nih.gov/pubmed/34531706
_version_ 1783753047831216128
author Fei, Zhe
Li, Yi
author_facet Fei, Zhe
Li, Yi
author_sort Fei, Zhe
collection PubMed
description The focus of modern biomedical studies has gradually shifted to explanation and estimation of joint effects of high dimensional predictors on disease risks. Quantifying uncertainty in these estimates may provide valuable insight into prevention strategies or treatment decisions for both patients and physicians. High dimensional inference, including confidence intervals and hypothesis testing, has sparked much interest. While much work has been done in the linear regression setting, there is lack of literature on inference for high dimensional generalized linear models. We propose a novel and computationally feasible method, which accommodates a variety of outcome types, including normal, binomial, and Poisson data. We use a “splitting and smoothing” approach, which splits samples into two parts, performs variable selection using one part and conducts partial regression with the other part. Averaging the estimates over multiple random splits, we obtain the smoothed estimates, which are numerically stable. We show that the estimates are consistent, asymptotically normal, and construct confidence intervals with proper coverage probabilities for all predictors. We examine the finite sample performance of our method by comparing it with the existing methods and applying it to analyze a lung cancer cohort study.
format Online
Article
Text
id pubmed-8442657
institution National Center for Biotechnology Information
language English
publishDate 2021
record_format MEDLINE/PubMed
spelling pubmed-84426572021-09-15 Estimation and Inference for High Dimensional Generalized Linear Models: A Splitting and Smoothing Approach Fei, Zhe Li, Yi J Mach Learn Res Article The focus of modern biomedical studies has gradually shifted to explanation and estimation of joint effects of high dimensional predictors on disease risks. Quantifying uncertainty in these estimates may provide valuable insight into prevention strategies or treatment decisions for both patients and physicians. High dimensional inference, including confidence intervals and hypothesis testing, has sparked much interest. While much work has been done in the linear regression setting, there is lack of literature on inference for high dimensional generalized linear models. We propose a novel and computationally feasible method, which accommodates a variety of outcome types, including normal, binomial, and Poisson data. We use a “splitting and smoothing” approach, which splits samples into two parts, performs variable selection using one part and conducts partial regression with the other part. Averaging the estimates over multiple random splits, we obtain the smoothed estimates, which are numerically stable. We show that the estimates are consistent, asymptotically normal, and construct confidence intervals with proper coverage probabilities for all predictors. We examine the finite sample performance of our method by comparing it with the existing methods and applying it to analyze a lung cancer cohort study. 2021 /pmc/articles/PMC8442657/ /pubmed/34531706 Text en https://creativecommons.org/licenses/by/4.0/License: CC-BY 4.0, see https://creativecommons.org/licenses/by/4.0/. Attribution requirements are provided at http://jmlr.org/papers/v22/19-132.html.
spellingShingle Article
Fei, Zhe
Li, Yi
Estimation and Inference for High Dimensional Generalized Linear Models: A Splitting and Smoothing Approach
title Estimation and Inference for High Dimensional Generalized Linear Models: A Splitting and Smoothing Approach
title_full Estimation and Inference for High Dimensional Generalized Linear Models: A Splitting and Smoothing Approach
title_fullStr Estimation and Inference for High Dimensional Generalized Linear Models: A Splitting and Smoothing Approach
title_full_unstemmed Estimation and Inference for High Dimensional Generalized Linear Models: A Splitting and Smoothing Approach
title_short Estimation and Inference for High Dimensional Generalized Linear Models: A Splitting and Smoothing Approach
title_sort estimation and inference for high dimensional generalized linear models: a splitting and smoothing approach
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8442657/
https://www.ncbi.nlm.nih.gov/pubmed/34531706
work_keys_str_mv AT feizhe estimationandinferenceforhighdimensionalgeneralizedlinearmodelsasplittingandsmoothingapproach
AT liyi estimationandinferenceforhighdimensionalgeneralizedlinearmodelsasplittingandsmoothingapproach