Cargando…

Conditional screening for ultrahigh-dimensional survival data in case-cohort studies

The case-cohort design has been widely used to reduce the cost of covariate measurements in large cohort studies. In many such studies, the number of covariates is very large, and the goal of the research is to identify active covariates which have great influence on response. Since the introduction...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Jing, Zhou, Haibo, Liu, Yanyan, Cai, Jianwen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8561435/
https://www.ncbi.nlm.nih.gov/pubmed/34417679
http://dx.doi.org/10.1007/s10985-021-09531-7
_version_ 1784593121102790656
author Zhang, Jing
Zhou, Haibo
Liu, Yanyan
Cai, Jianwen
author_facet Zhang, Jing
Zhou, Haibo
Liu, Yanyan
Cai, Jianwen
author_sort Zhang, Jing
collection PubMed
description The case-cohort design has been widely used to reduce the cost of covariate measurements in large cohort studies. In many such studies, the number of covariates is very large, and the goal of the research is to identify active covariates which have great influence on response. Since the introduction of sure independence screening (SIS), screening procedures have achieved great success in terms of effectively reducing the dimensionality and identifying active covariates. However, commonly used screening methods are based on marginal correlation or its variants, they may fail to identify hidden active variables which are jointly important but are weakly correlated with the response. Moreover, these screening methods are mainly proposed for data under the simple random sampling and can not be directly applied to case-cohort data. In this paper, we consider the ultrahigh-dimensional survival data under the case-cohort design, and propose a conditional screening method by incorporating some important prior known information of active variables. This method can effectively detect hidden active variables. Furthermore, it possesses the sure screening property under some mild regularity conditions and does not require any complicated numerical optimization. We evaluate the finite sample performance of the proposed method via extensive simulation studies and further illustrate the new approach through a real data set from patients with breast cancer.
format Online
Article
Text
id pubmed-8561435
institution National Center for Biotechnology Information
language English
publishDate 2021
record_format MEDLINE/PubMed
spelling pubmed-85614352022-10-01 Conditional screening for ultrahigh-dimensional survival data in case-cohort studies Zhang, Jing Zhou, Haibo Liu, Yanyan Cai, Jianwen Lifetime Data Anal Article The case-cohort design has been widely used to reduce the cost of covariate measurements in large cohort studies. In many such studies, the number of covariates is very large, and the goal of the research is to identify active covariates which have great influence on response. Since the introduction of sure independence screening (SIS), screening procedures have achieved great success in terms of effectively reducing the dimensionality and identifying active covariates. However, commonly used screening methods are based on marginal correlation or its variants, they may fail to identify hidden active variables which are jointly important but are weakly correlated with the response. Moreover, these screening methods are mainly proposed for data under the simple random sampling and can not be directly applied to case-cohort data. In this paper, we consider the ultrahigh-dimensional survival data under the case-cohort design, and propose a conditional screening method by incorporating some important prior known information of active variables. This method can effectively detect hidden active variables. Furthermore, it possesses the sure screening property under some mild regularity conditions and does not require any complicated numerical optimization. We evaluate the finite sample performance of the proposed method via extensive simulation studies and further illustrate the new approach through a real data set from patients with breast cancer. 2021-08-20 2021-10 /pmc/articles/PMC8561435/ /pubmed/34417679 http://dx.doi.org/10.1007/s10985-021-09531-7 Text en https://creativecommons.org/licenses/by/4.0/Under no circumstances may this AM be shared or distributed under a Creative Commons or other form of open access license, nor may it be reformatted or enhanced, whether by the Author or third parties. See here for Springer Nature’s terms of use for AM versions of subscription articles: https//www.springernature.com/gp/open-research/policies/accepted-manuscript-terms
spellingShingle Article
Zhang, Jing
Zhou, Haibo
Liu, Yanyan
Cai, Jianwen
Conditional screening for ultrahigh-dimensional survival data in case-cohort studies
title Conditional screening for ultrahigh-dimensional survival data in case-cohort studies
title_full Conditional screening for ultrahigh-dimensional survival data in case-cohort studies
title_fullStr Conditional screening for ultrahigh-dimensional survival data in case-cohort studies
title_full_unstemmed Conditional screening for ultrahigh-dimensional survival data in case-cohort studies
title_short Conditional screening for ultrahigh-dimensional survival data in case-cohort studies
title_sort conditional screening for ultrahigh-dimensional survival data in case-cohort studies
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8561435/
https://www.ncbi.nlm.nih.gov/pubmed/34417679
http://dx.doi.org/10.1007/s10985-021-09531-7
work_keys_str_mv AT zhangjing conditionalscreeningforultrahighdimensionalsurvivaldataincasecohortstudies
AT zhouhaibo conditionalscreeningforultrahighdimensionalsurvivaldataincasecohortstudies
AT liuyanyan conditionalscreeningforultrahighdimensionalsurvivaldataincasecohortstudies
AT caijianwen conditionalscreeningforultrahighdimensionalsurvivaldataincasecohortstudies