Cargando…
Conditional screening for ultrahigh-dimensional survival data in case-cohort studies
The case-cohort design has been widely used to reduce the cost of covariate measurements in large cohort studies. In many such studies, the number of covariates is very large, and the goal of the research is to identify active covariates which have great influence on response. Since the introduction...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8561435/ https://www.ncbi.nlm.nih.gov/pubmed/34417679 http://dx.doi.org/10.1007/s10985-021-09531-7 |
_version_ | 1784593121102790656 |
---|---|
author | Zhang, Jing Zhou, Haibo Liu, Yanyan Cai, Jianwen |
author_facet | Zhang, Jing Zhou, Haibo Liu, Yanyan Cai, Jianwen |
author_sort | Zhang, Jing |
collection | PubMed |
description | The case-cohort design has been widely used to reduce the cost of covariate measurements in large cohort studies. In many such studies, the number of covariates is very large, and the goal of the research is to identify active covariates which have great influence on response. Since the introduction of sure independence screening (SIS), screening procedures have achieved great success in terms of effectively reducing the dimensionality and identifying active covariates. However, commonly used screening methods are based on marginal correlation or its variants, they may fail to identify hidden active variables which are jointly important but are weakly correlated with the response. Moreover, these screening methods are mainly proposed for data under the simple random sampling and can not be directly applied to case-cohort data. In this paper, we consider the ultrahigh-dimensional survival data under the case-cohort design, and propose a conditional screening method by incorporating some important prior known information of active variables. This method can effectively detect hidden active variables. Furthermore, it possesses the sure screening property under some mild regularity conditions and does not require any complicated numerical optimization. We evaluate the finite sample performance of the proposed method via extensive simulation studies and further illustrate the new approach through a real data set from patients with breast cancer. |
format | Online Article Text |
id | pubmed-8561435 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
record_format | MEDLINE/PubMed |
spelling | pubmed-85614352022-10-01 Conditional screening for ultrahigh-dimensional survival data in case-cohort studies Zhang, Jing Zhou, Haibo Liu, Yanyan Cai, Jianwen Lifetime Data Anal Article The case-cohort design has been widely used to reduce the cost of covariate measurements in large cohort studies. In many such studies, the number of covariates is very large, and the goal of the research is to identify active covariates which have great influence on response. Since the introduction of sure independence screening (SIS), screening procedures have achieved great success in terms of effectively reducing the dimensionality and identifying active covariates. However, commonly used screening methods are based on marginal correlation or its variants, they may fail to identify hidden active variables which are jointly important but are weakly correlated with the response. Moreover, these screening methods are mainly proposed for data under the simple random sampling and can not be directly applied to case-cohort data. In this paper, we consider the ultrahigh-dimensional survival data under the case-cohort design, and propose a conditional screening method by incorporating some important prior known information of active variables. This method can effectively detect hidden active variables. Furthermore, it possesses the sure screening property under some mild regularity conditions and does not require any complicated numerical optimization. We evaluate the finite sample performance of the proposed method via extensive simulation studies and further illustrate the new approach through a real data set from patients with breast cancer. 2021-08-20 2021-10 /pmc/articles/PMC8561435/ /pubmed/34417679 http://dx.doi.org/10.1007/s10985-021-09531-7 Text en https://creativecommons.org/licenses/by/4.0/Under no circumstances may this AM be shared or distributed under a Creative Commons or other form of open access license, nor may it be reformatted or enhanced, whether by the Author or third parties. See here for Springer Nature’s terms of use for AM versions of subscription articles: https//www.springernature.com/gp/open-research/policies/accepted-manuscript-terms |
spellingShingle | Article Zhang, Jing Zhou, Haibo Liu, Yanyan Cai, Jianwen Conditional screening for ultrahigh-dimensional survival data in case-cohort studies |
title | Conditional screening for ultrahigh-dimensional survival data in case-cohort studies |
title_full | Conditional screening for ultrahigh-dimensional survival data in case-cohort studies |
title_fullStr | Conditional screening for ultrahigh-dimensional survival data in case-cohort studies |
title_full_unstemmed | Conditional screening for ultrahigh-dimensional survival data in case-cohort studies |
title_short | Conditional screening for ultrahigh-dimensional survival data in case-cohort studies |
title_sort | conditional screening for ultrahigh-dimensional survival data in case-cohort studies |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8561435/ https://www.ncbi.nlm.nih.gov/pubmed/34417679 http://dx.doi.org/10.1007/s10985-021-09531-7 |
work_keys_str_mv | AT zhangjing conditionalscreeningforultrahighdimensionalsurvivaldataincasecohortstudies AT zhouhaibo conditionalscreeningforultrahighdimensionalsurvivaldataincasecohortstudies AT liuyanyan conditionalscreeningforultrahighdimensionalsurvivaldataincasecohortstudies AT caijianwen conditionalscreeningforultrahighdimensionalsurvivaldataincasecohortstudies |