Cargando…

Review of statistical methods for survival analysis using genomic data

Survival analysis mainly deals with the time to event, including death, onset of disease, and bankruptcy. The common characteristic of survival analysis is that it contains “censored” data, in which the time to event cannot be completely observed, but instead represents the lower bound of the time t...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Seungyeoun, Lim, Heeju
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Korea Genome Organization 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6944043/
https://www.ncbi.nlm.nih.gov/pubmed/31896241
http://dx.doi.org/10.5808/GI.2019.17.4.e41
_version_ 1783484985917833216
author Lee, Seungyeoun
Lim, Heeju
author_facet Lee, Seungyeoun
Lim, Heeju
author_sort Lee, Seungyeoun
collection PubMed
description Survival analysis mainly deals with the time to event, including death, onset of disease, and bankruptcy. The common characteristic of survival analysis is that it contains “censored” data, in which the time to event cannot be completely observed, but instead represents the lower bound of the time to event. Only the occurrence of either time to event or censoring time is observed. Many traditional statistical methods have been effectively used for analyzing survival data with censored observations. However, with the development of high-throughput technologies for producing “omics” data, more advanced statistical methods, such as regularization, should be required to construct the predictive survival model with high-dimensional genomic data. Furthermore, machine learning approaches have been adapted for survival analysis, to fit nonlinear and complex interaction effects between predictors, and achieve more accurate prediction of individual survival probability. Presently, since most clinicians and medical researchers can easily assess statistical programs for analyzing survival data, a review article is helpful for understanding statistical methods used in survival analysis. We review traditional survival methods and regularization methods, with various penalty functions, for the analysis of high-dimensional genomics, and describe machine learning techniques that have been adapted to survival analysis.
format Online
Article
Text
id pubmed-6944043
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Korea Genome Organization
record_format MEDLINE/PubMed
spelling pubmed-69440432020-01-09 Review of statistical methods for survival analysis using genomic data Lee, Seungyeoun Lim, Heeju Genomics Inform Review Article Survival analysis mainly deals with the time to event, including death, onset of disease, and bankruptcy. The common characteristic of survival analysis is that it contains “censored” data, in which the time to event cannot be completely observed, but instead represents the lower bound of the time to event. Only the occurrence of either time to event or censoring time is observed. Many traditional statistical methods have been effectively used for analyzing survival data with censored observations. However, with the development of high-throughput technologies for producing “omics” data, more advanced statistical methods, such as regularization, should be required to construct the predictive survival model with high-dimensional genomic data. Furthermore, machine learning approaches have been adapted for survival analysis, to fit nonlinear and complex interaction effects between predictors, and achieve more accurate prediction of individual survival probability. Presently, since most clinicians and medical researchers can easily assess statistical programs for analyzing survival data, a review article is helpful for understanding statistical methods used in survival analysis. We review traditional survival methods and regularization methods, with various penalty functions, for the analysis of high-dimensional genomics, and describe machine learning techniques that have been adapted to survival analysis. Korea Genome Organization 2019-12-20 /pmc/articles/PMC6944043/ /pubmed/31896241 http://dx.doi.org/10.5808/GI.2019.17.4.e41 Text en (c) 2019, Korea Genome Organization (CC) This is an open-access article distributed under the terms of the Creative Commons Attribution license(https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Review Article
Lee, Seungyeoun
Lim, Heeju
Review of statistical methods for survival analysis using genomic data
title Review of statistical methods for survival analysis using genomic data
title_full Review of statistical methods for survival analysis using genomic data
title_fullStr Review of statistical methods for survival analysis using genomic data
title_full_unstemmed Review of statistical methods for survival analysis using genomic data
title_short Review of statistical methods for survival analysis using genomic data
title_sort review of statistical methods for survival analysis using genomic data
topic Review Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6944043/
https://www.ncbi.nlm.nih.gov/pubmed/31896241
http://dx.doi.org/10.5808/GI.2019.17.4.e41
work_keys_str_mv AT leeseungyeoun reviewofstatisticalmethodsforsurvivalanalysisusinggenomicdata
AT limheeju reviewofstatisticalmethodsforsurvivalanalysisusinggenomicdata