Cargando…
Review of statistical methods for survival analysis using genomic data
Survival analysis mainly deals with the time to event, including death, onset of disease, and bankruptcy. The common characteristic of survival analysis is that it contains “censored” data, in which the time to event cannot be completely observed, but instead represents the lower bound of the time t...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Korea Genome Organization
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6944043/ https://www.ncbi.nlm.nih.gov/pubmed/31896241 http://dx.doi.org/10.5808/GI.2019.17.4.e41 |
_version_ | 1783484985917833216 |
---|---|
author | Lee, Seungyeoun Lim, Heeju |
author_facet | Lee, Seungyeoun Lim, Heeju |
author_sort | Lee, Seungyeoun |
collection | PubMed |
description | Survival analysis mainly deals with the time to event, including death, onset of disease, and bankruptcy. The common characteristic of survival analysis is that it contains “censored” data, in which the time to event cannot be completely observed, but instead represents the lower bound of the time to event. Only the occurrence of either time to event or censoring time is observed. Many traditional statistical methods have been effectively used for analyzing survival data with censored observations. However, with the development of high-throughput technologies for producing “omics” data, more advanced statistical methods, such as regularization, should be required to construct the predictive survival model with high-dimensional genomic data. Furthermore, machine learning approaches have been adapted for survival analysis, to fit nonlinear and complex interaction effects between predictors, and achieve more accurate prediction of individual survival probability. Presently, since most clinicians and medical researchers can easily assess statistical programs for analyzing survival data, a review article is helpful for understanding statistical methods used in survival analysis. We review traditional survival methods and regularization methods, with various penalty functions, for the analysis of high-dimensional genomics, and describe machine learning techniques that have been adapted to survival analysis. |
format | Online Article Text |
id | pubmed-6944043 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Korea Genome Organization |
record_format | MEDLINE/PubMed |
spelling | pubmed-69440432020-01-09 Review of statistical methods for survival analysis using genomic data Lee, Seungyeoun Lim, Heeju Genomics Inform Review Article Survival analysis mainly deals with the time to event, including death, onset of disease, and bankruptcy. The common characteristic of survival analysis is that it contains “censored” data, in which the time to event cannot be completely observed, but instead represents the lower bound of the time to event. Only the occurrence of either time to event or censoring time is observed. Many traditional statistical methods have been effectively used for analyzing survival data with censored observations. However, with the development of high-throughput technologies for producing “omics” data, more advanced statistical methods, such as regularization, should be required to construct the predictive survival model with high-dimensional genomic data. Furthermore, machine learning approaches have been adapted for survival analysis, to fit nonlinear and complex interaction effects between predictors, and achieve more accurate prediction of individual survival probability. Presently, since most clinicians and medical researchers can easily assess statistical programs for analyzing survival data, a review article is helpful for understanding statistical methods used in survival analysis. We review traditional survival methods and regularization methods, with various penalty functions, for the analysis of high-dimensional genomics, and describe machine learning techniques that have been adapted to survival analysis. Korea Genome Organization 2019-12-20 /pmc/articles/PMC6944043/ /pubmed/31896241 http://dx.doi.org/10.5808/GI.2019.17.4.e41 Text en (c) 2019, Korea Genome Organization (CC) This is an open-access article distributed under the terms of the Creative Commons Attribution license(https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Review Article Lee, Seungyeoun Lim, Heeju Review of statistical methods for survival analysis using genomic data |
title | Review of statistical methods for survival analysis using genomic data |
title_full | Review of statistical methods for survival analysis using genomic data |
title_fullStr | Review of statistical methods for survival analysis using genomic data |
title_full_unstemmed | Review of statistical methods for survival analysis using genomic data |
title_short | Review of statistical methods for survival analysis using genomic data |
title_sort | review of statistical methods for survival analysis using genomic data |
topic | Review Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6944043/ https://www.ncbi.nlm.nih.gov/pubmed/31896241 http://dx.doi.org/10.5808/GI.2019.17.4.e41 |
work_keys_str_mv | AT leeseungyeoun reviewofstatisticalmethodsforsurvivalanalysisusinggenomicdata AT limheeju reviewofstatisticalmethodsforsurvivalanalysisusinggenomicdata |