Cargando…

On the stability of log-rank test under labeling errors

MOTIVATION: Log-rank test is a widely used test that serves to assess the statistical significance of observed differences in survival, when comparing two or more groups. The log-rank test is based on several assumptions that support the validity of the calculations. It is naturally assumed, implici...

Descripción completa

Detalles Bibliográficos
Autores principales: Galili, Ben, Samohi, Anat, Yakhini, Zohar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8652036/
https://www.ncbi.nlm.nih.gov/pubmed/34255820
http://dx.doi.org/10.1093/bioinformatics/btab495
_version_ 1784611504896606208
author Galili, Ben
Samohi, Anat
Yakhini, Zohar
author_facet Galili, Ben
Samohi, Anat
Yakhini, Zohar
author_sort Galili, Ben
collection PubMed
description MOTIVATION: Log-rank test is a widely used test that serves to assess the statistical significance of observed differences in survival, when comparing two or more groups. The log-rank test is based on several assumptions that support the validity of the calculations. It is naturally assumed, implicitly, that no errors occur in the labeling of the samples. That is, the mapping between samples and groups is perfectly correct. In this work, we investigate how test results may be affected when considering some errors in the original labeling. RESULTS: We introduce and define the uncertainty that arises from labeling errors in log-rank test. In order to deal with this uncertainty, we develop a novel algorithm for efficiently calculating a stability interval around the original log-rank P-value and prove its correctness. We demonstrate our algorithm on several datasets. AVAILABILITY AND IMPLEMENTATION: We provide a Python implementation, called LoRSI, for calculating the stability interval using our algorithm https://github.com/YakhiniGroup/LoRSI. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-8652036
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-86520362021-12-08 On the stability of log-rank test under labeling errors Galili, Ben Samohi, Anat Yakhini, Zohar Bioinformatics Original Papers MOTIVATION: Log-rank test is a widely used test that serves to assess the statistical significance of observed differences in survival, when comparing two or more groups. The log-rank test is based on several assumptions that support the validity of the calculations. It is naturally assumed, implicitly, that no errors occur in the labeling of the samples. That is, the mapping between samples and groups is perfectly correct. In this work, we investigate how test results may be affected when considering some errors in the original labeling. RESULTS: We introduce and define the uncertainty that arises from labeling errors in log-rank test. In order to deal with this uncertainty, we develop a novel algorithm for efficiently calculating a stability interval around the original log-rank P-value and prove its correctness. We demonstrate our algorithm on several datasets. AVAILABILITY AND IMPLEMENTATION: We provide a Python implementation, called LoRSI, for calculating the stability interval using our algorithm https://github.com/YakhiniGroup/LoRSI. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2021-07-13 /pmc/articles/PMC8652036/ /pubmed/34255820 http://dx.doi.org/10.1093/bioinformatics/btab495 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
Galili, Ben
Samohi, Anat
Yakhini, Zohar
On the stability of log-rank test under labeling errors
title On the stability of log-rank test under labeling errors
title_full On the stability of log-rank test under labeling errors
title_fullStr On the stability of log-rank test under labeling errors
title_full_unstemmed On the stability of log-rank test under labeling errors
title_short On the stability of log-rank test under labeling errors
title_sort on the stability of log-rank test under labeling errors
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8652036/
https://www.ncbi.nlm.nih.gov/pubmed/34255820
http://dx.doi.org/10.1093/bioinformatics/btab495
work_keys_str_mv AT galiliben onthestabilityoflogranktestunderlabelingerrors
AT samohianat onthestabilityoflogranktestunderlabelingerrors
AT yakhinizohar onthestabilityoflogranktestunderlabelingerrors