Cargando…

The Mann-Kendall-Sneyers test to identify the change points of COVID-19 time series in the United States

BACKGROUND: One critical variable in the time series analysis is the change point, which is the point where an abrupt change occurs in chronologically ordered observations. Existing parametric models for change point detection, such as the linear regression model and the Bayesian model, require that...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Xiang, Wang, Hui, Lyu, Weixuan, Xu, Ran
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9424808/
https://www.ncbi.nlm.nih.gov/pubmed/36042407
http://dx.doi.org/10.1186/s12874-022-01714-6
_version_ 1784778305642168320
author Chen, Xiang
Wang, Hui
Lyu, Weixuan
Xu, Ran
author_facet Chen, Xiang
Wang, Hui
Lyu, Weixuan
Xu, Ran
author_sort Chen, Xiang
collection PubMed
description BACKGROUND: One critical variable in the time series analysis is the change point, which is the point where an abrupt change occurs in chronologically ordered observations. Existing parametric models for change point detection, such as the linear regression model and the Bayesian model, require that observations are normally distributed and that the trend line cannot have extreme variability. To overcome the limitations of the parametric model, we apply a nonparametric method, the Mann-Kendall-Sneyers (MKS) test, to change point detection for the state-level COVID-19 case time series data of the United States in the early outbreak of the pandemic. METHODS: The MKS test is implemented for change point detection. The forward sequence and the backward sequence are calculated based on the new weekly cases between March 22, 2020 and January 31, 2021 for each of the 50 states. Points of intersection between the two sequences falling within the 95% confidence intervals are identified as the change points. The results are compared with two other change point detection methods, the pruned exact linear time (PELT) method and the regression-based method. Also, an open-access tool by Microsoft Excel is developed to facilitate the model implementation. RESULTS: By applying the MKS test to COVID-19 cases in the United States, we have identified that 30 states (60.0%) have at least one change point within the 95% confidence intervals. Of these states, 26 states have one change point, 4 states (i.e., LA, OH, VA, and WA) have two change points, and one state (GA) has three change points. Additionally, most downward changes appear in the Northeastern states (e.g., CT, MA, NJ, NY) at the first development stage (March 23 through May 31, 2020); most upward changes appear in the Western states (e.g., AZ, CA, CO, NM, WA, WY) and the Midwestern states (e.g., IL, IN, MI, MN, OH, WI) at the third development stage (November 19, 2020 through January 31, 2021). CONCLUSIONS: This study is among the first to explore the potential of the MKS test applied for change point detection of COVID-19 cases. The MKS test is characterized by several advantages, including high computational efficiency, easy implementation, the ability to identify the change of direction, and no assumption for data distribution. However, due to its conservative nature in change point detection and moderate agreement with other methods, we recommend using the MKS test primarily for initial pattern identification and data pruning, especially in large data. With modification, the method can be further applied to other health data, such as injuries, disabilities, and mortalities.
format Online
Article
Text
id pubmed-9424808
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-94248082022-08-30 The Mann-Kendall-Sneyers test to identify the change points of COVID-19 time series in the United States Chen, Xiang Wang, Hui Lyu, Weixuan Xu, Ran BMC Med Res Methodol Research BACKGROUND: One critical variable in the time series analysis is the change point, which is the point where an abrupt change occurs in chronologically ordered observations. Existing parametric models for change point detection, such as the linear regression model and the Bayesian model, require that observations are normally distributed and that the trend line cannot have extreme variability. To overcome the limitations of the parametric model, we apply a nonparametric method, the Mann-Kendall-Sneyers (MKS) test, to change point detection for the state-level COVID-19 case time series data of the United States in the early outbreak of the pandemic. METHODS: The MKS test is implemented for change point detection. The forward sequence and the backward sequence are calculated based on the new weekly cases between March 22, 2020 and January 31, 2021 for each of the 50 states. Points of intersection between the two sequences falling within the 95% confidence intervals are identified as the change points. The results are compared with two other change point detection methods, the pruned exact linear time (PELT) method and the regression-based method. Also, an open-access tool by Microsoft Excel is developed to facilitate the model implementation. RESULTS: By applying the MKS test to COVID-19 cases in the United States, we have identified that 30 states (60.0%) have at least one change point within the 95% confidence intervals. Of these states, 26 states have one change point, 4 states (i.e., LA, OH, VA, and WA) have two change points, and one state (GA) has three change points. Additionally, most downward changes appear in the Northeastern states (e.g., CT, MA, NJ, NY) at the first development stage (March 23 through May 31, 2020); most upward changes appear in the Western states (e.g., AZ, CA, CO, NM, WA, WY) and the Midwestern states (e.g., IL, IN, MI, MN, OH, WI) at the third development stage (November 19, 2020 through January 31, 2021). CONCLUSIONS: This study is among the first to explore the potential of the MKS test applied for change point detection of COVID-19 cases. The MKS test is characterized by several advantages, including high computational efficiency, easy implementation, the ability to identify the change of direction, and no assumption for data distribution. However, due to its conservative nature in change point detection and moderate agreement with other methods, we recommend using the MKS test primarily for initial pattern identification and data pruning, especially in large data. With modification, the method can be further applied to other health data, such as injuries, disabilities, and mortalities. BioMed Central 2022-08-30 /pmc/articles/PMC9424808/ /pubmed/36042407 http://dx.doi.org/10.1186/s12874-022-01714-6 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Chen, Xiang
Wang, Hui
Lyu, Weixuan
Xu, Ran
The Mann-Kendall-Sneyers test to identify the change points of COVID-19 time series in the United States
title The Mann-Kendall-Sneyers test to identify the change points of COVID-19 time series in the United States
title_full The Mann-Kendall-Sneyers test to identify the change points of COVID-19 time series in the United States
title_fullStr The Mann-Kendall-Sneyers test to identify the change points of COVID-19 time series in the United States
title_full_unstemmed The Mann-Kendall-Sneyers test to identify the change points of COVID-19 time series in the United States
title_short The Mann-Kendall-Sneyers test to identify the change points of COVID-19 time series in the United States
title_sort mann-kendall-sneyers test to identify the change points of covid-19 time series in the united states
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9424808/
https://www.ncbi.nlm.nih.gov/pubmed/36042407
http://dx.doi.org/10.1186/s12874-022-01714-6
work_keys_str_mv AT chenxiang themannkendallsneyerstesttoidentifythechangepointsofcovid19timeseriesintheunitedstates
AT wanghui themannkendallsneyerstesttoidentifythechangepointsofcovid19timeseriesintheunitedstates
AT lyuweixuan themannkendallsneyerstesttoidentifythechangepointsofcovid19timeseriesintheunitedstates
AT xuran themannkendallsneyerstesttoidentifythechangepointsofcovid19timeseriesintheunitedstates
AT chenxiang mannkendallsneyerstesttoidentifythechangepointsofcovid19timeseriesintheunitedstates
AT wanghui mannkendallsneyerstesttoidentifythechangepointsofcovid19timeseriesintheunitedstates
AT lyuweixuan mannkendallsneyerstesttoidentifythechangepointsofcovid19timeseriesintheunitedstates
AT xuran mannkendallsneyerstesttoidentifythechangepointsofcovid19timeseriesintheunitedstates