Cargando…
The Mann-Kendall-Sneyers test to identify the change points of COVID-19 time series in the United States
BACKGROUND: One critical variable in the time series analysis is the change point, which is the point where an abrupt change occurs in chronologically ordered observations. Existing parametric models for change point detection, such as the linear regression model and the Bayesian model, require that...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9424808/ https://www.ncbi.nlm.nih.gov/pubmed/36042407 http://dx.doi.org/10.1186/s12874-022-01714-6 |
_version_ | 1784778305642168320 |
---|---|
author | Chen, Xiang Wang, Hui Lyu, Weixuan Xu, Ran |
author_facet | Chen, Xiang Wang, Hui Lyu, Weixuan Xu, Ran |
author_sort | Chen, Xiang |
collection | PubMed |
description | BACKGROUND: One critical variable in the time series analysis is the change point, which is the point where an abrupt change occurs in chronologically ordered observations. Existing parametric models for change point detection, such as the linear regression model and the Bayesian model, require that observations are normally distributed and that the trend line cannot have extreme variability. To overcome the limitations of the parametric model, we apply a nonparametric method, the Mann-Kendall-Sneyers (MKS) test, to change point detection for the state-level COVID-19 case time series data of the United States in the early outbreak of the pandemic. METHODS: The MKS test is implemented for change point detection. The forward sequence and the backward sequence are calculated based on the new weekly cases between March 22, 2020 and January 31, 2021 for each of the 50 states. Points of intersection between the two sequences falling within the 95% confidence intervals are identified as the change points. The results are compared with two other change point detection methods, the pruned exact linear time (PELT) method and the regression-based method. Also, an open-access tool by Microsoft Excel is developed to facilitate the model implementation. RESULTS: By applying the MKS test to COVID-19 cases in the United States, we have identified that 30 states (60.0%) have at least one change point within the 95% confidence intervals. Of these states, 26 states have one change point, 4 states (i.e., LA, OH, VA, and WA) have two change points, and one state (GA) has three change points. Additionally, most downward changes appear in the Northeastern states (e.g., CT, MA, NJ, NY) at the first development stage (March 23 through May 31, 2020); most upward changes appear in the Western states (e.g., AZ, CA, CO, NM, WA, WY) and the Midwestern states (e.g., IL, IN, MI, MN, OH, WI) at the third development stage (November 19, 2020 through January 31, 2021). CONCLUSIONS: This study is among the first to explore the potential of the MKS test applied for change point detection of COVID-19 cases. The MKS test is characterized by several advantages, including high computational efficiency, easy implementation, the ability to identify the change of direction, and no assumption for data distribution. However, due to its conservative nature in change point detection and moderate agreement with other methods, we recommend using the MKS test primarily for initial pattern identification and data pruning, especially in large data. With modification, the method can be further applied to other health data, such as injuries, disabilities, and mortalities. |
format | Online Article Text |
id | pubmed-9424808 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-94248082022-08-30 The Mann-Kendall-Sneyers test to identify the change points of COVID-19 time series in the United States Chen, Xiang Wang, Hui Lyu, Weixuan Xu, Ran BMC Med Res Methodol Research BACKGROUND: One critical variable in the time series analysis is the change point, which is the point where an abrupt change occurs in chronologically ordered observations. Existing parametric models for change point detection, such as the linear regression model and the Bayesian model, require that observations are normally distributed and that the trend line cannot have extreme variability. To overcome the limitations of the parametric model, we apply a nonparametric method, the Mann-Kendall-Sneyers (MKS) test, to change point detection for the state-level COVID-19 case time series data of the United States in the early outbreak of the pandemic. METHODS: The MKS test is implemented for change point detection. The forward sequence and the backward sequence are calculated based on the new weekly cases between March 22, 2020 and January 31, 2021 for each of the 50 states. Points of intersection between the two sequences falling within the 95% confidence intervals are identified as the change points. The results are compared with two other change point detection methods, the pruned exact linear time (PELT) method and the regression-based method. Also, an open-access tool by Microsoft Excel is developed to facilitate the model implementation. RESULTS: By applying the MKS test to COVID-19 cases in the United States, we have identified that 30 states (60.0%) have at least one change point within the 95% confidence intervals. Of these states, 26 states have one change point, 4 states (i.e., LA, OH, VA, and WA) have two change points, and one state (GA) has three change points. Additionally, most downward changes appear in the Northeastern states (e.g., CT, MA, NJ, NY) at the first development stage (March 23 through May 31, 2020); most upward changes appear in the Western states (e.g., AZ, CA, CO, NM, WA, WY) and the Midwestern states (e.g., IL, IN, MI, MN, OH, WI) at the third development stage (November 19, 2020 through January 31, 2021). CONCLUSIONS: This study is among the first to explore the potential of the MKS test applied for change point detection of COVID-19 cases. The MKS test is characterized by several advantages, including high computational efficiency, easy implementation, the ability to identify the change of direction, and no assumption for data distribution. However, due to its conservative nature in change point detection and moderate agreement with other methods, we recommend using the MKS test primarily for initial pattern identification and data pruning, especially in large data. With modification, the method can be further applied to other health data, such as injuries, disabilities, and mortalities. BioMed Central 2022-08-30 /pmc/articles/PMC9424808/ /pubmed/36042407 http://dx.doi.org/10.1186/s12874-022-01714-6 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Chen, Xiang Wang, Hui Lyu, Weixuan Xu, Ran The Mann-Kendall-Sneyers test to identify the change points of COVID-19 time series in the United States |
title | The Mann-Kendall-Sneyers test to identify the change points of COVID-19 time series in the United States |
title_full | The Mann-Kendall-Sneyers test to identify the change points of COVID-19 time series in the United States |
title_fullStr | The Mann-Kendall-Sneyers test to identify the change points of COVID-19 time series in the United States |
title_full_unstemmed | The Mann-Kendall-Sneyers test to identify the change points of COVID-19 time series in the United States |
title_short | The Mann-Kendall-Sneyers test to identify the change points of COVID-19 time series in the United States |
title_sort | mann-kendall-sneyers test to identify the change points of covid-19 time series in the united states |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9424808/ https://www.ncbi.nlm.nih.gov/pubmed/36042407 http://dx.doi.org/10.1186/s12874-022-01714-6 |
work_keys_str_mv | AT chenxiang themannkendallsneyerstesttoidentifythechangepointsofcovid19timeseriesintheunitedstates AT wanghui themannkendallsneyerstesttoidentifythechangepointsofcovid19timeseriesintheunitedstates AT lyuweixuan themannkendallsneyerstesttoidentifythechangepointsofcovid19timeseriesintheunitedstates AT xuran themannkendallsneyerstesttoidentifythechangepointsofcovid19timeseriesintheunitedstates AT chenxiang mannkendallsneyerstesttoidentifythechangepointsofcovid19timeseriesintheunitedstates AT wanghui mannkendallsneyerstesttoidentifythechangepointsofcovid19timeseriesintheunitedstates AT lyuweixuan mannkendallsneyerstesttoidentifythechangepointsofcovid19timeseriesintheunitedstates AT xuran mannkendallsneyerstesttoidentifythechangepointsofcovid19timeseriesintheunitedstates |