Cargando…

On optimal multiple changepoint algorithms for large data

Many common approaches to detecting changepoints, for example based on statistical criteria such as penalised likelihood or minimum description length, can be formulated in terms of minimising a cost over segmentations. We focus on a class of dynamic programming algorithms that can solve the resulti...

Descripción completa

Detalles Bibliográficos
Autores principales: Maidstone, Robert, Hocking, Toby, Rigaill, Guillem, Fearnhead, Paul
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7175693/
https://www.ncbi.nlm.nih.gov/pubmed/32355427
http://dx.doi.org/10.1007/s11222-016-9636-3
Descripción
Sumario:Many common approaches to detecting changepoints, for example based on statistical criteria such as penalised likelihood or minimum description length, can be formulated in terms of minimising a cost over segmentations. We focus on a class of dynamic programming algorithms that can solve the resulting minimisation problem exactly, and thus find the optimal segmentation under the given statistical criteria. The standard implementation of these dynamic programming methods have a computational cost that scales at least quadratically in the length of the time-series. Recently pruning ideas have been suggested that can speed up the dynamic programming algorithms, whilst still being guaranteed to be optimal, in that they find the true minimum of the cost function. Here we extend these pruning methods, and introduce two new algorithms for segmenting data: FPOP and SNIP. Empirical results show that FPOP is substantially faster than existing dynamic programming methods, and unlike the existing methods its computational efficiency is robust to the number of changepoints in the data. We evaluate the method for detecting copy number variations and observe that FPOP has a computational cost that is even competitive with that of binary segmentation, but can give much more accurate segmentations. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s11222-016-9636-3) contains supplementary material, which is available to authorized users.