Cargando…

Extending the Generalised Pareto Distribution for Novelty Detection in High-Dimensional Spaces

Novelty detection involves the construction of a “model of normality”, and then classifies test data as being either “normal” or “abnormal” with respect to that model. For this reason, it is often termed one-class classification. The approach is suitable for cases in which examples of “normal” behav...

Descripción completa

Detalles Bibliográficos
Autores principales: Clifton, David A., Clifton, Lei, Hugueny, Samuel, Tarassenko, Lionel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3963457/
https://www.ncbi.nlm.nih.gov/pubmed/24683434
http://dx.doi.org/10.1007/s11265-013-0835-2
_version_ 1782308513125498880
author Clifton, David A.
Clifton, Lei
Hugueny, Samuel
Tarassenko, Lionel
author_facet Clifton, David A.
Clifton, Lei
Hugueny, Samuel
Tarassenko, Lionel
author_sort Clifton, David A.
collection PubMed
description Novelty detection involves the construction of a “model of normality”, and then classifies test data as being either “normal” or “abnormal” with respect to that model. For this reason, it is often termed one-class classification. The approach is suitable for cases in which examples of “normal” behaviour are commonly available, but in which cases of “abnormal” data are comparatively rare. When performing novelty detection, we are typically most interested in the tails of the normal model, because it is in these tails that a decision boundary between “normal” and “abnormal” areas of data space usually lies. Extreme value statistics provides an appropriate theoretical framework for modelling the tails of univariate (or low-dimensional) distributions, using the generalised Pareto distribution (GPD), which can be demonstrated to be the limiting distribution for data occurring within the tails of most practically-encountered probability distributions. This paper provides an extension of the GPD, allowing the modelling of probability distributions of arbitrarily high dimension, such as occurs when using complex, multimodel, multivariate distributions for performing novelty detection in most real-life cases. We demonstrate our extension to the GPD using examples from patient physiological monitoring, in which we have acquired data from hospital patients in large clinical studies of high-acuity wards, and in which we wish to determine “abnormal” patient data, such that early warning of patient physiological deterioration may be provided.
format Online
Article
Text
id pubmed-3963457
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-39634572014-03-28 Extending the Generalised Pareto Distribution for Novelty Detection in High-Dimensional Spaces Clifton, David A. Clifton, Lei Hugueny, Samuel Tarassenko, Lionel J Signal Process Syst Article Novelty detection involves the construction of a “model of normality”, and then classifies test data as being either “normal” or “abnormal” with respect to that model. For this reason, it is often termed one-class classification. The approach is suitable for cases in which examples of “normal” behaviour are commonly available, but in which cases of “abnormal” data are comparatively rare. When performing novelty detection, we are typically most interested in the tails of the normal model, because it is in these tails that a decision boundary between “normal” and “abnormal” areas of data space usually lies. Extreme value statistics provides an appropriate theoretical framework for modelling the tails of univariate (or low-dimensional) distributions, using the generalised Pareto distribution (GPD), which can be demonstrated to be the limiting distribution for data occurring within the tails of most practically-encountered probability distributions. This paper provides an extension of the GPD, allowing the modelling of probability distributions of arbitrarily high dimension, such as occurs when using complex, multimodel, multivariate distributions for performing novelty detection in most real-life cases. We demonstrate our extension to the GPD using examples from patient physiological monitoring, in which we have acquired data from hospital patients in large clinical studies of high-acuity wards, and in which we wish to determine “abnormal” patient data, such that early warning of patient physiological deterioration may be provided. Springer US 2013-08-16 2014 /pmc/articles/PMC3963457/ /pubmed/24683434 http://dx.doi.org/10.1007/s11265-013-0835-2 Text en © The Author(s) 2013 https://creativecommons.org/licenses/by/2.0/ Open AccessThis article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.
spellingShingle Article
Clifton, David A.
Clifton, Lei
Hugueny, Samuel
Tarassenko, Lionel
Extending the Generalised Pareto Distribution for Novelty Detection in High-Dimensional Spaces
title Extending the Generalised Pareto Distribution for Novelty Detection in High-Dimensional Spaces
title_full Extending the Generalised Pareto Distribution for Novelty Detection in High-Dimensional Spaces
title_fullStr Extending the Generalised Pareto Distribution for Novelty Detection in High-Dimensional Spaces
title_full_unstemmed Extending the Generalised Pareto Distribution for Novelty Detection in High-Dimensional Spaces
title_short Extending the Generalised Pareto Distribution for Novelty Detection in High-Dimensional Spaces
title_sort extending the generalised pareto distribution for novelty detection in high-dimensional spaces
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3963457/
https://www.ncbi.nlm.nih.gov/pubmed/24683434
http://dx.doi.org/10.1007/s11265-013-0835-2
work_keys_str_mv AT cliftondavida extendingthegeneralisedparetodistributionfornoveltydetectioninhighdimensionalspaces
AT cliftonlei extendingthegeneralisedparetodistributionfornoveltydetectioninhighdimensionalspaces
AT huguenysamuel extendingthegeneralisedparetodistributionfornoveltydetectioninhighdimensionalspaces
AT tarassenkolionel extendingthegeneralisedparetodistributionfornoveltydetectioninhighdimensionalspaces