Cargando…

Algorithms for Anomaly Detection - Lecture 1

<!--HTML-->The concept of statistical anomalies, or outliers, has fascinated experimentalists since the earliest attempts to interpret data. We want to know why some data points don’t seem to belong with the others: perhaps we want to eliminate spurious or unrepresentative data from our model....

Descripción completa

Detalles Bibliográficos
Autor principal: Davis, Michael
Lenguaje:eng
Publicado: 2017
Materias:
Acceso en línea:http://cds.cern.ch/record/2254858
_version_ 1780953669088313344
author Davis, Michael
author_facet Davis, Michael
author_sort Davis, Michael
collection CERN
description <!--HTML-->The concept of statistical anomalies, or outliers, has fascinated experimentalists since the earliest attempts to interpret data. We want to know why some data points don’t seem to belong with the others: perhaps we want to eliminate spurious or unrepresentative data from our model. Or, the anomalies themselves may be what we are interested in: an outlier could represent the symptom of a disease, an attack on a computer network, a scientific discovery, or even an unfaithful partner. We start with some general considerations, such as the relationship between clustering and anomaly detection, the choice between supervised and unsupervised methods, and the difference between global and local anomalies. Then we will survey the most representative anomaly detection algorithms, highlighting what kind of data each approach is best suited to, and discussing their limitations. We will finish with a discussion of the difficulties of anomaly detection in high-dimensional data and some new directions for anomaly detection research.
id cern-2254858
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2017
record_format invenio
spelling cern-22548582022-11-02T22:32:27Zhttp://cds.cern.ch/record/2254858engDavis, MichaelAlgorithms for Anomaly Detection - Lecture 1Inverted CERN School of Computing 2017inverted CSC<!--HTML-->The concept of statistical anomalies, or outliers, has fascinated experimentalists since the earliest attempts to interpret data. We want to know why some data points don’t seem to belong with the others: perhaps we want to eliminate spurious or unrepresentative data from our model. Or, the anomalies themselves may be what we are interested in: an outlier could represent the symptom of a disease, an attack on a computer network, a scientific discovery, or even an unfaithful partner. We start with some general considerations, such as the relationship between clustering and anomaly detection, the choice between supervised and unsupervised methods, and the difference between global and local anomalies. Then we will survey the most representative anomaly detection algorithms, highlighting what kind of data each approach is best suited to, and discussing their limitations. We will finish with a discussion of the difficulties of anomaly detection in high-dimensional data and some new directions for anomaly detection research.oai:cds.cern.ch:22548582017
spellingShingle inverted CSC
Davis, Michael
Algorithms for Anomaly Detection - Lecture 1
title Algorithms for Anomaly Detection - Lecture 1
title_full Algorithms for Anomaly Detection - Lecture 1
title_fullStr Algorithms for Anomaly Detection - Lecture 1
title_full_unstemmed Algorithms for Anomaly Detection - Lecture 1
title_short Algorithms for Anomaly Detection - Lecture 1
title_sort algorithms for anomaly detection - lecture 1
topic inverted CSC
url http://cds.cern.ch/record/2254858
work_keys_str_mv AT davismichael algorithmsforanomalydetectionlecture1
AT davismichael invertedcernschoolofcomputing2017