Cargando…

Improved classification rates for localized algorithms under margin conditions

Support vector machines (SVMs) are one of the most successful algorithms on small and medium-sized data sets, but on large-scale data sets their training and predictions become computationally infeasible. The author considers a spatially defined data chunking method for large-scale learning problems...

Descripción completa

Detalles Bibliográficos
Autor principal: Blaschzyk, Ingrid Karin
Lenguaje:eng
Publicado: Springer 2020
Materias:
Acceso en línea:https://dx.doi.org/10.1007/978-3-658-29591-2
http://cds.cern.ch/record/2717179
_version_ 1780965585218174976
author Blaschzyk, Ingrid Karin
author_facet Blaschzyk, Ingrid Karin
author_sort Blaschzyk, Ingrid Karin
collection CERN
description Support vector machines (SVMs) are one of the most successful algorithms on small and medium-sized data sets, but on large-scale data sets their training and predictions become computationally infeasible. The author considers a spatially defined data chunking method for large-scale learning problems, leading to so-called localized SVMs, and implements an in-depth mathematical analysis with theoretical guarantees, which in particular include classification rates. The statistical analysis relies on a new and simple partitioning based technique and takes well-known margin conditions into account that describe the behavior of the data-generating distribution. It turns out that the rates outperform known rates of several other learning algorithms under suitable sets of assumptions. From a practical point of view, the author shows that a common training and validation procedure achieves the theoretical rates adaptively, that is, without knowing the margin parameters in advance. Contents Introduction to Statistical Learning Theory Histogram Rule: Oracle Inequality and Learning Rates Localized SVMs: Oracle Inequalities and Learning Rates Target Groups Researchers, students, and practitioners in the fields of mathematics and computer sciences who focus on machine learning or statistical learning theory The Author Ingrid Karin Blaschzyk is a postdoctoral researcher in the Department of Mathematics at the University of Stuttgart, Germany.
id cern-2717179
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2020
publisher Springer
record_format invenio
spelling cern-27171792021-04-21T18:08:06Zdoi:10.1007/978-3-658-29591-2http://cds.cern.ch/record/2717179engBlaschzyk, Ingrid KarinImproved classification rates for localized algorithms under margin conditionsMathematical Physics and MathematicsSupport vector machines (SVMs) are one of the most successful algorithms on small and medium-sized data sets, but on large-scale data sets their training and predictions become computationally infeasible. The author considers a spatially defined data chunking method for large-scale learning problems, leading to so-called localized SVMs, and implements an in-depth mathematical analysis with theoretical guarantees, which in particular include classification rates. The statistical analysis relies on a new and simple partitioning based technique and takes well-known margin conditions into account that describe the behavior of the data-generating distribution. It turns out that the rates outperform known rates of several other learning algorithms under suitable sets of assumptions. From a practical point of view, the author shows that a common training and validation procedure achieves the theoretical rates adaptively, that is, without knowing the margin parameters in advance. Contents Introduction to Statistical Learning Theory Histogram Rule: Oracle Inequality and Learning Rates Localized SVMs: Oracle Inequalities and Learning Rates Target Groups Researchers, students, and practitioners in the fields of mathematics and computer sciences who focus on machine learning or statistical learning theory The Author Ingrid Karin Blaschzyk is a postdoctoral researcher in the Department of Mathematics at the University of Stuttgart, Germany.Springeroai:cds.cern.ch:27171792020
spellingShingle Mathematical Physics and Mathematics
Blaschzyk, Ingrid Karin
Improved classification rates for localized algorithms under margin conditions
title Improved classification rates for localized algorithms under margin conditions
title_full Improved classification rates for localized algorithms under margin conditions
title_fullStr Improved classification rates for localized algorithms under margin conditions
title_full_unstemmed Improved classification rates for localized algorithms under margin conditions
title_short Improved classification rates for localized algorithms under margin conditions
title_sort improved classification rates for localized algorithms under margin conditions
topic Mathematical Physics and Mathematics
url https://dx.doi.org/10.1007/978-3-658-29591-2
http://cds.cern.ch/record/2717179
work_keys_str_mv AT blaschzykingridkarin improvedclassificationratesforlocalizedalgorithmsundermarginconditions