Cargando…
Analysis on Optimal Error Exponents of Binary Classification for Source with Multiple Subclasses
We consider a binary classification problem for a test sequence to determine from which source the sequence is generated. The system classifies the test sequence based on empirically observed (training) sequences obtained from unknown sources [Formula: see text] and [Formula: see text]. We analyze t...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9141706/ https://www.ncbi.nlm.nih.gov/pubmed/35626520 http://dx.doi.org/10.3390/e24050635 |
_version_ | 1784715408743333888 |
---|---|
author | Kuramata, Hiroto Yagi, Hideki |
author_facet | Kuramata, Hiroto Yagi, Hideki |
author_sort | Kuramata, Hiroto |
collection | PubMed |
description | We consider a binary classification problem for a test sequence to determine from which source the sequence is generated. The system classifies the test sequence based on empirically observed (training) sequences obtained from unknown sources [Formula: see text] and [Formula: see text]. We analyze the asymptotic fundamental limits of statistical classification for sources with multiple subclasses. We investigate the first- and second-order maximum error exponents under the constraint that the type-I error probability for all pairs of distributions decays exponentially fast and the type-II error probability is upper bounded by a small constant. In this paper, we first give a classifier which achieves the asymptotically maximum error exponent in the class of deterministic classifiers for sources with multiple subclasses, and then provide a characterization of the first-order error exponent. We next provide a characterization of the second-order error exponent in the case where only [Formula: see text] has multiple subclasses but [Formula: see text] does not. We generalize our results to classification in the case that [Formula: see text] and [Formula: see text] are a stationary and memoryless source and a mixed memoryless source with general mixture, respectively. |
format | Online Article Text |
id | pubmed-9141706 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-91417062022-05-28 Analysis on Optimal Error Exponents of Binary Classification for Source with Multiple Subclasses Kuramata, Hiroto Yagi, Hideki Entropy (Basel) Article We consider a binary classification problem for a test sequence to determine from which source the sequence is generated. The system classifies the test sequence based on empirically observed (training) sequences obtained from unknown sources [Formula: see text] and [Formula: see text]. We analyze the asymptotic fundamental limits of statistical classification for sources with multiple subclasses. We investigate the first- and second-order maximum error exponents under the constraint that the type-I error probability for all pairs of distributions decays exponentially fast and the type-II error probability is upper bounded by a small constant. In this paper, we first give a classifier which achieves the asymptotically maximum error exponent in the class of deterministic classifiers for sources with multiple subclasses, and then provide a characterization of the first-order error exponent. We next provide a characterization of the second-order error exponent in the case where only [Formula: see text] has multiple subclasses but [Formula: see text] does not. We generalize our results to classification in the case that [Formula: see text] and [Formula: see text] are a stationary and memoryless source and a mixed memoryless source with general mixture, respectively. MDPI 2022-04-30 /pmc/articles/PMC9141706/ /pubmed/35626520 http://dx.doi.org/10.3390/e24050635 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Kuramata, Hiroto Yagi, Hideki Analysis on Optimal Error Exponents of Binary Classification for Source with Multiple Subclasses |
title | Analysis on Optimal Error Exponents of Binary Classification for Source with Multiple Subclasses |
title_full | Analysis on Optimal Error Exponents of Binary Classification for Source with Multiple Subclasses |
title_fullStr | Analysis on Optimal Error Exponents of Binary Classification for Source with Multiple Subclasses |
title_full_unstemmed | Analysis on Optimal Error Exponents of Binary Classification for Source with Multiple Subclasses |
title_short | Analysis on Optimal Error Exponents of Binary Classification for Source with Multiple Subclasses |
title_sort | analysis on optimal error exponents of binary classification for source with multiple subclasses |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9141706/ https://www.ncbi.nlm.nih.gov/pubmed/35626520 http://dx.doi.org/10.3390/e24050635 |
work_keys_str_mv | AT kuramatahiroto analysisonoptimalerrorexponentsofbinaryclassificationforsourcewithmultiplesubclasses AT yagihideki analysisonoptimalerrorexponentsofbinaryclassificationforsourcewithmultiplesubclasses |