Cargando…
A CMAC-based scheme for determining membership with classification of text strings
Membership determination of text strings has been an important procedure for analyzing textual data of a tremendous amount, especially when time is a crucial factor. Bloom filter has been a well-known approach for dealing with such a problem because of its succinct structure and simple determination...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer London
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4993807/ https://www.ncbi.nlm.nih.gov/pubmed/27616819 http://dx.doi.org/10.1007/s00521-015-1989-6 |
_version_ | 1782449196969754624 |
---|---|
author | Ma, Heng Tseng, Ying-Chih Chen, Lu-I. |
author_facet | Ma, Heng Tseng, Ying-Chih Chen, Lu-I. |
author_sort | Ma, Heng |
collection | PubMed |
description | Membership determination of text strings has been an important procedure for analyzing textual data of a tremendous amount, especially when time is a crucial factor. Bloom filter has been a well-known approach for dealing with such a problem because of its succinct structure and simple determination procedure. As determination of membership with classification is becoming increasingly desirable, parallel Bloom filters are often implemented for facilitating the additional classification requirement. The parallel Bloom filters, however, tend to produce additional false-positive errors since membership determination must be performed on each of the parallel layers. We propose a scheme based on CMAC, a neural network mapping, which only requires a single-layer calculation to simultaneously obtain information of both the membership and classification. A hash function specifically designed for text strings is also proposed. The proposed scheme could effectively reduce false-positive errors by converging the range of membership acceptance to the minimum for each class during the neural network mapping. Simulation results show that the proposed scheme committed significantly less errors than the benchmark, parallel Bloom filters, with limited and identical memory usage at different classification levels. |
format | Online Article Text |
id | pubmed-4993807 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Springer London |
record_format | MEDLINE/PubMed |
spelling | pubmed-49938072016-09-07 A CMAC-based scheme for determining membership with classification of text strings Ma, Heng Tseng, Ying-Chih Chen, Lu-I. Neural Comput Appl Original Article Membership determination of text strings has been an important procedure for analyzing textual data of a tremendous amount, especially when time is a crucial factor. Bloom filter has been a well-known approach for dealing with such a problem because of its succinct structure and simple determination procedure. As determination of membership with classification is becoming increasingly desirable, parallel Bloom filters are often implemented for facilitating the additional classification requirement. The parallel Bloom filters, however, tend to produce additional false-positive errors since membership determination must be performed on each of the parallel layers. We propose a scheme based on CMAC, a neural network mapping, which only requires a single-layer calculation to simultaneously obtain information of both the membership and classification. A hash function specifically designed for text strings is also proposed. The proposed scheme could effectively reduce false-positive errors by converging the range of membership acceptance to the minimum for each class during the neural network mapping. Simulation results show that the proposed scheme committed significantly less errors than the benchmark, parallel Bloom filters, with limited and identical memory usage at different classification levels. Springer London 2015-07-10 2016 /pmc/articles/PMC4993807/ /pubmed/27616819 http://dx.doi.org/10.1007/s00521-015-1989-6 Text en © The Author(s) 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. |
spellingShingle | Original Article Ma, Heng Tseng, Ying-Chih Chen, Lu-I. A CMAC-based scheme for determining membership with classification of text strings |
title | A CMAC-based scheme for determining membership with classification of text strings |
title_full | A CMAC-based scheme for determining membership with classification of text strings |
title_fullStr | A CMAC-based scheme for determining membership with classification of text strings |
title_full_unstemmed | A CMAC-based scheme for determining membership with classification of text strings |
title_short | A CMAC-based scheme for determining membership with classification of text strings |
title_sort | cmac-based scheme for determining membership with classification of text strings |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4993807/ https://www.ncbi.nlm.nih.gov/pubmed/27616819 http://dx.doi.org/10.1007/s00521-015-1989-6 |
work_keys_str_mv | AT maheng acmacbasedschemefordeterminingmembershipwithclassificationoftextstrings AT tsengyingchih acmacbasedschemefordeterminingmembershipwithclassificationoftextstrings AT chenlui acmacbasedschemefordeterminingmembershipwithclassificationoftextstrings AT maheng cmacbasedschemefordeterminingmembershipwithclassificationoftextstrings AT tsengyingchih cmacbasedschemefordeterminingmembershipwithclassificationoftextstrings AT chenlui cmacbasedschemefordeterminingmembershipwithclassificationoftextstrings |