Cargando…

A CMAC-based scheme for determining membership with classification of text strings

Membership determination of text strings has been an important procedure for analyzing textual data of a tremendous amount, especially when time is a crucial factor. Bloom filter has been a well-known approach for dealing with such a problem because of its succinct structure and simple determination...

Descripción completa

Detalles Bibliográficos
Autores principales: Ma, Heng, Tseng, Ying-Chih, Chen, Lu-I.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer London 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4993807/
https://www.ncbi.nlm.nih.gov/pubmed/27616819
http://dx.doi.org/10.1007/s00521-015-1989-6
_version_ 1782449196969754624
author Ma, Heng
Tseng, Ying-Chih
Chen, Lu-I.
author_facet Ma, Heng
Tseng, Ying-Chih
Chen, Lu-I.
author_sort Ma, Heng
collection PubMed
description Membership determination of text strings has been an important procedure for analyzing textual data of a tremendous amount, especially when time is a crucial factor. Bloom filter has been a well-known approach for dealing with such a problem because of its succinct structure and simple determination procedure. As determination of membership with classification is becoming increasingly desirable, parallel Bloom filters are often implemented for facilitating the additional classification requirement. The parallel Bloom filters, however, tend to produce additional false-positive errors since membership determination must be performed on each of the parallel layers. We propose a scheme based on CMAC, a neural network mapping, which only requires a single-layer calculation to simultaneously obtain information of both the membership and classification. A hash function specifically designed for text strings is also proposed. The proposed scheme could effectively reduce false-positive errors by converging the range of membership acceptance to the minimum for each class during the neural network mapping. Simulation results show that the proposed scheme committed significantly less errors than the benchmark, parallel Bloom filters, with limited and identical memory usage at different classification levels.
format Online
Article
Text
id pubmed-4993807
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Springer London
record_format MEDLINE/PubMed
spelling pubmed-49938072016-09-07 A CMAC-based scheme for determining membership with classification of text strings Ma, Heng Tseng, Ying-Chih Chen, Lu-I. Neural Comput Appl Original Article Membership determination of text strings has been an important procedure for analyzing textual data of a tremendous amount, especially when time is a crucial factor. Bloom filter has been a well-known approach for dealing with such a problem because of its succinct structure and simple determination procedure. As determination of membership with classification is becoming increasingly desirable, parallel Bloom filters are often implemented for facilitating the additional classification requirement. The parallel Bloom filters, however, tend to produce additional false-positive errors since membership determination must be performed on each of the parallel layers. We propose a scheme based on CMAC, a neural network mapping, which only requires a single-layer calculation to simultaneously obtain information of both the membership and classification. A hash function specifically designed for text strings is also proposed. The proposed scheme could effectively reduce false-positive errors by converging the range of membership acceptance to the minimum for each class during the neural network mapping. Simulation results show that the proposed scheme committed significantly less errors than the benchmark, parallel Bloom filters, with limited and identical memory usage at different classification levels. Springer London 2015-07-10 2016 /pmc/articles/PMC4993807/ /pubmed/27616819 http://dx.doi.org/10.1007/s00521-015-1989-6 Text en © The Author(s) 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Original Article
Ma, Heng
Tseng, Ying-Chih
Chen, Lu-I.
A CMAC-based scheme for determining membership with classification of text strings
title A CMAC-based scheme for determining membership with classification of text strings
title_full A CMAC-based scheme for determining membership with classification of text strings
title_fullStr A CMAC-based scheme for determining membership with classification of text strings
title_full_unstemmed A CMAC-based scheme for determining membership with classification of text strings
title_short A CMAC-based scheme for determining membership with classification of text strings
title_sort cmac-based scheme for determining membership with classification of text strings
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4993807/
https://www.ncbi.nlm.nih.gov/pubmed/27616819
http://dx.doi.org/10.1007/s00521-015-1989-6
work_keys_str_mv AT maheng acmacbasedschemefordeterminingmembershipwithclassificationoftextstrings
AT tsengyingchih acmacbasedschemefordeterminingmembershipwithclassificationoftextstrings
AT chenlui acmacbasedschemefordeterminingmembershipwithclassificationoftextstrings
AT maheng cmacbasedschemefordeterminingmembershipwithclassificationoftextstrings
AT tsengyingchih cmacbasedschemefordeterminingmembershipwithclassificationoftextstrings
AT chenlui cmacbasedschemefordeterminingmembershipwithclassificationoftextstrings