Cargando…

A multi-pattern hash-binary hybrid algorithm for URL matching in the HTTP protocol

In this paper, based on our previous multi-pattern uniform resource locator (URL) binary-matching algorithm called HEM, we propose an improved multi-pattern matching algorithm called MH that is based on hash tables and binary tables. The MH algorithm can be applied to the fields of network security,...

Descripción completa

Detalles Bibliográficos
Autores principales: Zeng, Ping, Tan, Qingping, Meng, Xiankai, Shao, Zeming, Xie, Qinzheng, Yan, Ying, Cao, Wei, Xu, Jianjun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5388485/
https://www.ncbi.nlm.nih.gov/pubmed/28399157
http://dx.doi.org/10.1371/journal.pone.0175500
_version_ 1782521137169694720
author Zeng, Ping
Tan, Qingping
Meng, Xiankai
Shao, Zeming
Xie, Qinzheng
Yan, Ying
Cao, Wei
Xu, Jianjun
author_facet Zeng, Ping
Tan, Qingping
Meng, Xiankai
Shao, Zeming
Xie, Qinzheng
Yan, Ying
Cao, Wei
Xu, Jianjun
author_sort Zeng, Ping
collection PubMed
description In this paper, based on our previous multi-pattern uniform resource locator (URL) binary-matching algorithm called HEM, we propose an improved multi-pattern matching algorithm called MH that is based on hash tables and binary tables. The MH algorithm can be applied to the fields of network security, data analysis, load balancing, cloud robotic communications, and so on—all of which require string matching from a fixed starting position. Our approach effectively solves the performance problems of the classical multi-pattern matching algorithms. This paper explores ways to improve string matching performance under the HTTP protocol by using a hash method combined with a binary method that transforms the symbol-space matching problem into a digital-space numerical-size comparison and hashing problem. The MH approach has a fast matching speed, requires little memory, performs better than both the classical algorithms and HEM for matching fields in an HTTP stream, and it has great promise for use in real-world applications.
format Online
Article
Text
id pubmed-5388485
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-53884852017-05-03 A multi-pattern hash-binary hybrid algorithm for URL matching in the HTTP protocol Zeng, Ping Tan, Qingping Meng, Xiankai Shao, Zeming Xie, Qinzheng Yan, Ying Cao, Wei Xu, Jianjun PLoS One Research Article In this paper, based on our previous multi-pattern uniform resource locator (URL) binary-matching algorithm called HEM, we propose an improved multi-pattern matching algorithm called MH that is based on hash tables and binary tables. The MH algorithm can be applied to the fields of network security, data analysis, load balancing, cloud robotic communications, and so on—all of which require string matching from a fixed starting position. Our approach effectively solves the performance problems of the classical multi-pattern matching algorithms. This paper explores ways to improve string matching performance under the HTTP protocol by using a hash method combined with a binary method that transforms the symbol-space matching problem into a digital-space numerical-size comparison and hashing problem. The MH approach has a fast matching speed, requires little memory, performs better than both the classical algorithms and HEM for matching fields in an HTTP stream, and it has great promise for use in real-world applications. Public Library of Science 2017-04-11 /pmc/articles/PMC5388485/ /pubmed/28399157 http://dx.doi.org/10.1371/journal.pone.0175500 Text en © 2017 Zeng et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Zeng, Ping
Tan, Qingping
Meng, Xiankai
Shao, Zeming
Xie, Qinzheng
Yan, Ying
Cao, Wei
Xu, Jianjun
A multi-pattern hash-binary hybrid algorithm for URL matching in the HTTP protocol
title A multi-pattern hash-binary hybrid algorithm for URL matching in the HTTP protocol
title_full A multi-pattern hash-binary hybrid algorithm for URL matching in the HTTP protocol
title_fullStr A multi-pattern hash-binary hybrid algorithm for URL matching in the HTTP protocol
title_full_unstemmed A multi-pattern hash-binary hybrid algorithm for URL matching in the HTTP protocol
title_short A multi-pattern hash-binary hybrid algorithm for URL matching in the HTTP protocol
title_sort multi-pattern hash-binary hybrid algorithm for url matching in the http protocol
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5388485/
https://www.ncbi.nlm.nih.gov/pubmed/28399157
http://dx.doi.org/10.1371/journal.pone.0175500
work_keys_str_mv AT zengping amultipatternhashbinaryhybridalgorithmforurlmatchinginthehttpprotocol
AT tanqingping amultipatternhashbinaryhybridalgorithmforurlmatchinginthehttpprotocol
AT mengxiankai amultipatternhashbinaryhybridalgorithmforurlmatchinginthehttpprotocol
AT shaozeming amultipatternhashbinaryhybridalgorithmforurlmatchinginthehttpprotocol
AT xieqinzheng amultipatternhashbinaryhybridalgorithmforurlmatchinginthehttpprotocol
AT yanying amultipatternhashbinaryhybridalgorithmforurlmatchinginthehttpprotocol
AT caowei amultipatternhashbinaryhybridalgorithmforurlmatchinginthehttpprotocol
AT xujianjun amultipatternhashbinaryhybridalgorithmforurlmatchinginthehttpprotocol
AT zengping multipatternhashbinaryhybridalgorithmforurlmatchinginthehttpprotocol
AT tanqingping multipatternhashbinaryhybridalgorithmforurlmatchinginthehttpprotocol
AT mengxiankai multipatternhashbinaryhybridalgorithmforurlmatchinginthehttpprotocol
AT shaozeming multipatternhashbinaryhybridalgorithmforurlmatchinginthehttpprotocol
AT xieqinzheng multipatternhashbinaryhybridalgorithmforurlmatchinginthehttpprotocol
AT yanying multipatternhashbinaryhybridalgorithmforurlmatchinginthehttpprotocol
AT caowei multipatternhashbinaryhybridalgorithmforurlmatchinginthehttpprotocol
AT xujianjun multipatternhashbinaryhybridalgorithmforurlmatchinginthehttpprotocol