Cargando…
Entropy-Based Approach in Selection Exact String-Matching Algorithms
The string-matching paradigm is applied in every computer science and science branch in general. The existence of a plethora of string-matching algorithms makes it hard to choose the best one for any particular case. Expressing, measuring, and testing algorithm efficiency is a challenging task with...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7824336/ https://www.ncbi.nlm.nih.gov/pubmed/33379282 http://dx.doi.org/10.3390/e23010031 |
_version_ | 1783640052437352448 |
---|---|
author | Markić, Ivan Štula, Maja Zorić, Marija Stipaničev, Darko |
author_facet | Markić, Ivan Štula, Maja Zorić, Marija Stipaničev, Darko |
author_sort | Markić, Ivan |
collection | PubMed |
description | The string-matching paradigm is applied in every computer science and science branch in general. The existence of a plethora of string-matching algorithms makes it hard to choose the best one for any particular case. Expressing, measuring, and testing algorithm efficiency is a challenging task with many potential pitfalls. Algorithm efficiency can be measured based on the usage of different resources. In software engineering, algorithmic productivity is a property of an algorithm execution identified with the computational resources the algorithm consumes. Resource usage in algorithm execution could be determined, and for maximum efficiency, the goal is to minimize resource usage. Guided by the fact that standard measures of algorithm efficiency, such as execution time, directly depend on the number of executed actions. Without touching the problematics of computer power consumption or memory, which also depends on the algorithm type and the techniques used in algorithm development, we have developed a methodology which enables the researchers to choose an efficient algorithm for a specific domain. String searching algorithms efficiency is usually observed independently from the domain texts being searched. This research paper aims to present the idea that algorithm efficiency depends on the properties of searched string and properties of the texts being searched, accompanied by the theoretical analysis of the proposed approach. In the proposed methodology, algorithm efficiency is expressed through character comparison count metrics. The character comparison count metrics is a formal quantitative measure independent of algorithm implementation subtleties and computer platform differences. The model is developed for a particular problem domain by using appropriate domain data (patterns and texts) and provides for a specific domain the ranking of algorithms according to the patterns’ entropy. The proposed approach is limited to on-line exact string-matching problems based on information entropy for a search pattern. Meticulous empirical testing depicts the methodology implementation and purports soundness of the methodology. |
format | Online Article Text |
id | pubmed-7824336 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-78243362021-02-24 Entropy-Based Approach in Selection Exact String-Matching Algorithms Markić, Ivan Štula, Maja Zorić, Marija Stipaničev, Darko Entropy (Basel) Article The string-matching paradigm is applied in every computer science and science branch in general. The existence of a plethora of string-matching algorithms makes it hard to choose the best one for any particular case. Expressing, measuring, and testing algorithm efficiency is a challenging task with many potential pitfalls. Algorithm efficiency can be measured based on the usage of different resources. In software engineering, algorithmic productivity is a property of an algorithm execution identified with the computational resources the algorithm consumes. Resource usage in algorithm execution could be determined, and for maximum efficiency, the goal is to minimize resource usage. Guided by the fact that standard measures of algorithm efficiency, such as execution time, directly depend on the number of executed actions. Without touching the problematics of computer power consumption or memory, which also depends on the algorithm type and the techniques used in algorithm development, we have developed a methodology which enables the researchers to choose an efficient algorithm for a specific domain. String searching algorithms efficiency is usually observed independently from the domain texts being searched. This research paper aims to present the idea that algorithm efficiency depends on the properties of searched string and properties of the texts being searched, accompanied by the theoretical analysis of the proposed approach. In the proposed methodology, algorithm efficiency is expressed through character comparison count metrics. The character comparison count metrics is a formal quantitative measure independent of algorithm implementation subtleties and computer platform differences. The model is developed for a particular problem domain by using appropriate domain data (patterns and texts) and provides for a specific domain the ranking of algorithms according to the patterns’ entropy. The proposed approach is limited to on-line exact string-matching problems based on information entropy for a search pattern. Meticulous empirical testing depicts the methodology implementation and purports soundness of the methodology. MDPI 2020-12-28 /pmc/articles/PMC7824336/ /pubmed/33379282 http://dx.doi.org/10.3390/e23010031 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Markić, Ivan Štula, Maja Zorić, Marija Stipaničev, Darko Entropy-Based Approach in Selection Exact String-Matching Algorithms |
title | Entropy-Based Approach in Selection Exact String-Matching Algorithms |
title_full | Entropy-Based Approach in Selection Exact String-Matching Algorithms |
title_fullStr | Entropy-Based Approach in Selection Exact String-Matching Algorithms |
title_full_unstemmed | Entropy-Based Approach in Selection Exact String-Matching Algorithms |
title_short | Entropy-Based Approach in Selection Exact String-Matching Algorithms |
title_sort | entropy-based approach in selection exact string-matching algorithms |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7824336/ https://www.ncbi.nlm.nih.gov/pubmed/33379282 http://dx.doi.org/10.3390/e23010031 |
work_keys_str_mv | AT markicivan entropybasedapproachinselectionexactstringmatchingalgorithms AT stulamaja entropybasedapproachinselectionexactstringmatchingalgorithms AT zoricmarija entropybasedapproachinselectionexactstringmatchingalgorithms AT stipanicevdarko entropybasedapproachinselectionexactstringmatchingalgorithms |