Cargando…

Correct and stable sorting for overflow streaming data with a limited storage size and a uniprocessor

Tremendous quantities of numeric data have been generated as streams in various cyber ecosystems. Sorting is one of the most fundamental operations to gain knowledge from data. However, due to size restrictions of data storage which includes storage inside and outside CPU with respect to the massive...

Descripción completa

Detalles Bibliográficos
Autores principales: Chaikhan, Suluk, Phimoltares, Suphakant, Lursinsap, Chidchanok
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7959636/
https://www.ncbi.nlm.nih.gov/pubmed/33817005
http://dx.doi.org/10.7717/peerj-cs.355
_version_ 1783664992847921152
author Chaikhan, Suluk
Phimoltares, Suphakant
Lursinsap, Chidchanok
author_facet Chaikhan, Suluk
Phimoltares, Suphakant
Lursinsap, Chidchanok
author_sort Chaikhan, Suluk
collection PubMed
description Tremendous quantities of numeric data have been generated as streams in various cyber ecosystems. Sorting is one of the most fundamental operations to gain knowledge from data. However, due to size restrictions of data storage which includes storage inside and outside CPU with respect to the massive streaming data sources, data can obviously overflow the storage. Consequently, all classic sorting algorithms of the past are incapable of obtaining a correct sorted sequence because data to be sorted cannot be totally stored in the data storage. This paper proposes a new sorting algorithm called streaming data sort for streaming data on a uniprocessor constrained by a limited storage size and the correctness of the sorted order. Data continuously flow into the storage as consecutive chunks with chunk sizes less than the storage size. A theoretical analysis of the space bound and the time complexity is provided. The sorting time complexity is O (n), where n is the number of incoming data. The space complexity is O (M), where M is the storage size. The experimental results show that streaming data sort can handle a million permuted data by using a storage whose size is set as low as 35% of the data size. This proposed concept can be practically applied to various applications in different fields where the data always overflow the working storage and sorting process is needed.
format Online
Article
Text
id pubmed-7959636
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-79596362021-04-02 Correct and stable sorting for overflow streaming data with a limited storage size and a uniprocessor Chaikhan, Suluk Phimoltares, Suphakant Lursinsap, Chidchanok PeerJ Comput Sci Algorithms and Analysis of Algorithms Tremendous quantities of numeric data have been generated as streams in various cyber ecosystems. Sorting is one of the most fundamental operations to gain knowledge from data. However, due to size restrictions of data storage which includes storage inside and outside CPU with respect to the massive streaming data sources, data can obviously overflow the storage. Consequently, all classic sorting algorithms of the past are incapable of obtaining a correct sorted sequence because data to be sorted cannot be totally stored in the data storage. This paper proposes a new sorting algorithm called streaming data sort for streaming data on a uniprocessor constrained by a limited storage size and the correctness of the sorted order. Data continuously flow into the storage as consecutive chunks with chunk sizes less than the storage size. A theoretical analysis of the space bound and the time complexity is provided. The sorting time complexity is O (n), where n is the number of incoming data. The space complexity is O (M), where M is the storage size. The experimental results show that streaming data sort can handle a million permuted data by using a storage whose size is set as low as 35% of the data size. This proposed concept can be practically applied to various applications in different fields where the data always overflow the working storage and sorting process is needed. PeerJ Inc. 2021-02-12 /pmc/articles/PMC7959636/ /pubmed/33817005 http://dx.doi.org/10.7717/peerj-cs.355 Text en © 2021 Chaikhan et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle Algorithms and Analysis of Algorithms
Chaikhan, Suluk
Phimoltares, Suphakant
Lursinsap, Chidchanok
Correct and stable sorting for overflow streaming data with a limited storage size and a uniprocessor
title Correct and stable sorting for overflow streaming data with a limited storage size and a uniprocessor
title_full Correct and stable sorting for overflow streaming data with a limited storage size and a uniprocessor
title_fullStr Correct and stable sorting for overflow streaming data with a limited storage size and a uniprocessor
title_full_unstemmed Correct and stable sorting for overflow streaming data with a limited storage size and a uniprocessor
title_short Correct and stable sorting for overflow streaming data with a limited storage size and a uniprocessor
title_sort correct and stable sorting for overflow streaming data with a limited storage size and a uniprocessor
topic Algorithms and Analysis of Algorithms
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7959636/
https://www.ncbi.nlm.nih.gov/pubmed/33817005
http://dx.doi.org/10.7717/peerj-cs.355
work_keys_str_mv AT chaikhansuluk correctandstablesortingforoverflowstreamingdatawithalimitedstoragesizeandauniprocessor
AT phimoltaressuphakant correctandstablesortingforoverflowstreamingdatawithalimitedstoragesizeandauniprocessor
AT lursinsapchidchanok correctandstablesortingforoverflowstreamingdatawithalimitedstoragesizeandauniprocessor