Cargando…
Correct and stable sorting for overflow streaming data with a limited storage size and a uniprocessor
Tremendous quantities of numeric data have been generated as streams in various cyber ecosystems. Sorting is one of the most fundamental operations to gain knowledge from data. However, due to size restrictions of data storage which includes storage inside and outside CPU with respect to the massive...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7959636/ https://www.ncbi.nlm.nih.gov/pubmed/33817005 http://dx.doi.org/10.7717/peerj-cs.355 |
_version_ | 1783664992847921152 |
---|---|
author | Chaikhan, Suluk Phimoltares, Suphakant Lursinsap, Chidchanok |
author_facet | Chaikhan, Suluk Phimoltares, Suphakant Lursinsap, Chidchanok |
author_sort | Chaikhan, Suluk |
collection | PubMed |
description | Tremendous quantities of numeric data have been generated as streams in various cyber ecosystems. Sorting is one of the most fundamental operations to gain knowledge from data. However, due to size restrictions of data storage which includes storage inside and outside CPU with respect to the massive streaming data sources, data can obviously overflow the storage. Consequently, all classic sorting algorithms of the past are incapable of obtaining a correct sorted sequence because data to be sorted cannot be totally stored in the data storage. This paper proposes a new sorting algorithm called streaming data sort for streaming data on a uniprocessor constrained by a limited storage size and the correctness of the sorted order. Data continuously flow into the storage as consecutive chunks with chunk sizes less than the storage size. A theoretical analysis of the space bound and the time complexity is provided. The sorting time complexity is O (n), where n is the number of incoming data. The space complexity is O (M), where M is the storage size. The experimental results show that streaming data sort can handle a million permuted data by using a storage whose size is set as low as 35% of the data size. This proposed concept can be practically applied to various applications in different fields where the data always overflow the working storage and sorting process is needed. |
format | Online Article Text |
id | pubmed-7959636 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-79596362021-04-02 Correct and stable sorting for overflow streaming data with a limited storage size and a uniprocessor Chaikhan, Suluk Phimoltares, Suphakant Lursinsap, Chidchanok PeerJ Comput Sci Algorithms and Analysis of Algorithms Tremendous quantities of numeric data have been generated as streams in various cyber ecosystems. Sorting is one of the most fundamental operations to gain knowledge from data. However, due to size restrictions of data storage which includes storage inside and outside CPU with respect to the massive streaming data sources, data can obviously overflow the storage. Consequently, all classic sorting algorithms of the past are incapable of obtaining a correct sorted sequence because data to be sorted cannot be totally stored in the data storage. This paper proposes a new sorting algorithm called streaming data sort for streaming data on a uniprocessor constrained by a limited storage size and the correctness of the sorted order. Data continuously flow into the storage as consecutive chunks with chunk sizes less than the storage size. A theoretical analysis of the space bound and the time complexity is provided. The sorting time complexity is O (n), where n is the number of incoming data. The space complexity is O (M), where M is the storage size. The experimental results show that streaming data sort can handle a million permuted data by using a storage whose size is set as low as 35% of the data size. This proposed concept can be practically applied to various applications in different fields where the data always overflow the working storage and sorting process is needed. PeerJ Inc. 2021-02-12 /pmc/articles/PMC7959636/ /pubmed/33817005 http://dx.doi.org/10.7717/peerj-cs.355 Text en © 2021 Chaikhan et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited. |
spellingShingle | Algorithms and Analysis of Algorithms Chaikhan, Suluk Phimoltares, Suphakant Lursinsap, Chidchanok Correct and stable sorting for overflow streaming data with a limited storage size and a uniprocessor |
title | Correct and stable sorting for overflow streaming data with a limited storage size and a uniprocessor |
title_full | Correct and stable sorting for overflow streaming data with a limited storage size and a uniprocessor |
title_fullStr | Correct and stable sorting for overflow streaming data with a limited storage size and a uniprocessor |
title_full_unstemmed | Correct and stable sorting for overflow streaming data with a limited storage size and a uniprocessor |
title_short | Correct and stable sorting for overflow streaming data with a limited storage size and a uniprocessor |
title_sort | correct and stable sorting for overflow streaming data with a limited storage size and a uniprocessor |
topic | Algorithms and Analysis of Algorithms |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7959636/ https://www.ncbi.nlm.nih.gov/pubmed/33817005 http://dx.doi.org/10.7717/peerj-cs.355 |
work_keys_str_mv | AT chaikhansuluk correctandstablesortingforoverflowstreamingdatawithalimitedstoragesizeandauniprocessor AT phimoltaressuphakant correctandstablesortingforoverflowstreamingdatawithalimitedstoragesizeandauniprocessor AT lursinsapchidchanok correctandstablesortingforoverflowstreamingdatawithalimitedstoragesizeandauniprocessor |