Cargando…

Machine learning on small size samples: A synthetic knowledge synthesis

Machine Learning is an increasingly important technology dealing with the growing complexity of the digitalised world. Despite the fact, that we live in a ‘Big data’ world where, almost ‘everything’ is digitally stored, there are many real-world situations, where researchers are still faced with sma...

Descripción completa

Detalles Bibliográficos
Autores principales: Kokol, Peter, Kokol, Marko, Zagoranski, Sašo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10358596/
https://www.ncbi.nlm.nih.gov/pubmed/35220816
http://dx.doi.org/10.1177/00368504211029777
_version_ 1785075698590810112
author Kokol, Peter
Kokol, Marko
Zagoranski, Sašo
author_facet Kokol, Peter
Kokol, Marko
Zagoranski, Sašo
author_sort Kokol, Peter
collection PubMed
description Machine Learning is an increasingly important technology dealing with the growing complexity of the digitalised world. Despite the fact, that we live in a ‘Big data’ world where, almost ‘everything’ is digitally stored, there are many real-world situations, where researchers are still faced with small data samples. The present bibliometric knowledge synthesis study aims to answer the research question ‘What is the small data problem in machine learning and how it is solved?’ The analysis a positive trend in the number of research publications and substantial growth of the research community, indicating that the research field is reaching maturity. Most productive countries are China, United States and United Kingdom. Despite notable international cooperation, the regional concentration of research literature production in economically more developed countries was observed. Thematic analysis identified four research themes. The themes are concerned with to dimension reduction in complex big data analysis, data augmentation techniques in deep learning, data mining and statistical learning on small datasets.
format Online
Article
Text
id pubmed-10358596
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-103585962023-08-09 Machine learning on small size samples: A synthetic knowledge synthesis Kokol, Peter Kokol, Marko Zagoranski, Sašo Sci Prog Review Machine Learning is an increasingly important technology dealing with the growing complexity of the digitalised world. Despite the fact, that we live in a ‘Big data’ world where, almost ‘everything’ is digitally stored, there are many real-world situations, where researchers are still faced with small data samples. The present bibliometric knowledge synthesis study aims to answer the research question ‘What is the small data problem in machine learning and how it is solved?’ The analysis a positive trend in the number of research publications and substantial growth of the research community, indicating that the research field is reaching maturity. Most productive countries are China, United States and United Kingdom. Despite notable international cooperation, the regional concentration of research literature production in economically more developed countries was observed. Thematic analysis identified four research themes. The themes are concerned with to dimension reduction in complex big data analysis, data augmentation techniques in deep learning, data mining and statistical learning on small datasets. SAGE Publications 2022-02-27 /pmc/articles/PMC10358596/ /pubmed/35220816 http://dx.doi.org/10.1177/00368504211029777 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by-nc/4.0/This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage)
spellingShingle Review
Kokol, Peter
Kokol, Marko
Zagoranski, Sašo
Machine learning on small size samples: A synthetic knowledge synthesis
title Machine learning on small size samples: A synthetic knowledge synthesis
title_full Machine learning on small size samples: A synthetic knowledge synthesis
title_fullStr Machine learning on small size samples: A synthetic knowledge synthesis
title_full_unstemmed Machine learning on small size samples: A synthetic knowledge synthesis
title_short Machine learning on small size samples: A synthetic knowledge synthesis
title_sort machine learning on small size samples: a synthetic knowledge synthesis
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10358596/
https://www.ncbi.nlm.nih.gov/pubmed/35220816
http://dx.doi.org/10.1177/00368504211029777
work_keys_str_mv AT kokolpeter machinelearningonsmallsizesamplesasyntheticknowledgesynthesis
AT kokolmarko machinelearningonsmallsizesamplesasyntheticknowledgesynthesis
AT zagoranskisaso machinelearningonsmallsizesamplesasyntheticknowledgesynthesis