Cargando…
Machine learning on small size samples: A synthetic knowledge synthesis
Machine Learning is an increasingly important technology dealing with the growing complexity of the digitalised world. Despite the fact, that we live in a ‘Big data’ world where, almost ‘everything’ is digitally stored, there are many real-world situations, where researchers are still faced with sma...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
SAGE Publications
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10358596/ https://www.ncbi.nlm.nih.gov/pubmed/35220816 http://dx.doi.org/10.1177/00368504211029777 |
_version_ | 1785075698590810112 |
---|---|
author | Kokol, Peter Kokol, Marko Zagoranski, Sašo |
author_facet | Kokol, Peter Kokol, Marko Zagoranski, Sašo |
author_sort | Kokol, Peter |
collection | PubMed |
description | Machine Learning is an increasingly important technology dealing with the growing complexity of the digitalised world. Despite the fact, that we live in a ‘Big data’ world where, almost ‘everything’ is digitally stored, there are many real-world situations, where researchers are still faced with small data samples. The present bibliometric knowledge synthesis study aims to answer the research question ‘What is the small data problem in machine learning and how it is solved?’ The analysis a positive trend in the number of research publications and substantial growth of the research community, indicating that the research field is reaching maturity. Most productive countries are China, United States and United Kingdom. Despite notable international cooperation, the regional concentration of research literature production in economically more developed countries was observed. Thematic analysis identified four research themes. The themes are concerned with to dimension reduction in complex big data analysis, data augmentation techniques in deep learning, data mining and statistical learning on small datasets. |
format | Online Article Text |
id | pubmed-10358596 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | SAGE Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-103585962023-08-09 Machine learning on small size samples: A synthetic knowledge synthesis Kokol, Peter Kokol, Marko Zagoranski, Sašo Sci Prog Review Machine Learning is an increasingly important technology dealing with the growing complexity of the digitalised world. Despite the fact, that we live in a ‘Big data’ world where, almost ‘everything’ is digitally stored, there are many real-world situations, where researchers are still faced with small data samples. The present bibliometric knowledge synthesis study aims to answer the research question ‘What is the small data problem in machine learning and how it is solved?’ The analysis a positive trend in the number of research publications and substantial growth of the research community, indicating that the research field is reaching maturity. Most productive countries are China, United States and United Kingdom. Despite notable international cooperation, the regional concentration of research literature production in economically more developed countries was observed. Thematic analysis identified four research themes. The themes are concerned with to dimension reduction in complex big data analysis, data augmentation techniques in deep learning, data mining and statistical learning on small datasets. SAGE Publications 2022-02-27 /pmc/articles/PMC10358596/ /pubmed/35220816 http://dx.doi.org/10.1177/00368504211029777 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by-nc/4.0/This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage) |
spellingShingle | Review Kokol, Peter Kokol, Marko Zagoranski, Sašo Machine learning on small size samples: A synthetic knowledge synthesis |
title | Machine learning on small size samples: A synthetic knowledge synthesis |
title_full | Machine learning on small size samples: A synthetic knowledge synthesis |
title_fullStr | Machine learning on small size samples: A synthetic knowledge synthesis |
title_full_unstemmed | Machine learning on small size samples: A synthetic knowledge synthesis |
title_short | Machine learning on small size samples: A synthetic knowledge synthesis |
title_sort | machine learning on small size samples: a synthetic knowledge synthesis |
topic | Review |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10358596/ https://www.ncbi.nlm.nih.gov/pubmed/35220816 http://dx.doi.org/10.1177/00368504211029777 |
work_keys_str_mv | AT kokolpeter machinelearningonsmallsizesamplesasyntheticknowledgesynthesis AT kokolmarko machinelearningonsmallsizesamplesasyntheticknowledgesynthesis AT zagoranskisaso machinelearningonsmallsizesamplesasyntheticknowledgesynthesis |