Cargando…

Performance evaluation of machine learning models on large dataset of android applications reviews

With an ever-increasing number of mobile users, the development of mobile applications (apps) has become a potential market during the past decade. Billions of users download mobile apps for divergent use from Google Play Store, fulfill tasks and leave comments about their experience. Such reviews a...

Descripción completa

Detalles Bibliográficos
Autores principales:	Qureshi, Ali Adil, Ahmad, Maqsood, Ullah, Saleem, Yasir, Muhammad Naveed, Rustam, Furqan, Ashraf, Imran
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Springer US 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10024295/ https://www.ncbi.nlm.nih.gov/pubmed/37362743 http://dx.doi.org/10.1007/s11042-023-14713-6

_version_	1784909071007088640
author	Qureshi, Ali Adil Ahmad, Maqsood Ullah, Saleem Yasir, Muhammad Naveed Rustam, Furqan Ashraf, Imran
author_facet	Qureshi, Ali Adil Ahmad, Maqsood Ullah, Saleem Yasir, Muhammad Naveed Rustam, Furqan Ashraf, Imran
author_sort	Qureshi, Ali Adil
collection	PubMed
description	With an ever-increasing number of mobile users, the development of mobile applications (apps) has become a potential market during the past decade. Billions of users download mobile apps for divergent use from Google Play Store, fulfill tasks and leave comments about their experience. Such reviews are replete with a variety of feedback that serves as a guide for the improvement of existing apps and intuition for novel mobile apps. However, application reviews are challenging and very broad to approach. Such reviews, when segregated into different classes guide the user in the selection of suitable apps. This study proposes a framework for analyzing the sentiment of reviews for apps of eight different categories like shopping, sports, casual, etc. A large dataset is scrapped comprising 251661 user reviews with the help of ‘Regular Expression’ and ‘Beautiful Soup’. The framework follows the use of different machine learning models along with the term frequency-inverse document frequency (TF-IDF) for feature extraction. Extensive experiments are performed using preprocessing steps, as well as, the stats feature of app reviews to evaluate the performance of the models. Results indicate that combining the stats feature with TF-IDF shows better performance and the support vector machine obtains the highest accuracy. Experimental results can potentially be used by other researchers to select appropriate models for the analysis of app reviews. In addition, the provided dataset is large, diverse, and balanced with eight categories and 59 app reviews and provides the opportunity to analyze reviews using state-of-the-art approaches.
format	Online Article Text
id	pubmed-10024295
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Springer US
record_format	MEDLINE/PubMed
spelling	pubmed-100242952023-03-21 Performance evaluation of machine learning models on large dataset of android applications reviews Qureshi, Ali Adil Ahmad, Maqsood Ullah, Saleem Yasir, Muhammad Naveed Rustam, Furqan Ashraf, Imran Multimed Tools Appl Article With an ever-increasing number of mobile users, the development of mobile applications (apps) has become a potential market during the past decade. Billions of users download mobile apps for divergent use from Google Play Store, fulfill tasks and leave comments about their experience. Such reviews are replete with a variety of feedback that serves as a guide for the improvement of existing apps and intuition for novel mobile apps. However, application reviews are challenging and very broad to approach. Such reviews, when segregated into different classes guide the user in the selection of suitable apps. This study proposes a framework for analyzing the sentiment of reviews for apps of eight different categories like shopping, sports, casual, etc. A large dataset is scrapped comprising 251661 user reviews with the help of ‘Regular Expression’ and ‘Beautiful Soup’. The framework follows the use of different machine learning models along with the term frequency-inverse document frequency (TF-IDF) for feature extraction. Extensive experiments are performed using preprocessing steps, as well as, the stats feature of app reviews to evaluate the performance of the models. Results indicate that combining the stats feature with TF-IDF shows better performance and the support vector machine obtains the highest accuracy. Experimental results can potentially be used by other researchers to select appropriate models for the analysis of app reviews. In addition, the provided dataset is large, diverse, and balanced with eight categories and 59 app reviews and provides the opportunity to analyze reviews using state-of-the-art approaches. Springer US 2023-03-18 /pmc/articles/PMC10024295/ /pubmed/37362743 http://dx.doi.org/10.1007/s11042-023-14713-6 Text en © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2023, Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle	Article Qureshi, Ali Adil Ahmad, Maqsood Ullah, Saleem Yasir, Muhammad Naveed Rustam, Furqan Ashraf, Imran Performance evaluation of machine learning models on large dataset of android applications reviews
title	Performance evaluation of machine learning models on large dataset of android applications reviews
title_full	Performance evaluation of machine learning models on large dataset of android applications reviews
title_fullStr	Performance evaluation of machine learning models on large dataset of android applications reviews
title_full_unstemmed	Performance evaluation of machine learning models on large dataset of android applications reviews
title_short	Performance evaluation of machine learning models on large dataset of android applications reviews
title_sort	performance evaluation of machine learning models on large dataset of android applications reviews
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10024295/ https://www.ncbi.nlm.nih.gov/pubmed/37362743 http://dx.doi.org/10.1007/s11042-023-14713-6
work_keys_str_mv	AT qureshialiadil performanceevaluationofmachinelearningmodelsonlargedatasetofandroidapplicationsreviews AT ahmadmaqsood performanceevaluationofmachinelearningmodelsonlargedatasetofandroidapplicationsreviews AT ullahsaleem performanceevaluationofmachinelearningmodelsonlargedatasetofandroidapplicationsreviews AT yasirmuhammadnaveed performanceevaluationofmachinelearningmodelsonlargedatasetofandroidapplicationsreviews AT rustamfurqan performanceevaluationofmachinelearningmodelsonlargedatasetofandroidapplicationsreviews AT ashrafimran performanceevaluationofmachinelearningmodelsonlargedatasetofandroidapplicationsreviews

Performance evaluation of machine learning models on large dataset of android applications reviews

Ejemplares similares