Cargando…

Semantic Academic Profiler (SAP): a framework for researcher assessment based on semantic topic modeling

Recent efforts have focused on identifying multidisciplinary teams and detecting co-Authorship Networks based on exploring topic modeling to identify researchers’ expertise. Though promising, none of these efforts perform a real-life evaluation of the quality of the built topics. This paper proposes...

Descripción completa

Detalles Bibliográficos
Autores principales: Viegas, Felipe, Pereira, Antônio, Cecílio, Pablo, Tuler, Elisa, Meira, Wagner, Gonçalves, Marcos, Rocha, Leonardo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9273922/
https://www.ncbi.nlm.nih.gov/pubmed/35844248
http://dx.doi.org/10.1007/s11192-022-04449-9
_version_ 1784745199711289344
author Viegas, Felipe
Pereira, Antônio
Cecílio, Pablo
Tuler, Elisa
Meira, Wagner
Gonçalves, Marcos
Rocha, Leonardo
author_facet Viegas, Felipe
Pereira, Antônio
Cecílio, Pablo
Tuler, Elisa
Meira, Wagner
Gonçalves, Marcos
Rocha, Leonardo
author_sort Viegas, Felipe
collection PubMed
description Recent efforts have focused on identifying multidisciplinary teams and detecting co-Authorship Networks based on exploring topic modeling to identify researchers’ expertise. Though promising, none of these efforts perform a real-life evaluation of the quality of the built topics. This paper proposes a Semantic Academic Profiler (SAP) framework that allows summarizing articles written by researchers to automatically build research profiles and perform online evaluations regarding these built profiles. SAP exploits and extends state-of-the-art Topic Modeling strategies based on Cluwords considering n-grams and introduces a new visual interface able to highlight the main topics related to articles, researchers and institutions. To evaluate SAP’s capability of summarizing the profile of such entities as well as its usefulness for supporting online assessments of the topics’ quality, we perform and contrast two types of evaluation, considering an extensive repository of Brazilian curricula vitae: (1) an offline evaluation, in which we exploit a traditional metric (NPMI) to measure the quality of several data representations strategies including (i) TFIDF, (ii) TFIDF with Bi-grams, (iii) Cluwords, and (iv) CluWords with Bi-grams; and (2) an online evaluation through an A/B test where researchers evaluate their own built profiles. We also perform an online assessment of SAP user interface through a usability test following the SUS methodology. Our experiments indicate that the CluWords with Bi-grams is the best solution and the SAP interface is very useful. We also observed essential differences in the online and offline assessments, indicating that using both together is very important for a comprehensive quality evaluation. Such type of study is scarce in the literature and our findings open space for new lines of investigation in the Topic Modeling area.
format Online
Article
Text
id pubmed-9273922
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-92739222022-07-12 Semantic Academic Profiler (SAP): a framework for researcher assessment based on semantic topic modeling Viegas, Felipe Pereira, Antônio Cecílio, Pablo Tuler, Elisa Meira, Wagner Gonçalves, Marcos Rocha, Leonardo Scientometrics Article Recent efforts have focused on identifying multidisciplinary teams and detecting co-Authorship Networks based on exploring topic modeling to identify researchers’ expertise. Though promising, none of these efforts perform a real-life evaluation of the quality of the built topics. This paper proposes a Semantic Academic Profiler (SAP) framework that allows summarizing articles written by researchers to automatically build research profiles and perform online evaluations regarding these built profiles. SAP exploits and extends state-of-the-art Topic Modeling strategies based on Cluwords considering n-grams and introduces a new visual interface able to highlight the main topics related to articles, researchers and institutions. To evaluate SAP’s capability of summarizing the profile of such entities as well as its usefulness for supporting online assessments of the topics’ quality, we perform and contrast two types of evaluation, considering an extensive repository of Brazilian curricula vitae: (1) an offline evaluation, in which we exploit a traditional metric (NPMI) to measure the quality of several data representations strategies including (i) TFIDF, (ii) TFIDF with Bi-grams, (iii) Cluwords, and (iv) CluWords with Bi-grams; and (2) an online evaluation through an A/B test where researchers evaluate their own built profiles. We also perform an online assessment of SAP user interface through a usability test following the SUS methodology. Our experiments indicate that the CluWords with Bi-grams is the best solution and the SAP interface is very useful. We also observed essential differences in the online and offline assessments, indicating that using both together is very important for a comprehensive quality evaluation. Such type of study is scarce in the literature and our findings open space for new lines of investigation in the Topic Modeling area. Springer International Publishing 2022-07-11 2022 /pmc/articles/PMC9273922/ /pubmed/35844248 http://dx.doi.org/10.1007/s11192-022-04449-9 Text en © Akadémiai Kiadó, Budapest, Hungary 2022 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Viegas, Felipe
Pereira, Antônio
Cecílio, Pablo
Tuler, Elisa
Meira, Wagner
Gonçalves, Marcos
Rocha, Leonardo
Semantic Academic Profiler (SAP): a framework for researcher assessment based on semantic topic modeling
title Semantic Academic Profiler (SAP): a framework for researcher assessment based on semantic topic modeling
title_full Semantic Academic Profiler (SAP): a framework for researcher assessment based on semantic topic modeling
title_fullStr Semantic Academic Profiler (SAP): a framework for researcher assessment based on semantic topic modeling
title_full_unstemmed Semantic Academic Profiler (SAP): a framework for researcher assessment based on semantic topic modeling
title_short Semantic Academic Profiler (SAP): a framework for researcher assessment based on semantic topic modeling
title_sort semantic academic profiler (sap): a framework for researcher assessment based on semantic topic modeling
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9273922/
https://www.ncbi.nlm.nih.gov/pubmed/35844248
http://dx.doi.org/10.1007/s11192-022-04449-9
work_keys_str_mv AT viegasfelipe semanticacademicprofilersapaframeworkforresearcherassessmentbasedonsemantictopicmodeling
AT pereiraantonio semanticacademicprofilersapaframeworkforresearcherassessmentbasedonsemantictopicmodeling
AT ceciliopablo semanticacademicprofilersapaframeworkforresearcherassessmentbasedonsemantictopicmodeling
AT tulerelisa semanticacademicprofilersapaframeworkforresearcherassessmentbasedonsemantictopicmodeling
AT meirawagner semanticacademicprofilersapaframeworkforresearcherassessmentbasedonsemantictopicmodeling
AT goncalvesmarcos semanticacademicprofilersapaframeworkforresearcherassessmentbasedonsemantictopicmodeling
AT rochaleonardo semanticacademicprofilersapaframeworkforresearcherassessmentbasedonsemantictopicmodeling