Cargando…

Automatic classification experience of documents about Life Sciences and Biomedicine obtained in the Web of Science

This article provides an experience in the development and proof of a classification algorithm that automatically assigns a theme to a document. The Naïve Bayes Multinomial classification was used to automatically analyze the correlation between the themes of research in Life Sciences and Biomedicin...

Descripción completa

Detalles Bibliográficos
Autores principales: Polo Bautista, Luis Roberto, Polo Bautista, Israel
Formato: Online Artículo
Lenguaje:spa
Publicado: Instituto de Investigaciones Bibliotecológicas y de la Información 2022
Materias:
Acceso en línea:http://rev-ib.unam.mx/ib/index.php/ib/article/view/58607
https://dx.doi.org/10.22201/iibi.24488321xe.2022.93.58607
Descripción
Sumario:This article provides an experience in the development and proof of a classification algorithm that automatically assigns a theme to a document. The Naïve Bayes Multinomial classification was used to automatically analyze the correlation between the themes of research in Life Sciences and Biomedicine, and the result of a corpus of 10 167 articles recuperated from the Web of Science (WoS). A proof of the performance of the algorithm was applied to 5 581 reviews for measuring its precision, exhaustivity and accuracy. The results show its usefulness in the organization of information respect to the automatic assignation of themes to the documents obtained in a digital repository or a bibliographic data base. The algorithm proposed can be utilized as an alternative to the traditional methods of classification of documents in a specific area of knowledge; this will allow the creation of specialized services oriented to the development of computational services that support the digital and electronic information management.