
Discovery of knowledge using text mining techniques applied to textual documents of Brazilian police investigation

The aim of this study is to analyze how text mining techniques applied to textual documents of Brazilian police investigation can promote knowledge discovery. The research collected documents from the police investigation and submitted them to the text mining process. The study used the techniques o...


Bibliographic Details
Main Authors: Silva, Marcio Ponciano da, Viera, Angel Freddy Godoy
Format: Online Article
Language: por
Published: Instituto de Investigaciones Bibliotecológicas y de la Información 2021
Subjects:
Online Access: http://rev-ib.unam.mx/ib/index.php/ib/article/view/58389
https://dx.doi.org/10.22201/iibi.24488321xe.2021.88.58389
_version_ 1780761264348200960
author Silva, Marcio Ponciano da
Viera, Angel Freddy Godoy
author_facet Silva, Marcio Ponciano da
Viera, Angel Freddy Godoy
author_sort Silva, Marcio Ponciano da
collection Investigación Bibliotecológica: archivonomía, bibliotecología e información
description The aim of this study is to analyze how text mining techniques applied to textual documents of Brazilian police investigation can promote knowledge discovery. The research collected documents from the police investigation and submitted them to the text mining process. The study used case folding, tokenization, custom stopwords, bag of words, and TF-IDF to extract results as n-grams. The results were presented as word clouds. K-means was used to cluster the sets of trigrams and to identify the most representative terms in each cluster. The use of text mining techniques on these documents was intended to extract non-trivial knowledge. Text mining, or knowledge discovery in a textual database, aims to uncover patterns that cannot be observed when large volumes of documents are analyzed manually. The results favored knowledge discovery by identifying entities and connections, as well as thematic categories of the investigation.
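The pipeline named in the description (case folding, tokenization, custom stopwords, bag of words, TF-IDF over trigrams, then k-means) can be illustrated with a minimal sketch. This is not the authors' implementation: the record does not give their code, stopword list, or parameters. The sample sentences, the `STOPWORDS` set, the smoothed IDF formula, the choice of the first k rows as initial centroids, and all function names below are assumptions made for illustration only.

```python
import math
import re
from collections import Counter

# Hypothetical stopword list; the study's custom list is not part of this record.
STOPWORDS = {"the", "a", "to", "of", "in", "and", "was", "by"}

# Invented sample sentences (not data from the study), ordered so that the
# first two rows belong to different themes.
DOCS = [
    "The suspect transferred money to the offshore account",
    "The witness saw the stolen vehicle near the warehouse",
    "A courier transferred money to the offshore account",
    "Police found the stolen vehicle near the warehouse",
]

def trigrams(text):
    """Case-fold, tokenize, drop stopwords, and return word trigrams."""
    tokens = [t for t in re.findall(r"\w+", text.casefold()) if t not in STOPWORDS]
    return [" ".join(tokens[i:i + 3]) for i in range(len(tokens) - 2)]

def tfidf(docs):
    """Return (vocabulary, matrix) of TF-IDF weights over trigram bags of words."""
    bags = [Counter(trigrams(d)) for d in docs]           # bag of words per document
    vocab = sorted(set().union(*bags))
    df = Counter(term for bag in bags for term in bag)    # document frequency
    n = len(docs)
    # Assumed weighting: raw count times a smoothed IDF, ln((1 + n) / (1 + df)) + 1.
    matrix = [[bag[t] * (math.log((1 + n) / (1 + df[t])) + 1) for t in vocab]
              for bag in bags]
    return vocab, matrix

def kmeans(rows, k, iterations=20):
    """Plain Lloyd's k-means; centroids start at the first k rows (a simplification)."""
    centroids = [list(r) for r in rows[:k]]
    labels = [0] * len(rows)
    for _ in range(iterations):
        # Assignment step: nearest centroid by squared Euclidean distance.
        labels = [
            min(range(k), key=lambda c: sum((x - y) ** 2 for x, y in zip(row, centroids[c])))
            for row in rows
        ]
        # Update step: each centroid becomes the mean of its members.
        for c in range(k):
            members = [row for row, lab in zip(rows, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids

def top_terms(vocab, centroid, n=3):
    """Trigrams with the highest centroid weight, ties broken alphabetically."""
    ranked = sorted(zip(centroid, vocab), key=lambda pair: (-pair[0], pair[1]))
    return [term for weight, term in ranked[:n] if weight > 0]

vocab, matrix = tfidf(DOCS)
labels, centroids = kmeans(matrix, k=2)
for cluster_id, centroid in enumerate(centroids):
    print(cluster_id, top_terms(vocab, centroid, 2))
```

In this sketch, the highest-weighted trigrams of each centroid stand in for the "most representative terms" of a cluster. A production pipeline would more likely rely on an established library and a randomized, repeated centroid initialization.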
format Online
Article
id oai_unam-bibliotecologica-article-58389
institution Universidad Nacional Autónoma de México
language por
publishDate 2021
publisher Instituto de Investigaciones Bibliotecológicas y de la Información
record_format ojs
spelling oai_unam-bibliotecologica-article-583892021-10-21T17:04:16Z Discovery of knowledge using text mining techniques applied to textual documents of Brazilian police investigation Descubrimiento de conocimientos mediante técnicas de minería de textos aplicadas a documentos textuales de la investigación policial brasileña Descoberta de conhecimento com uso de técnicas de mineração de textos aplicadas em documentos textuais da investigação policial brasileira Silva, Marcio Ponciano da Viera, Angel Freddy Godoy Investigación Policial Descubrimiento del Conocimiento Extracción de Textos Investigação Policial Descoberta de Conhecimento Mineração de Textos The aim of this study is to analyze how text mining techniques applied to textual documents of Brazilian police investigation can promote knowledge discovery. The research collected documents from the police investigation and submitted them to the text mining process. The study used the techniques of case folding, tokenization, custom stopwords, bag of words and TF-IDF in order to extract results in ngrams. The results were presented with word clouds. In the research, k-means were used to cluster the sets of trigrams, identifying in each clusters the most representative terms of the clusters. The use of text mining techniques on these documents was intended to extract non-trivial knowledge. The techniques of text mining, or discovery of knowledge in a textual database, have the purpose of discovering unobservable patterns when analyzed by human manipulation of large volumes of documents. The results found favored the discovery of knowledge in the identification of entities and connections, as well as thematic categories of the investigation. El objetivo de este estudio es analizar cómo las técnicas de minería de textos aplicadas a documentos textuales de la investigación policial brasileña pueden promover el descubrimiento de conocimiento. 
La investigación recopiló documentos de la investigación policial y los sometió al proceso de minería de textos. El estudio utilizó las técnicas de plegado de casos, tokenización, palabras vacías personalizadas, bolsa de palabras y TF-IDF para extraer los resultados en ngramas. Los resultados se presentaron con nubes de palabras. En la investigación, se utilizaron k-medias para agrupar los conjuntos de trigramas, identificando en cada grupo los términos más representativos de los grupos. El uso de técnicas de minería de textos en estos documentos tenía como objetivo extraer conocimientos no triviales. Las técnicas de minería de texto, o descubrimiento de conocimiento en una base de datos textual, tienen el propósito de descubrir patrones inobservables cuando se analizan mediante manipulación humana de grandes volúmenes de documentos. Los resultados encontrados favorecieron el descubrimiento de conocimientos en la identificación de entidades y conexiones, así como categorías temáticas de la investigación. O objetivo deste estudo é analisar como técnicas de mineração de textos aplicadas em documentos textuais da investigação policial brasileira pode promover descoberta de conhecimento. A pesquisa coletou documentos da investigação policial e submeteu ao processo de mineração de textos. O estudo utilizou as técnicas de case folding, tokenização, stopwords personalizada, bag of words e TF-IDF para extrair resultados em n-grams. Os resultados foram apresentados com word clouds. Na pesquisa foi usado o k-means para clusterizar os conjuntos de trigramas, identificando em cada clusters os termos mais representativos dos clusters. O uso de técnicas de mineração de texto sobre esses documentos teve como propósito a extração de conhecimento não trivial. As técnicas de mineração de texto, ou descoberta de conhecimento em base de dados textual, tem a finalidade de descobrir padrões não observáveis quando analisados por manipulação humana de grande volume de documentos. 
Os resultados encontrados favoreceram a descoberta de conhecimentos na identificação de entidades e conexões, como também categorias temáticas da investigação. Instituto de Investigaciones Bibliotecológicas y de la Información 2021-08-29 info:eu-repo/semantics/article info:eu-repo/semantics/publishedVersion application/pdf text/html http://rev-ib.unam.mx/ib/index.php/ib/article/view/58389 10.22201/iibi.24488321xe.2021.88.58389 Investigación Bibliotecológica. Archivonomía, bibliotecología e información; Vol. 35 No. 88 (2021); 161-183- Investigación Bibliotecológica: archivonomía, bibliotecología e información; Vol. 35 Núm. 88 (2021); 161-183- Investigación Bibliotecológica: archivonomía, bibliotecología e información; v. 35 n. 88 (2021); 161-183- 2448-8321 0187-358X 10.22201/iibi.24488321xe.2021.88 por http://rev-ib.unam.mx/ib/index.php/ib/article/view/58389/52183 http://rev-ib.unam.mx/ib/index.php/ib/article/view/58389/52225 Derechos de autor 2021 Investigación Bibliotecológica: archivonomía, bibliotecología e información
spellingShingle Investigación Policial
Descubrimiento del Conocimiento
Extracción de Textos
Investigação Policial
Descoberta de Conhecimento
Mineração de Textos
Silva, Marcio Ponciano da
Viera, Angel Freddy Godoy
Discovery of knowledge using text mining techniques applied to textual documents of Brazilian police investigation
title Discovery of knowledge using text mining techniques applied to textual documents of Brazilian police investigation
title_alt Descubrimiento de conocimientos mediante técnicas de minería de textos aplicadas a documentos textuales de la investigación policial brasileña
Descoberta de conhecimento com uso de técnicas de mineração de textos aplicadas em documentos textuais da investigação policial brasileira
title_full Discovery of knowledge using text mining techniques applied to textual documents of Brazilian police investigation
title_fullStr Discovery of knowledge using text mining techniques applied to textual documents of Brazilian police investigation
title_full_unstemmed Discovery of knowledge using text mining techniques applied to textual documents of Brazilian police investigation
title_short Discovery of knowledge using text mining techniques applied to textual documents of Brazilian police investigation
title_sort discovery of knowledge using text mining techniques applied to textual documents of brazilian police investigation
topic Investigación Policial
Descubrimiento del Conocimiento
Extracción de Textos
Investigação Policial
Descoberta de Conhecimento
Mineração de Textos
topic_facet Investigación Policial
Descubrimiento del Conocimiento
Extracción de Textos
Investigação Policial
Descoberta de Conhecimento
Mineração de Textos
url http://rev-ib.unam.mx/ib/index.php/ib/article/view/58389
https://dx.doi.org/10.22201/iibi.24488321xe.2021.88.58389
work_keys_str_mv AT silvamarcioponcianoda discoveryofknowledgeusingtextminingtechniquesappliedtotextualdocumentsofbrazilianpoliceinvestigation
AT vieraangelfreddygodoy discoveryofknowledgeusingtextminingtechniquesappliedtotextualdocumentsofbrazilianpoliceinvestigation
AT silvamarcioponcianoda descubrimientodeconocimientosmediantetecnicasdemineriadetextosaplicadasadocumentostextualesdelainvestigacionpolicialbrasilena
AT vieraangelfreddygodoy descubrimientodeconocimientosmediantetecnicasdemineriadetextosaplicadasadocumentostextualesdelainvestigacionpolicialbrasilena
AT silvamarcioponcianoda descobertadeconhecimentocomusodetecnicasdemineracaodetextosaplicadasemdocumentostextuaisdainvestigacaopolicialbrasileira
AT vieraangelfreddygodoy descobertadeconhecimentocomusodetecnicasdemineracaodetextosaplicadasemdocumentostextuaisdainvestigacaopolicialbrasileira