Cargando…
Discovery of knowledge using text mining techniques applied to textual documents of Brazilian police investigation
The aim of this study is to analyze how text mining techniques applied to textual documents of Brazilian police investigation can promote knowledge discovery. The research collected documents from the police investigation and submitted them to the text mining process. The study used the techniques o...
Autores principales: | , |
---|---|
Formato: | Online Artículo |
Lenguaje: | por |
Publicado: |
Instituto de Investigaciones Bibliotecológicas y de la Información
2021
|
Materias: | |
Acceso en línea: | http://rev-ib.unam.mx/ib/index.php/ib/article/view/58389 https://dx.doi.org/10.22201/iibi.24488321xe.2021.88.58389 |
_version_ | 1780761264348200960 |
---|---|
author | Silva, Marcio Ponciano da Viera, Angel Freddy Godoy |
author_facet | Silva, Marcio Ponciano da Viera, Angel Freddy Godoy |
author_sort | Silva, Marcio Ponciano da |
collection | Investigación Bibliotecológica: archivonomía, bibliotecología e información |
description | The aim of this study is to analyze how text mining techniques applied to textual documents of Brazilian police investigation can promote knowledge discovery. The research collected documents from the police investigation and submitted them to the text mining process. The study used the techniques of case folding, tokenization, custom stopwords, bag of words and TF-IDF in order to extract results in ngrams. The results were presented with word clouds. In the research, k-means were used to cluster the sets of trigrams, identifying in each clusters the most representative terms of the clusters. The use of text mining techniques on these documents was intended to extract non-trivial knowledge. The techniques of text mining, or discovery of knowledge in a textual database, have the purpose of discovering unobservable patterns when analyzed by human manipulation of large volumes of documents. The results found favored the discovery of knowledge in the identification of entities and connections, as well as thematic categories of the investigation. |
format | Online Article |
id | oai_unam-bibliotecologica-article-58389 |
institution | Universidad Nacional Autónoma de México |
language | por |
publishDate | 2021 |
publisher | Instituto de Investigaciones Bibliotecológicas y de la Información |
record_format | ojs |
spelling | oai_unam-bibliotecologica-article-583892021-10-21T17:04:16Z Discovery of knowledge using text mining techniques applied to textual documents of Brazilian police investigation Descubrimiento de conocimientos mediante técnicas de minería de textos aplicadas a documentos textuales de la investigación policial brasileña Descoberta de conhecimento com uso de técnicas de mineração de textos aplicadas em documentos textuais da investigação policial brasileira Silva, Marcio Ponciano da Viera, Angel Freddy Godoy Investigación Policial Descubrimiento del Conocimiento Extracción de Textos Investigação Policial Descoberta de Conhecimento Mineração de Textos The aim of this study is to analyze how text mining techniques applied to textual documents of Brazilian police investigation can promote knowledge discovery. The research collected documents from the police investigation and submitted them to the text mining process. The study used the techniques of case folding, tokenization, custom stopwords, bag of words and TF-IDF in order to extract results in ngrams. The results were presented with word clouds. In the research, k-means were used to cluster the sets of trigrams, identifying in each clusters the most representative terms of the clusters. The use of text mining techniques on these documents was intended to extract non-trivial knowledge. The techniques of text mining, or discovery of knowledge in a textual database, have the purpose of discovering unobservable patterns when analyzed by human manipulation of large volumes of documents. The results found favored the discovery of knowledge in the identification of entities and connections, as well as thematic categories of the investigation. El objetivo de este estudio es analizar cómo las técnicas de minería de textos aplicadas a documentos textuales de la investigación policial brasileña pueden promover el descubrimiento de conocimiento. La investigación recopiló documentos de la investigación policial y los sometió al proceso de minería de textos. El estudio utilizó las técnicas de plegado de casos, tokenización, palabras vacías personalizadas, bolsa de palabras y TF-IDF para extraer los resultados en ngramas. Los resultados se presentaron con nubes de palabras. En la investigación, se utilizaron k-medias para agrupar los conjuntos de trigramas, identificando en cada grupo los términos más representativos de los grupos. El uso de técnicas de minería de textos en estos documentos tenía como objetivo extraer conocimientos no triviales. Las técnicas de minería de texto, o descubrimiento de conocimiento en una base de datos textual, tienen el propósito de descubrir patrones inobservables cuando se analizan mediante manipulación humana de grandes volúmenes de documentos. Los resultados encontrados favorecieron el descubrimiento de conocimientos en la identificación de entidades y conexiones, así como categorías temáticas de la investigación. O objetivo deste estudo é analisar como técnicas de mineração de textos aplicadas em documentos textuais da investigação policial brasileira pode promover descoberta de conhecimento. A pesquisa coletou documentos da investigação policial e submeteu ao processo de mineração de textos. O estudo utilizou as técnicas de case folding, tokenização, stopwords personalizada, bag of words e TF-IDF para extrair resultados em n-grams. Os resultados foram apresentados com word clouds. Na pesquisa foi usado o k-means para clusterizar os conjuntos de trigramas, identificando em cada clusters os termos mais representativos dos clusters. O uso de técnicas de mineração de texto sobre esses documentos teve como propósito a extração de conhecimento não trivial. As técnicas de mineração de texto, ou descoberta de conhecimento em base de dados textual, tem a finalidade de descobrir padrões não observáveis quando analisados por manipulação humana de grande volume de documentos. Os resultados encontrados favoreceram a descoberta de conhecimentos na identificação de entidades e conexões, como também categorias temáticas da investigação. Instituto de Investigaciones Bibliotecológicas y de la Información 2021-08-29 info:eu-repo/semantics/article info:eu-repo/semantics/publishedVersion application/pdf text/html http://rev-ib.unam.mx/ib/index.php/ib/article/view/58389 10.22201/iibi.24488321xe.2021.88.58389 Investigación Bibliotecológica. Archivonomía, bibliotecología e información; Vol. 35 No. 88 (2021); 161-183- Investigación Bibliotecológica: archivonomía, bibliotecología e información; Vol. 35 Núm. 88 (2021); 161-183- Investigación Bibliotecológica: archivonomía, bibliotecología e información; v. 35 n. 88 (2021); 161-183- 2448-8321 0187-358X 10.22201/iibi.24488321xe.2021.88 por http://rev-ib.unam.mx/ib/index.php/ib/article/view/58389/52183 http://rev-ib.unam.mx/ib/index.php/ib/article/view/58389/52225 Derechos de autor 2021 Investigación Bibliotecológica: archivonomía, bibliotecología e información |
spellingShingle | Investigación Policial Descubrimiento del Conocimiento Extracción de Textos Investigação Policial Descoberta de Conhecimento Mineração de Textos Silva, Marcio Ponciano da Viera, Angel Freddy Godoy Discovery of knowledge using text mining techniques applied to textual documents of Brazilian police investigation |
title | Discovery of knowledge using text mining techniques applied to textual documents of Brazilian police investigation |
title_alt | Descubrimiento de conocimientos mediante técnicas de minería de textos aplicadas a documentos textuales de la investigación policial brasileña Descoberta de conhecimento com uso de técnicas de mineração de textos aplicadas em documentos textuais da investigação policial brasileira |
title_full | Discovery of knowledge using text mining techniques applied to textual documents of Brazilian police investigation |
title_fullStr | Discovery of knowledge using text mining techniques applied to textual documents of Brazilian police investigation |
title_full_unstemmed | Discovery of knowledge using text mining techniques applied to textual documents of Brazilian police investigation |
title_short | Discovery of knowledge using text mining techniques applied to textual documents of Brazilian police investigation |
title_sort | discovery of knowledge using text mining techniques applied to textual documents of brazilian police investigation |
topic | Investigación Policial Descubrimiento del Conocimiento Extracción de Textos Investigação Policial Descoberta de Conhecimento Mineração de Textos |
topic_facet | Investigación Policial Descubrimiento del Conocimiento Extracción de Textos Investigação Policial Descoberta de Conhecimento Mineração de Textos |
url | http://rev-ib.unam.mx/ib/index.php/ib/article/view/58389 https://dx.doi.org/10.22201/iibi.24488321xe.2021.88.58389 |
work_keys_str_mv | AT silvamarcioponcianoda discoveryofknowledgeusingtextminingtechniquesappliedtotextualdocumentsofbrazilianpoliceinvestigation AT vieraangelfreddygodoy discoveryofknowledgeusingtextminingtechniquesappliedtotextualdocumentsofbrazilianpoliceinvestigation AT silvamarcioponcianoda descubrimientodeconocimientosmediantetecnicasdemineriadetextosaplicadasadocumentostextualesdelainvestigacionpolicialbrasilena AT vieraangelfreddygodoy descubrimientodeconocimientosmediantetecnicasdemineriadetextosaplicadasadocumentostextualesdelainvestigacionpolicialbrasilena AT silvamarcioponcianoda descobertadeconhecimentocomusodetecnicasdemineracaodetextosaplicadasemdocumentostextuaisdainvestigacaopolicialbrasileira AT vieraangelfreddygodoy descobertadeconhecimentocomusodetecnicasdemineracaodetextosaplicadasemdocumentostextuaisdainvestigacaopolicialbrasileira |