Cargando…

Use of Machine Learning to Detect Wildlife Product Promotion and Sales on Twitter

Social media is an important channel for communication, information dissemination, and social interaction, but also provides opportunities to illicitly sell goods online, including the trade of wildlife products. In this study, we use the Twitter public application programming interface (API) to acc...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Qing, Li, Jiawei, Cai, Mingxiang, Mackey, Tim K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7931875/
https://www.ncbi.nlm.nih.gov/pubmed/33693351
http://dx.doi.org/10.3389/fdata.2019.00028
_version_ 1783660372634370048
author Xu, Qing
Li, Jiawei
Cai, Mingxiang
Mackey, Tim K.
author_facet Xu, Qing
Li, Jiawei
Cai, Mingxiang
Mackey, Tim K.
author_sort Xu, Qing
collection PubMed
description Social media is an important channel for communication, information dissemination, and social interaction, but also provides opportunities to illicitly sell goods online, including the trade of wildlife products. In this study, we use the Twitter public application programming interface (API) to access Twitter messages in order to detect and classify suspicious wildlife trafficking and sale using an unsupervised machine learning topic model combined with keyword filtering and manual annotation. We choose two prohibited wildlife animals and related products: elephant ivory and pangolin, and collected tweets containing keywords and known code words related to these species. In total, we collected 138,357 tweets filtered for these keywords over a 14-day period and were able to identify 53 tweets from 38 unique users that we suspect promoted the sale of Ivory products, though no pangolin related promoted post were detected in this study. Study results show that machine learning combined with supplement analysis approaches such as those utilized in this study have the potential to detect illegal content without the use of an existing training data set. If developed further, these approaches can help technology companies, conservation groups, and law enforcement officials to expedite the process of identifying illegal online sales and stem supply for the billion-dollar criminal industry of online wildlife trafficking.
format Online
Article
Text
id pubmed-7931875
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-79318752021-03-09 Use of Machine Learning to Detect Wildlife Product Promotion and Sales on Twitter Xu, Qing Li, Jiawei Cai, Mingxiang Mackey, Tim K. Front Big Data Big Data Social media is an important channel for communication, information dissemination, and social interaction, but also provides opportunities to illicitly sell goods online, including the trade of wildlife products. In this study, we use the Twitter public application programming interface (API) to access Twitter messages in order to detect and classify suspicious wildlife trafficking and sale using an unsupervised machine learning topic model combined with keyword filtering and manual annotation. We choose two prohibited wildlife animals and related products: elephant ivory and pangolin, and collected tweets containing keywords and known code words related to these species. In total, we collected 138,357 tweets filtered for these keywords over a 14-day period and were able to identify 53 tweets from 38 unique users that we suspect promoted the sale of Ivory products, though no pangolin related promoted post were detected in this study. Study results show that machine learning combined with supplement analysis approaches such as those utilized in this study have the potential to detect illegal content without the use of an existing training data set. If developed further, these approaches can help technology companies, conservation groups, and law enforcement officials to expedite the process of identifying illegal online sales and stem supply for the billion-dollar criminal industry of online wildlife trafficking. Frontiers Media S.A. 2019-08-27 /pmc/articles/PMC7931875/ /pubmed/33693351 http://dx.doi.org/10.3389/fdata.2019.00028 Text en Copyright © 2019 Xu, Li, Cai and Mackey. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Big Data
Xu, Qing
Li, Jiawei
Cai, Mingxiang
Mackey, Tim K.
Use of Machine Learning to Detect Wildlife Product Promotion and Sales on Twitter
title Use of Machine Learning to Detect Wildlife Product Promotion and Sales on Twitter
title_full Use of Machine Learning to Detect Wildlife Product Promotion and Sales on Twitter
title_fullStr Use of Machine Learning to Detect Wildlife Product Promotion and Sales on Twitter
title_full_unstemmed Use of Machine Learning to Detect Wildlife Product Promotion and Sales on Twitter
title_short Use of Machine Learning to Detect Wildlife Product Promotion and Sales on Twitter
title_sort use of machine learning to detect wildlife product promotion and sales on twitter
topic Big Data
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7931875/
https://www.ncbi.nlm.nih.gov/pubmed/33693351
http://dx.doi.org/10.3389/fdata.2019.00028
work_keys_str_mv AT xuqing useofmachinelearningtodetectwildlifeproductpromotionandsalesontwitter
AT lijiawei useofmachinelearningtodetectwildlifeproductpromotionandsalesontwitter
AT caimingxiang useofmachinelearningtodetectwildlifeproductpromotionandsalesontwitter
AT mackeytimk useofmachinelearningtodetectwildlifeproductpromotionandsalesontwitter