Cargando…

Deception detection with machine learning: A systematic review and statistical analysis

Several studies applying Machine Learning to deception detection have been published in the last decade. A rich and complex set of settings, approaches, theories, and results is now available. Therefore, one may find it difficult to identify trends, successful paths, gaps, and opportunities for cont...

Descripción completa

Detalles Bibliográficos
Autores principales: Constâncio, Alex Sebastião, Tsunoda, Denise Fukumi, Silva, Helena de Fátima Nunes, da Silveira, Jocelaine Martins, Carvalho, Deborah Ribeiro
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9910662/
https://www.ncbi.nlm.nih.gov/pubmed/36757928
http://dx.doi.org/10.1371/journal.pone.0281323
_version_ 1784884830455988224
author Constâncio, Alex Sebastião
Tsunoda, Denise Fukumi
Silva, Helena de Fátima Nunes
da Silveira, Jocelaine Martins
Carvalho, Deborah Ribeiro
author_facet Constâncio, Alex Sebastião
Tsunoda, Denise Fukumi
Silva, Helena de Fátima Nunes
da Silveira, Jocelaine Martins
Carvalho, Deborah Ribeiro
author_sort Constâncio, Alex Sebastião
collection PubMed
description Several studies applying Machine Learning to deception detection have been published in the last decade. A rich and complex set of settings, approaches, theories, and results is now available. Therefore, one may find it difficult to identify trends, successful paths, gaps, and opportunities for contribution. The present literature review aims to provide the state of research regarding deception detection with Machine Learning. We followed the PRISMA protocol and retrieved 648 articles from ACM Digital Library, IEEE Xplore, Scopus, and Web of Science. 540 of them were screened (108 were duplicates). A final corpus of 81 documents has been summarized as mind maps. Metadata was extracted and has been encoded as Python dictionaries to support a statistical analysis scripted in Python programming language, and available as a collection of Jupyter Lab Notebooks in a GitHub repository. All are available as Jupyter Lab Notebooks. Neural Networks, Support Vector Machines, Random Forest, Decision Tree and K-nearest Neighbor are the five most explored techniques. The studies report a detection performance ranging from 51% to 100%, with 19 works reaching accuracy rate above 0.9. Monomodal, Bimodal, and Multimodal approaches were exploited and achieved various accuracy levels for detection. Bimodal and Multimodal approaches have become a trend over Monomodal ones, although there are high-performance examples of the latter. Studies that exploit language and linguistic features, 75% are dedicated to English. The findings include observations of the following: language and culture, emotional features, psychological traits, cognitive load, facial cues, complexity, performance, and Machine Learning topics. We also present a dataset benchmark. Main conclusions are that labeled datasets from real-life data are scarce. Also, there is still room for new approaches for deception detection with Machine Learning, especially if focused on languages and cultures other than English-based. Further research would greatly contribute by providing new labeled and multimodal datasets for deception detection, both for English and other languages.
format Online
Article
Text
id pubmed-9910662
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-99106622023-02-10 Deception detection with machine learning: A systematic review and statistical analysis Constâncio, Alex Sebastião Tsunoda, Denise Fukumi Silva, Helena de Fátima Nunes da Silveira, Jocelaine Martins Carvalho, Deborah Ribeiro PLoS One Research Article Several studies applying Machine Learning to deception detection have been published in the last decade. A rich and complex set of settings, approaches, theories, and results is now available. Therefore, one may find it difficult to identify trends, successful paths, gaps, and opportunities for contribution. The present literature review aims to provide the state of research regarding deception detection with Machine Learning. We followed the PRISMA protocol and retrieved 648 articles from ACM Digital Library, IEEE Xplore, Scopus, and Web of Science. 540 of them were screened (108 were duplicates). A final corpus of 81 documents has been summarized as mind maps. Metadata was extracted and has been encoded as Python dictionaries to support a statistical analysis scripted in Python programming language, and available as a collection of Jupyter Lab Notebooks in a GitHub repository. All are available as Jupyter Lab Notebooks. Neural Networks, Support Vector Machines, Random Forest, Decision Tree and K-nearest Neighbor are the five most explored techniques. The studies report a detection performance ranging from 51% to 100%, with 19 works reaching accuracy rate above 0.9. Monomodal, Bimodal, and Multimodal approaches were exploited and achieved various accuracy levels for detection. Bimodal and Multimodal approaches have become a trend over Monomodal ones, although there are high-performance examples of the latter. Studies that exploit language and linguistic features, 75% are dedicated to English. The findings include observations of the following: language and culture, emotional features, psychological traits, cognitive load, facial cues, complexity, performance, and Machine Learning topics. We also present a dataset benchmark. Main conclusions are that labeled datasets from real-life data are scarce. Also, there is still room for new approaches for deception detection with Machine Learning, especially if focused on languages and cultures other than English-based. Further research would greatly contribute by providing new labeled and multimodal datasets for deception detection, both for English and other languages. Public Library of Science 2023-02-09 /pmc/articles/PMC9910662/ /pubmed/36757928 http://dx.doi.org/10.1371/journal.pone.0281323 Text en © 2023 Constâncio et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Constâncio, Alex Sebastião
Tsunoda, Denise Fukumi
Silva, Helena de Fátima Nunes
da Silveira, Jocelaine Martins
Carvalho, Deborah Ribeiro
Deception detection with machine learning: A systematic review and statistical analysis
title Deception detection with machine learning: A systematic review and statistical analysis
title_full Deception detection with machine learning: A systematic review and statistical analysis
title_fullStr Deception detection with machine learning: A systematic review and statistical analysis
title_full_unstemmed Deception detection with machine learning: A systematic review and statistical analysis
title_short Deception detection with machine learning: A systematic review and statistical analysis
title_sort deception detection with machine learning: a systematic review and statistical analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9910662/
https://www.ncbi.nlm.nih.gov/pubmed/36757928
http://dx.doi.org/10.1371/journal.pone.0281323
work_keys_str_mv AT constancioalexsebastiao deceptiondetectionwithmachinelearningasystematicreviewandstatisticalanalysis
AT tsunodadenisefukumi deceptiondetectionwithmachinelearningasystematicreviewandstatisticalanalysis
AT silvahelenadefatimanunes deceptiondetectionwithmachinelearningasystematicreviewandstatisticalanalysis
AT dasilveirajocelainemartins deceptiondetectionwithmachinelearningasystematicreviewandstatisticalanalysis
AT carvalhodeborahribeiro deceptiondetectionwithmachinelearningasystematicreviewandstatisticalanalysis