Cargando…

Evaluating the High Risk Groups for Suicide: A Comparison of Logistic Regression, Support Vector Machine, Decision Tree and Artificial Neural Network

BACKGROUND: We aimed to assess the high-risk group for suicide using different classification methods includinglogistic regression (LR), decision tree (DT), artificial neural network (ANN), and support vector machine (SVM). METHODS: We used the dataset of a study conducted to predict risk factors of...

Descripción completa

Detalles Bibliográficos
Autores principales: AMINI, Payam, AHMADINIA, Hasan, POOROLAJAL, Jalal, MOQADDASI AMIRI, Mohammad
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Tehran University of Medical Sciences 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5149472/
https://www.ncbi.nlm.nih.gov/pubmed/27957463
_version_ 1782474009779109888
author AMINI, Payam
AHMADINIA, Hasan
POOROLAJAL, Jalal
MOQADDASI AMIRI, Mohammad
author_facet AMINI, Payam
AHMADINIA, Hasan
POOROLAJAL, Jalal
MOQADDASI AMIRI, Mohammad
author_sort AMINI, Payam
collection PubMed
description BACKGROUND: We aimed to assess the high-risk group for suicide using different classification methods includinglogistic regression (LR), decision tree (DT), artificial neural network (ANN), and support vector machine (SVM). METHODS: We used the dataset of a study conducted to predict risk factors of completed suicide in Hamadan Province, the west of Iran, in 2010. To evaluate the high-risk groups for suicide, LR, SVM, DT and ANN were performed. The applied methods were compared using sensitivity, specificity, positive predicted value, negative predicted value, accuracy and the area under curve. Cochran-Q test was implied to check differences in proportion among methods. To assess the association between the observed and predicted values, Ø coefficient, contingency coefficient, and Kendall tau-b were calculated. RESULTS: Gender, age, and job were the most important risk factors for fatal suicide attempts in common for four methods. SVM method showed the highest accuracy 0.68 and 0.67 for training and testing sample, respectively. However, this method resulted in the highest specificity (0.67 for training and 0.68 for testing sample) and the highest sensitivity for training sample (0.85), but the lowest sensitivity for the testing sample (0.53). Cochran-Q test resulted in differences between proportions in different methods (P<0.001). The association of SVM predictions and observed values, Ø coefficient, contingency coefficient, and Kendall tau-b were 0.239, 0.232 and 0.239, respectively. CONCLUSION: SVM had the best performance to classify fatal suicide attempts comparing to DT, LR and ANN.
format Online
Article
Text
id pubmed-5149472
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Tehran University of Medical Sciences
record_format MEDLINE/PubMed
spelling pubmed-51494722016-12-12 Evaluating the High Risk Groups for Suicide: A Comparison of Logistic Regression, Support Vector Machine, Decision Tree and Artificial Neural Network AMINI, Payam AHMADINIA, Hasan POOROLAJAL, Jalal MOQADDASI AMIRI, Mohammad Iran J Public Health Original Article BACKGROUND: We aimed to assess the high-risk group for suicide using different classification methods includinglogistic regression (LR), decision tree (DT), artificial neural network (ANN), and support vector machine (SVM). METHODS: We used the dataset of a study conducted to predict risk factors of completed suicide in Hamadan Province, the west of Iran, in 2010. To evaluate the high-risk groups for suicide, LR, SVM, DT and ANN were performed. The applied methods were compared using sensitivity, specificity, positive predicted value, negative predicted value, accuracy and the area under curve. Cochran-Q test was implied to check differences in proportion among methods. To assess the association between the observed and predicted values, Ø coefficient, contingency coefficient, and Kendall tau-b were calculated. RESULTS: Gender, age, and job were the most important risk factors for fatal suicide attempts in common for four methods. SVM method showed the highest accuracy 0.68 and 0.67 for training and testing sample, respectively. However, this method resulted in the highest specificity (0.67 for training and 0.68 for testing sample) and the highest sensitivity for training sample (0.85), but the lowest sensitivity for the testing sample (0.53). Cochran-Q test resulted in differences between proportions in different methods (P<0.001). The association of SVM predictions and observed values, Ø coefficient, contingency coefficient, and Kendall tau-b were 0.239, 0.232 and 0.239, respectively. CONCLUSION: SVM had the best performance to classify fatal suicide attempts comparing to DT, LR and ANN. Tehran University of Medical Sciences 2016-09 /pmc/articles/PMC5149472/ /pubmed/27957463 Text en Copyright© Iranian Public Health Association & Tehran University of Medical Sciences This work is licensed under a Creative Commons Attribution-NonCommercial 3.0 Unported License which allows users to read, copy, distribute and make derivative works for non-commercial purposes from the material, as long as the author of the original work is cited properly.
spellingShingle Original Article
AMINI, Payam
AHMADINIA, Hasan
POOROLAJAL, Jalal
MOQADDASI AMIRI, Mohammad
Evaluating the High Risk Groups for Suicide: A Comparison of Logistic Regression, Support Vector Machine, Decision Tree and Artificial Neural Network
title Evaluating the High Risk Groups for Suicide: A Comparison of Logistic Regression, Support Vector Machine, Decision Tree and Artificial Neural Network
title_full Evaluating the High Risk Groups for Suicide: A Comparison of Logistic Regression, Support Vector Machine, Decision Tree and Artificial Neural Network
title_fullStr Evaluating the High Risk Groups for Suicide: A Comparison of Logistic Regression, Support Vector Machine, Decision Tree and Artificial Neural Network
title_full_unstemmed Evaluating the High Risk Groups for Suicide: A Comparison of Logistic Regression, Support Vector Machine, Decision Tree and Artificial Neural Network
title_short Evaluating the High Risk Groups for Suicide: A Comparison of Logistic Regression, Support Vector Machine, Decision Tree and Artificial Neural Network
title_sort evaluating the high risk groups for suicide: a comparison of logistic regression, support vector machine, decision tree and artificial neural network
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5149472/
https://www.ncbi.nlm.nih.gov/pubmed/27957463
work_keys_str_mv AT aminipayam evaluatingthehighriskgroupsforsuicideacomparisonoflogisticregressionsupportvectormachinedecisiontreeandartificialneuralnetwork
AT ahmadiniahasan evaluatingthehighriskgroupsforsuicideacomparisonoflogisticregressionsupportvectormachinedecisiontreeandartificialneuralnetwork
AT poorolajaljalal evaluatingthehighriskgroupsforsuicideacomparisonoflogisticregressionsupportvectormachinedecisiontreeandartificialneuralnetwork
AT moqaddasiamirimohammad evaluatingthehighriskgroupsforsuicideacomparisonoflogisticregressionsupportvectormachinedecisiontreeandartificialneuralnetwork