Manipulating measurement scales in medical statistical analysis and data mining: A review of methodologies

BACKGROUND: Selecting the correct statistical test and data mining method depends heavily on the measurement scale of the data, the type of variables, and the purpose of the analysis. Different measurement scales are studied in detail, and statistical comparison, modeling, and data mining methods are studied bas...

Bibliographic Details
Main Authors: Marateb, Hamid Reza; Mansourian, Marjan; Adibi, Peyman; Farina, Dario
Format: Online Article Text
Language: English
Published: Medknow Publications & Media Pvt Ltd 2014
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3963323/
https://www.ncbi.nlm.nih.gov/pubmed/24672565
_version_ 1782308494650638336
author Marateb, Hamid Reza
Mansourian, Marjan
Adibi, Peyman
Farina, Dario
collection PubMed
description BACKGROUND: Selecting the correct statistical test and data mining method depends heavily on the measurement scale of the data, the type of variables, and the purpose of the analysis. Different measurement scales are studied in detail, and statistical comparison, modeling, and data mining methods are studied using several medical examples. We present two clustering examples with ordinal variables, which are more challenging to analyze, using the Wisconsin Breast Cancer Data (WBCD). ORDINAL-TO-INTERVAL SCALE CONVERSION EXAMPLE: A breast cancer database of nine 10-level ordinal variables for 683 patients was analyzed by two ordinal-scale clustering methods. The performance of the clustering methods was assessed by comparison with the gold standard groups of malignant and benign cases that had been identified by clinical tests. RESULTS: The sensitivity and accuracy of the two clustering methods were 98% and 96%, respectively. Their specificity was comparable. CONCLUSION: Using a clustering algorithm appropriate to the measurement scale of the variables in the study can achieve high performance. Moreover, descriptive and inferential statistics, as well as the modeling approach, must be selected based on the scale of the variables.
format Online
Article
Text
id pubmed-3963323
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Medknow Publications & Media Pvt Ltd
record_format MEDLINE/PubMed
spelling pubmed-3963323 2014-03-26 Manipulating measurement scales in medical statistical analysis and data mining: A review of methodologies Marateb, Hamid Reza; Mansourian, Marjan; Adibi, Peyman; Farina, Dario J Res Med Sci Review Article Medknow Publications & Media Pvt Ltd 2014-01 /pmc/articles/PMC3963323/ /pubmed/24672565 Text en Copyright: © Journal of Research in Medical Sciences http://creativecommons.org/licenses/by-nc-sa/3.0 This is an open-access article distributed under the terms of the Creative Commons Attribution-Noncommercial-Share Alike 3.0 Unported, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title Manipulating measurement scales in medical statistical analysis and data mining: A review of methodologies
topic Review Article