Cargando…
Manipulating measurement scales in medical statistical analysis and data mining: A review of methodologies
BACKGROUND: selecting the correct statistical test and data mining method depends highly on the measurement scale of data, type of variables, and purpose of the analysis. Different measurement scales are studied in details and statistical comparison, modeling, and data mining methods are studied bas...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Medknow Publications & Media Pvt Ltd
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3963323/ https://www.ncbi.nlm.nih.gov/pubmed/24672565 |
_version_ | 1782308494650638336 |
---|---|
author | Marateb, Hamid Reza Mansourian, Marjan Adibi, Peyman Farina, Dario |
author_facet | Marateb, Hamid Reza Mansourian, Marjan Adibi, Peyman Farina, Dario |
author_sort | Marateb, Hamid Reza |
collection | PubMed |
description | BACKGROUND: selecting the correct statistical test and data mining method depends highly on the measurement scale of data, type of variables, and purpose of the analysis. Different measurement scales are studied in details and statistical comparison, modeling, and data mining methods are studied based upon using several medical examples. We have presented two ordinal–variables clustering examples, as more challenging variable in analysis, using Wisconsin Breast Cancer Data (WBCD). ORDINAL-TO-INTERVAL SCALE CONVERSION EXAMPLE: a breast cancer database of nine 10-level ordinal variables for 683 patients was analyzed by two ordinal-scale clustering methods. The performance of the clustering methods was assessed by comparison with the gold standard groups of malignant and benign cases that had been identified by clinical tests. RESULTS: the sensitivity and accuracy of the two clustering methods were 98% and 96%, respectively. Their specificity was comparable. CONCLUSION: by using appropriate clustering algorithm based on the measurement scale of the variables in the study, high performance is granted. Moreover, descriptive and inferential statistics in addition to modeling approach must be selected based on the scale of the variables. |
format | Online Article Text |
id | pubmed-3963323 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Medknow Publications & Media Pvt Ltd |
record_format | MEDLINE/PubMed |
spelling | pubmed-39633232014-03-26 Manipulating measurement scales in medical statistical analysis and data mining: A review of methodologies Marateb, Hamid Reza Mansourian, Marjan Adibi, Peyman Farina, Dario J Res Med Sci Review Article BACKGROUND: selecting the correct statistical test and data mining method depends highly on the measurement scale of data, type of variables, and purpose of the analysis. Different measurement scales are studied in details and statistical comparison, modeling, and data mining methods are studied based upon using several medical examples. We have presented two ordinal–variables clustering examples, as more challenging variable in analysis, using Wisconsin Breast Cancer Data (WBCD). ORDINAL-TO-INTERVAL SCALE CONVERSION EXAMPLE: a breast cancer database of nine 10-level ordinal variables for 683 patients was analyzed by two ordinal-scale clustering methods. The performance of the clustering methods was assessed by comparison with the gold standard groups of malignant and benign cases that had been identified by clinical tests. RESULTS: the sensitivity and accuracy of the two clustering methods were 98% and 96%, respectively. Their specificity was comparable. CONCLUSION: by using appropriate clustering algorithm based on the measurement scale of the variables in the study, high performance is granted. Moreover, descriptive and inferential statistics in addition to modeling approach must be selected based on the scale of the variables. Medknow Publications & Media Pvt Ltd 2014-01 /pmc/articles/PMC3963323/ /pubmed/24672565 Text en Copyright: © Journal of Research in Medical Sciences http://creativecommons.org/licenses/by-nc-sa/3.0 This is an open-access article distributed under the terms of the Creative Commons Attribution-Noncommercial-Share Alike 3.0 Unported, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Review Article Marateb, Hamid Reza Mansourian, Marjan Adibi, Peyman Farina, Dario Manipulating measurement scales in medical statistical analysis and data mining: A review of methodologies |
title | Manipulating measurement scales in medical statistical analysis and data mining: A review of methodologies |
title_full | Manipulating measurement scales in medical statistical analysis and data mining: A review of methodologies |
title_fullStr | Manipulating measurement scales in medical statistical analysis and data mining: A review of methodologies |
title_full_unstemmed | Manipulating measurement scales in medical statistical analysis and data mining: A review of methodologies |
title_short | Manipulating measurement scales in medical statistical analysis and data mining: A review of methodologies |
title_sort | manipulating measurement scales in medical statistical analysis and data mining: a review of methodologies |
topic | Review Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3963323/ https://www.ncbi.nlm.nih.gov/pubmed/24672565 |
work_keys_str_mv | AT maratebhamidreza manipulatingmeasurementscalesinmedicalstatisticalanalysisanddataminingareviewofmethodologies AT mansourianmarjan manipulatingmeasurementscalesinmedicalstatisticalanalysisanddataminingareviewofmethodologies AT adibipeyman manipulatingmeasurementscalesinmedicalstatisticalanalysisanddataminingareviewofmethodologies AT farinadario manipulatingmeasurementscalesinmedicalstatisticalanalysisanddataminingareviewofmethodologies |