Cargando…

The Importance of Nonlinear Transformations Use in Medical Data Analysis

BACKGROUND: The accumulation of data and its accessibility through easier-to-use platforms will allow data scientists and practitioners who are less sophisticated data analysts to get answers by using big data for many purposes in multiple ways. Data scientists working with medical data are aware of...

Descripción completa

Detalles Bibliográficos
Autores principales: Shachar, Netta, Mitelpunkt, Alexis, Kozlovski, Tal, Galili, Tal, Frostig, Tzviel, Brill, Barak, Marcus-Kalish, Mira, Benjamini, Yoav
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5970282/
https://www.ncbi.nlm.nih.gov/pubmed/29752251
http://dx.doi.org/10.2196/medinform.7992
_version_ 1783326093399293952
author Shachar, Netta
Mitelpunkt, Alexis
Kozlovski, Tal
Galili, Tal
Frostig, Tzviel
Brill, Barak
Marcus-Kalish, Mira
Benjamini, Yoav
author_facet Shachar, Netta
Mitelpunkt, Alexis
Kozlovski, Tal
Galili, Tal
Frostig, Tzviel
Brill, Barak
Marcus-Kalish, Mira
Benjamini, Yoav
author_sort Shachar, Netta
collection PubMed
description BACKGROUND: The accumulation of data and its accessibility through easier-to-use platforms will allow data scientists and practitioners who are less sophisticated data analysts to get answers by using big data for many purposes in multiple ways. Data scientists working with medical data are aware of the importance of preprocessing, yet in many cases, the potential benefits of using nonlinear transformations is overlooked. OBJECTIVE: Our aim is to present a semi-automated approach of symmetry-aiming transformations tailored for medical data analysis and its advantages. METHODS: We describe 10 commonly encountered data types used in the medical field and the relevant transformations for each data type. Data from the Alzheimer’s Disease Neuroimaging Initiative study, Parkinson’s disease hospital cohort, and disease-simulating data were used to demonstrate the approach and its benefits. RESULTS: Symmetry-targeted monotone transformations were applied, and the advantages gained in variance, stability, linearity, and clustering are demonstrated. An open source application implementing the described methods was developed. Both linearity of relationships and increase of stability of variability improved after applying proper nonlinear transformation. Clustering simulated nonsymmetric data gave low agreement to the generating clusters (Rand value=0.681), while capturing the original structure after applying nonlinear transformation to symmetry (Rand value=0.986). CONCLUSIONS: This work presents the use of nonlinear transformations for medical data and the importance of their semi-automated choice. Using the described approach, the data analyst increases the ability to create simpler, more robust and translational models, thereby facilitating the interpretation and implementation of the analysis by medical practitioners. Applying nonlinear transformations as part of the preprocessing is essential to the quality and interpretability of results.
format Online
Article
Text
id pubmed-5970282
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-59702822018-06-01 The Importance of Nonlinear Transformations Use in Medical Data Analysis Shachar, Netta Mitelpunkt, Alexis Kozlovski, Tal Galili, Tal Frostig, Tzviel Brill, Barak Marcus-Kalish, Mira Benjamini, Yoav JMIR Med Inform Original Paper BACKGROUND: The accumulation of data and its accessibility through easier-to-use platforms will allow data scientists and practitioners who are less sophisticated data analysts to get answers by using big data for many purposes in multiple ways. Data scientists working with medical data are aware of the importance of preprocessing, yet in many cases, the potential benefits of using nonlinear transformations is overlooked. OBJECTIVE: Our aim is to present a semi-automated approach of symmetry-aiming transformations tailored for medical data analysis and its advantages. METHODS: We describe 10 commonly encountered data types used in the medical field and the relevant transformations for each data type. Data from the Alzheimer’s Disease Neuroimaging Initiative study, Parkinson’s disease hospital cohort, and disease-simulating data were used to demonstrate the approach and its benefits. RESULTS: Symmetry-targeted monotone transformations were applied, and the advantages gained in variance, stability, linearity, and clustering are demonstrated. An open source application implementing the described methods was developed. Both linearity of relationships and increase of stability of variability improved after applying proper nonlinear transformation. Clustering simulated nonsymmetric data gave low agreement to the generating clusters (Rand value=0.681), while capturing the original structure after applying nonlinear transformation to symmetry (Rand value=0.986). CONCLUSIONS: This work presents the use of nonlinear transformations for medical data and the importance of their semi-automated choice. Using the described approach, the data analyst increases the ability to create simpler, more robust and translational models, thereby facilitating the interpretation and implementation of the analysis by medical practitioners. Applying nonlinear transformations as part of the preprocessing is essential to the quality and interpretability of results. JMIR Publications 2018-05-11 /pmc/articles/PMC5970282/ /pubmed/29752251 http://dx.doi.org/10.2196/medinform.7992 Text en ©Netta Shachar, Alexis Mitelpunkt, Tal Kozlovski, Tal Galili, Tzviel Frostig, Barak Brill, Mira Marcus-Kalish, Yoav Benjamini. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 11.05.2018. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on http://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Shachar, Netta
Mitelpunkt, Alexis
Kozlovski, Tal
Galili, Tal
Frostig, Tzviel
Brill, Barak
Marcus-Kalish, Mira
Benjamini, Yoav
The Importance of Nonlinear Transformations Use in Medical Data Analysis
title The Importance of Nonlinear Transformations Use in Medical Data Analysis
title_full The Importance of Nonlinear Transformations Use in Medical Data Analysis
title_fullStr The Importance of Nonlinear Transformations Use in Medical Data Analysis
title_full_unstemmed The Importance of Nonlinear Transformations Use in Medical Data Analysis
title_short The Importance of Nonlinear Transformations Use in Medical Data Analysis
title_sort importance of nonlinear transformations use in medical data analysis
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5970282/
https://www.ncbi.nlm.nih.gov/pubmed/29752251
http://dx.doi.org/10.2196/medinform.7992
work_keys_str_mv AT shacharnetta theimportanceofnonlineartransformationsuseinmedicaldataanalysis
AT mitelpunktalexis theimportanceofnonlineartransformationsuseinmedicaldataanalysis
AT kozlovskital theimportanceofnonlineartransformationsuseinmedicaldataanalysis
AT galilital theimportanceofnonlineartransformationsuseinmedicaldataanalysis
AT frostigtzviel theimportanceofnonlineartransformationsuseinmedicaldataanalysis
AT brillbarak theimportanceofnonlineartransformationsuseinmedicaldataanalysis
AT marcuskalishmira theimportanceofnonlineartransformationsuseinmedicaldataanalysis
AT benjaminiyoav theimportanceofnonlineartransformationsuseinmedicaldataanalysis
AT shacharnetta importanceofnonlineartransformationsuseinmedicaldataanalysis
AT mitelpunktalexis importanceofnonlineartransformationsuseinmedicaldataanalysis
AT kozlovskital importanceofnonlineartransformationsuseinmedicaldataanalysis
AT galilital importanceofnonlineartransformationsuseinmedicaldataanalysis
AT frostigtzviel importanceofnonlineartransformationsuseinmedicaldataanalysis
AT brillbarak importanceofnonlineartransformationsuseinmedicaldataanalysis
AT marcuskalishmira importanceofnonlineartransformationsuseinmedicaldataanalysis
AT benjaminiyoav importanceofnonlineartransformationsuseinmedicaldataanalysis