Cargando…
Discrimination of indoor versus outdoor environmental state with machine learning algorithms in myopia observational studies
BACKGROUND: Wearable smart watches provide large amount of real-time data on the environmental state of the users and are useful to determine risk factors for onset and progression of myopia. We aim to evaluate the efficacy of machine learning algorithm in differentiating indoor and outdoor location...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6751881/ https://www.ncbi.nlm.nih.gov/pubmed/31533735 http://dx.doi.org/10.1186/s12967-019-2057-2 |
_version_ | 1783452705119797248 |
---|---|
author | Ye, Bin Liu, Kangping Cao, Siting Sankaridurg, Padmaja Li, Wayne Luan, Mengli Zhang, Bo Zhu, Jianfeng Zou, Haidong Xu, Xun He, Xiangui |
author_facet | Ye, Bin Liu, Kangping Cao, Siting Sankaridurg, Padmaja Li, Wayne Luan, Mengli Zhang, Bo Zhu, Jianfeng Zou, Haidong Xu, Xun He, Xiangui |
author_sort | Ye, Bin |
collection | PubMed |
description | BACKGROUND: Wearable smart watches provide large amount of real-time data on the environmental state of the users and are useful to determine risk factors for onset and progression of myopia. We aim to evaluate the efficacy of machine learning algorithm in differentiating indoor and outdoor locations as collected by use of smart watches. METHODS: Real time data on luminance, ultraviolet light levels and number of steps obtained with smart watches from dataset A: 12 adults from 8 scenes and manually recorded true locations. 70% of data was considered training set and support vector machine (SVM) algorithm generated using the variables to create a classification system. Data collected manually by the adults was the reference. The algorithm was used for predicting the location of the remaining 30% of dataset A. Accuracy was defined as the number of correct predictions divided by all. Similarly, data was corrected from dataset B: 172 children from 3 schools and 12 supervisors recorded true locations. Data collected by the supervisors was the reference. SVM model trained from dataset A was used to predict the location of dataset B for validation. Finally, we predicted the location of dataset B using the SVM model self-trained from dataset B. We repeated these three predictions with traditional univariate threshold segmentation method. RESULTS: In both datasets, SVM outperformed the univariate threshold segmentation method. In dataset A, the accuracy and AUC of SVM were 99.55% and 0.99 as compared to 95.11% and 0.95 with the univariate threshold segmentation (p < 0.01). In validation, the accuracy and AUC of SVM were 82.67% and 0.90 compared to 80.88% and 0.85 with the univariate threshold segmentation method (p < 0.01). In dataset B, the accuracy and AUC of SVM and AUC were 92.43% and 0.96 compared to 80.88% and 0.85 with the univariate threshold segmentation (p < 0.01). CONCLUSIONS: Machine learning algorithm allows for discrimination of outdoor versus indoor environments with high accuracy and provides an opportunity to study and determine the role of environmental risk factors in onset and progression of myopia. The accuracy of machine learning algorithm could be improved if the model is trained with the dataset itself. |
format | Online Article Text |
id | pubmed-6751881 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-67518812019-09-23 Discrimination of indoor versus outdoor environmental state with machine learning algorithms in myopia observational studies Ye, Bin Liu, Kangping Cao, Siting Sankaridurg, Padmaja Li, Wayne Luan, Mengli Zhang, Bo Zhu, Jianfeng Zou, Haidong Xu, Xun He, Xiangui J Transl Med Methodology BACKGROUND: Wearable smart watches provide large amount of real-time data on the environmental state of the users and are useful to determine risk factors for onset and progression of myopia. We aim to evaluate the efficacy of machine learning algorithm in differentiating indoor and outdoor locations as collected by use of smart watches. METHODS: Real time data on luminance, ultraviolet light levels and number of steps obtained with smart watches from dataset A: 12 adults from 8 scenes and manually recorded true locations. 70% of data was considered training set and support vector machine (SVM) algorithm generated using the variables to create a classification system. Data collected manually by the adults was the reference. The algorithm was used for predicting the location of the remaining 30% of dataset A. Accuracy was defined as the number of correct predictions divided by all. Similarly, data was corrected from dataset B: 172 children from 3 schools and 12 supervisors recorded true locations. Data collected by the supervisors was the reference. SVM model trained from dataset A was used to predict the location of dataset B for validation. Finally, we predicted the location of dataset B using the SVM model self-trained from dataset B. We repeated these three predictions with traditional univariate threshold segmentation method. RESULTS: In both datasets, SVM outperformed the univariate threshold segmentation method. In dataset A, the accuracy and AUC of SVM were 99.55% and 0.99 as compared to 95.11% and 0.95 with the univariate threshold segmentation (p < 0.01). In validation, the accuracy and AUC of SVM were 82.67% and 0.90 compared to 80.88% and 0.85 with the univariate threshold segmentation method (p < 0.01). In dataset B, the accuracy and AUC of SVM and AUC were 92.43% and 0.96 compared to 80.88% and 0.85 with the univariate threshold segmentation (p < 0.01). CONCLUSIONS: Machine learning algorithm allows for discrimination of outdoor versus indoor environments with high accuracy and provides an opportunity to study and determine the role of environmental risk factors in onset and progression of myopia. The accuracy of machine learning algorithm could be improved if the model is trained with the dataset itself. BioMed Central 2019-09-18 /pmc/articles/PMC6751881/ /pubmed/31533735 http://dx.doi.org/10.1186/s12967-019-2057-2 Text en © The Author(s) 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Ye, Bin Liu, Kangping Cao, Siting Sankaridurg, Padmaja Li, Wayne Luan, Mengli Zhang, Bo Zhu, Jianfeng Zou, Haidong Xu, Xun He, Xiangui Discrimination of indoor versus outdoor environmental state with machine learning algorithms in myopia observational studies |
title | Discrimination of indoor versus outdoor environmental state with machine learning algorithms in myopia observational studies |
title_full | Discrimination of indoor versus outdoor environmental state with machine learning algorithms in myopia observational studies |
title_fullStr | Discrimination of indoor versus outdoor environmental state with machine learning algorithms in myopia observational studies |
title_full_unstemmed | Discrimination of indoor versus outdoor environmental state with machine learning algorithms in myopia observational studies |
title_short | Discrimination of indoor versus outdoor environmental state with machine learning algorithms in myopia observational studies |
title_sort | discrimination of indoor versus outdoor environmental state with machine learning algorithms in myopia observational studies |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6751881/ https://www.ncbi.nlm.nih.gov/pubmed/31533735 http://dx.doi.org/10.1186/s12967-019-2057-2 |
work_keys_str_mv | AT yebin discriminationofindoorversusoutdoorenvironmentalstatewithmachinelearningalgorithmsinmyopiaobservationalstudies AT liukangping discriminationofindoorversusoutdoorenvironmentalstatewithmachinelearningalgorithmsinmyopiaobservationalstudies AT caositing discriminationofindoorversusoutdoorenvironmentalstatewithmachinelearningalgorithmsinmyopiaobservationalstudies AT sankaridurgpadmaja discriminationofindoorversusoutdoorenvironmentalstatewithmachinelearningalgorithmsinmyopiaobservationalstudies AT liwayne discriminationofindoorversusoutdoorenvironmentalstatewithmachinelearningalgorithmsinmyopiaobservationalstudies AT luanmengli discriminationofindoorversusoutdoorenvironmentalstatewithmachinelearningalgorithmsinmyopiaobservationalstudies AT zhangbo discriminationofindoorversusoutdoorenvironmentalstatewithmachinelearningalgorithmsinmyopiaobservationalstudies AT zhujianfeng discriminationofindoorversusoutdoorenvironmentalstatewithmachinelearningalgorithmsinmyopiaobservationalstudies AT zouhaidong discriminationofindoorversusoutdoorenvironmentalstatewithmachinelearningalgorithmsinmyopiaobservationalstudies AT xuxun discriminationofindoorversusoutdoorenvironmentalstatewithmachinelearningalgorithmsinmyopiaobservationalstudies AT hexiangui discriminationofindoorversusoutdoorenvironmentalstatewithmachinelearningalgorithmsinmyopiaobservationalstudies |