Cargando…

Performance improvement of machine learning techniques predicting the association of exacerbation of peak expiratory flow ratio with short term exposure level to indoor air quality using adult asthmatics clustered data

Large-scale data sources, remote sensing technologies, and superior computing power have tremendously benefitted to environmental health study. Recently, various machine-learning algorithms were introduced to provide mechanistic insights about the heterogeneity of clustered data pertaining to the sy...

Descripción completa

Detalles Bibliográficos
Autores principales: Bae, Wan D., Kim, Sungroul, Park, Choon-Sik, Alkobaisi, Shayma, Lee, Jongwon, Seo, Wonseok, Park, Jong Sook, Park, Sujung, Lee, Sangwoon, Lee, Jong Wook
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7790419/
https://www.ncbi.nlm.nih.gov/pubmed/33411771
http://dx.doi.org/10.1371/journal.pone.0244233
_version_ 1783633421655867392
author Bae, Wan D.
Kim, Sungroul
Park, Choon-Sik
Alkobaisi, Shayma
Lee, Jongwon
Seo, Wonseok
Park, Jong Sook
Park, Sujung
Lee, Sangwoon
Lee, Jong Wook
author_facet Bae, Wan D.
Kim, Sungroul
Park, Choon-Sik
Alkobaisi, Shayma
Lee, Jongwon
Seo, Wonseok
Park, Jong Sook
Park, Sujung
Lee, Sangwoon
Lee, Jong Wook
author_sort Bae, Wan D.
collection PubMed
description Large-scale data sources, remote sensing technologies, and superior computing power have tremendously benefitted to environmental health study. Recently, various machine-learning algorithms were introduced to provide mechanistic insights about the heterogeneity of clustered data pertaining to the symptoms of each asthma patient and potential environmental risk factors. However, there is limited information on the performance of these machine learning tools. In this study, we compared the performance of ten machine-learning techniques. Using an advanced method of imbalanced sampling (IS), we improved the performance of nine conventional machine learning techniques predicting the association between exposure level to indoor air quality and change in patients’ peak expiratory flow rate (PEFR). We then proposed a deep learning method of transfer learning (TL) for further improvement in prediction accuracy. Our selected final prediction techniques (TL1_IS or TL2-IS) achieved a balanced accuracy median (interquartile range) of 66(56~76) % for TL1_IS and 68(63~78) % for TL2_IS. Precision levels for TL1_IS and TL2_IS were 68(62~72) % and 66(62~69) % while sensitivity levels were 58(50~67) % and 59(51~80) % from 25 patients which were approximately 1.08 (accuracy, precision) to 1.28 (sensitivity) times increased in terms of performance outcomes, compared to NN_IS. Our results indicate that the transfer machine learning technique with imbalanced sampling is a powerful tool to predict the change in PEFR due to exposure to indoor air including the concentration of particulate matter of 2.5 μm and carbon dioxide. This modeling technique is even applicable with small-sized or imbalanced dataset, which represents a personalized, real-world setting.
format Online
Article
Text
id pubmed-7790419
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-77904192021-01-27 Performance improvement of machine learning techniques predicting the association of exacerbation of peak expiratory flow ratio with short term exposure level to indoor air quality using adult asthmatics clustered data Bae, Wan D. Kim, Sungroul Park, Choon-Sik Alkobaisi, Shayma Lee, Jongwon Seo, Wonseok Park, Jong Sook Park, Sujung Lee, Sangwoon Lee, Jong Wook PLoS One Research Article Large-scale data sources, remote sensing technologies, and superior computing power have tremendously benefitted to environmental health study. Recently, various machine-learning algorithms were introduced to provide mechanistic insights about the heterogeneity of clustered data pertaining to the symptoms of each asthma patient and potential environmental risk factors. However, there is limited information on the performance of these machine learning tools. In this study, we compared the performance of ten machine-learning techniques. Using an advanced method of imbalanced sampling (IS), we improved the performance of nine conventional machine learning techniques predicting the association between exposure level to indoor air quality and change in patients’ peak expiratory flow rate (PEFR). We then proposed a deep learning method of transfer learning (TL) for further improvement in prediction accuracy. Our selected final prediction techniques (TL1_IS or TL2-IS) achieved a balanced accuracy median (interquartile range) of 66(56~76) % for TL1_IS and 68(63~78) % for TL2_IS. Precision levels for TL1_IS and TL2_IS were 68(62~72) % and 66(62~69) % while sensitivity levels were 58(50~67) % and 59(51~80) % from 25 patients which were approximately 1.08 (accuracy, precision) to 1.28 (sensitivity) times increased in terms of performance outcomes, compared to NN_IS. Our results indicate that the transfer machine learning technique with imbalanced sampling is a powerful tool to predict the change in PEFR due to exposure to indoor air including the concentration of particulate matter of 2.5 μm and carbon dioxide. This modeling technique is even applicable with small-sized or imbalanced dataset, which represents a personalized, real-world setting. Public Library of Science 2021-01-07 /pmc/articles/PMC7790419/ /pubmed/33411771 http://dx.doi.org/10.1371/journal.pone.0244233 Text en © 2021 Bae et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Bae, Wan D.
Kim, Sungroul
Park, Choon-Sik
Alkobaisi, Shayma
Lee, Jongwon
Seo, Wonseok
Park, Jong Sook
Park, Sujung
Lee, Sangwoon
Lee, Jong Wook
Performance improvement of machine learning techniques predicting the association of exacerbation of peak expiratory flow ratio with short term exposure level to indoor air quality using adult asthmatics clustered data
title Performance improvement of machine learning techniques predicting the association of exacerbation of peak expiratory flow ratio with short term exposure level to indoor air quality using adult asthmatics clustered data
title_full Performance improvement of machine learning techniques predicting the association of exacerbation of peak expiratory flow ratio with short term exposure level to indoor air quality using adult asthmatics clustered data
title_fullStr Performance improvement of machine learning techniques predicting the association of exacerbation of peak expiratory flow ratio with short term exposure level to indoor air quality using adult asthmatics clustered data
title_full_unstemmed Performance improvement of machine learning techniques predicting the association of exacerbation of peak expiratory flow ratio with short term exposure level to indoor air quality using adult asthmatics clustered data
title_short Performance improvement of machine learning techniques predicting the association of exacerbation of peak expiratory flow ratio with short term exposure level to indoor air quality using adult asthmatics clustered data
title_sort performance improvement of machine learning techniques predicting the association of exacerbation of peak expiratory flow ratio with short term exposure level to indoor air quality using adult asthmatics clustered data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7790419/
https://www.ncbi.nlm.nih.gov/pubmed/33411771
http://dx.doi.org/10.1371/journal.pone.0244233
work_keys_str_mv AT baewand performanceimprovementofmachinelearningtechniquespredictingtheassociationofexacerbationofpeakexpiratoryflowratiowithshorttermexposureleveltoindoorairqualityusingadultasthmaticsclustereddata
AT kimsungroul performanceimprovementofmachinelearningtechniquespredictingtheassociationofexacerbationofpeakexpiratoryflowratiowithshorttermexposureleveltoindoorairqualityusingadultasthmaticsclustereddata
AT parkchoonsik performanceimprovementofmachinelearningtechniquespredictingtheassociationofexacerbationofpeakexpiratoryflowratiowithshorttermexposureleveltoindoorairqualityusingadultasthmaticsclustereddata
AT alkobaisishayma performanceimprovementofmachinelearningtechniquespredictingtheassociationofexacerbationofpeakexpiratoryflowratiowithshorttermexposureleveltoindoorairqualityusingadultasthmaticsclustereddata
AT leejongwon performanceimprovementofmachinelearningtechniquespredictingtheassociationofexacerbationofpeakexpiratoryflowratiowithshorttermexposureleveltoindoorairqualityusingadultasthmaticsclustereddata
AT seowonseok performanceimprovementofmachinelearningtechniquespredictingtheassociationofexacerbationofpeakexpiratoryflowratiowithshorttermexposureleveltoindoorairqualityusingadultasthmaticsclustereddata
AT parkjongsook performanceimprovementofmachinelearningtechniquespredictingtheassociationofexacerbationofpeakexpiratoryflowratiowithshorttermexposureleveltoindoorairqualityusingadultasthmaticsclustereddata
AT parksujung performanceimprovementofmachinelearningtechniquespredictingtheassociationofexacerbationofpeakexpiratoryflowratiowithshorttermexposureleveltoindoorairqualityusingadultasthmaticsclustereddata
AT leesangwoon performanceimprovementofmachinelearningtechniquespredictingtheassociationofexacerbationofpeakexpiratoryflowratiowithshorttermexposureleveltoindoorairqualityusingadultasthmaticsclustereddata
AT leejongwook performanceimprovementofmachinelearningtechniquespredictingtheassociationofexacerbationofpeakexpiratoryflowratiowithshorttermexposureleveltoindoorairqualityusingadultasthmaticsclustereddata