Cargando…
Machine learning for morbid glomerular hypertrophy
A practical research method integrating data-driven machine learning with conventional model-driven statistics is sought after in medicine. Although glomerular hypertrophy (or a large renal corpuscle) on renal biopsy has pathophysiological implications, it is often misdiagnosed as adaptive/compensat...
Autores principales: | , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9646707/ https://www.ncbi.nlm.nih.gov/pubmed/36351996 http://dx.doi.org/10.1038/s41598-022-23882-7 |
_version_ | 1784827227044577280 |
---|---|
author | Ushio, Yusuke Kataoka, Hiroshi Iwadoh, Kazuhiro Ohara, Mamiko Suzuki, Tomo Hirata, Maiko Manabe, Shun Kawachi, Keiko Akihisa, Taro Makabe, Shiho Sato, Masayo Iwasa, Naomi Yoshida, Rie Hoshino, Junichi Mochizuki, Toshio Tsuchiya, Ken Nitta, Kosaku |
author_facet | Ushio, Yusuke Kataoka, Hiroshi Iwadoh, Kazuhiro Ohara, Mamiko Suzuki, Tomo Hirata, Maiko Manabe, Shun Kawachi, Keiko Akihisa, Taro Makabe, Shiho Sato, Masayo Iwasa, Naomi Yoshida, Rie Hoshino, Junichi Mochizuki, Toshio Tsuchiya, Ken Nitta, Kosaku |
author_sort | Ushio, Yusuke |
collection | PubMed |
description | A practical research method integrating data-driven machine learning with conventional model-driven statistics is sought after in medicine. Although glomerular hypertrophy (or a large renal corpuscle) on renal biopsy has pathophysiological implications, it is often misdiagnosed as adaptive/compensatory hypertrophy. Using a generative machine learning method, we aimed to explore the factors associated with a maximal glomerular diameter of ≥ 242.3 μm. Using the frequency-of-usage variable ranking in generative models, we defined the machine learning scores with symbolic regression via genetic programming (SR via GP). We compared important variables selected by SR with those selected by a point-biserial correlation coefficient using multivariable logistic and linear regressions to validate discriminatory ability, goodness-of-fit, and collinearity. Body mass index, complement component C3, serum total protein, arteriolosclerosis, C-reactive protein, and the Oxford E1 score were ranked among the top 10 variables with high machine learning scores using SR via GP, while the estimated glomerular filtration rate was ranked 46 among the 60 variables. In multivariable analyses, the R(2) value was higher (0.61 vs. 0.45), and the corrected Akaike Information Criterion value was lower (402.7 vs. 417.2) with variables selected with SR than those selected with point-biserial r. There were two variables with variance inflation factors higher than 5 in those using point-biserial r and none in SR. Data-driven machine learning models may be useful in identifying significant and insignificant correlated factors. Our method may be generalized to other medical research due to the procedural simplicity of using top-ranked variables selected by machine learning. |
format | Online Article Text |
id | pubmed-9646707 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-96467072022-11-15 Machine learning for morbid glomerular hypertrophy Ushio, Yusuke Kataoka, Hiroshi Iwadoh, Kazuhiro Ohara, Mamiko Suzuki, Tomo Hirata, Maiko Manabe, Shun Kawachi, Keiko Akihisa, Taro Makabe, Shiho Sato, Masayo Iwasa, Naomi Yoshida, Rie Hoshino, Junichi Mochizuki, Toshio Tsuchiya, Ken Nitta, Kosaku Sci Rep Article A practical research method integrating data-driven machine learning with conventional model-driven statistics is sought after in medicine. Although glomerular hypertrophy (or a large renal corpuscle) on renal biopsy has pathophysiological implications, it is often misdiagnosed as adaptive/compensatory hypertrophy. Using a generative machine learning method, we aimed to explore the factors associated with a maximal glomerular diameter of ≥ 242.3 μm. Using the frequency-of-usage variable ranking in generative models, we defined the machine learning scores with symbolic regression via genetic programming (SR via GP). We compared important variables selected by SR with those selected by a point-biserial correlation coefficient using multivariable logistic and linear regressions to validate discriminatory ability, goodness-of-fit, and collinearity. Body mass index, complement component C3, serum total protein, arteriolosclerosis, C-reactive protein, and the Oxford E1 score were ranked among the top 10 variables with high machine learning scores using SR via GP, while the estimated glomerular filtration rate was ranked 46 among the 60 variables. In multivariable analyses, the R(2) value was higher (0.61 vs. 0.45), and the corrected Akaike Information Criterion value was lower (402.7 vs. 417.2) with variables selected with SR than those selected with point-biserial r. There were two variables with variance inflation factors higher than 5 in those using point-biserial r and none in SR. Data-driven machine learning models may be useful in identifying significant and insignificant correlated factors. Our method may be generalized to other medical research due to the procedural simplicity of using top-ranked variables selected by machine learning. Nature Publishing Group UK 2022-11-09 /pmc/articles/PMC9646707/ /pubmed/36351996 http://dx.doi.org/10.1038/s41598-022-23882-7 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Ushio, Yusuke Kataoka, Hiroshi Iwadoh, Kazuhiro Ohara, Mamiko Suzuki, Tomo Hirata, Maiko Manabe, Shun Kawachi, Keiko Akihisa, Taro Makabe, Shiho Sato, Masayo Iwasa, Naomi Yoshida, Rie Hoshino, Junichi Mochizuki, Toshio Tsuchiya, Ken Nitta, Kosaku Machine learning for morbid glomerular hypertrophy |
title | Machine learning for morbid glomerular hypertrophy |
title_full | Machine learning for morbid glomerular hypertrophy |
title_fullStr | Machine learning for morbid glomerular hypertrophy |
title_full_unstemmed | Machine learning for morbid glomerular hypertrophy |
title_short | Machine learning for morbid glomerular hypertrophy |
title_sort | machine learning for morbid glomerular hypertrophy |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9646707/ https://www.ncbi.nlm.nih.gov/pubmed/36351996 http://dx.doi.org/10.1038/s41598-022-23882-7 |
work_keys_str_mv | AT ushioyusuke machinelearningformorbidglomerularhypertrophy AT kataokahiroshi machinelearningformorbidglomerularhypertrophy AT iwadohkazuhiro machinelearningformorbidglomerularhypertrophy AT oharamamiko machinelearningformorbidglomerularhypertrophy AT suzukitomo machinelearningformorbidglomerularhypertrophy AT hiratamaiko machinelearningformorbidglomerularhypertrophy AT manabeshun machinelearningformorbidglomerularhypertrophy AT kawachikeiko machinelearningformorbidglomerularhypertrophy AT akihisataro machinelearningformorbidglomerularhypertrophy AT makabeshiho machinelearningformorbidglomerularhypertrophy AT satomasayo machinelearningformorbidglomerularhypertrophy AT iwasanaomi machinelearningformorbidglomerularhypertrophy AT yoshidarie machinelearningformorbidglomerularhypertrophy AT hoshinojunichi machinelearningformorbidglomerularhypertrophy AT mochizukitoshio machinelearningformorbidglomerularhypertrophy AT tsuchiyaken machinelearningformorbidglomerularhypertrophy AT nittakosaku machinelearningformorbidglomerularhypertrophy |