Cargando…

Unsupervised machine learning for identifying important visual features through bag-of-words using histopathology data from chronic kidney disease

Pathologists use visual classification to assess patient kidney biopsy samples when diagnosing the underlying cause of kidney disease. However, the assessment is qualitative, or semi-quantitative at best, and reproducibility is challenging. To discover previously unknown features which predict patie...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Joonsang, Warner, Elisa, Shaikhouni, Salma, Bitzer, Markus, Kretzler, Matthias, Gipson, Debbie, Pennathur, Subramaniam, Bellovich, Keith, Bhat, Zeenat, Gadegbeku, Crystal, Massengill, Susan, Perumal, Kalyani, Saha, Jharna, Yang, Yingbao, Luo, Jinghui, Zhang, Xin, Mariani, Laura, Hodgin, Jeffrey B., Rao, Arvind
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8941143/
https://www.ncbi.nlm.nih.gov/pubmed/35318420
http://dx.doi.org/10.1038/s41598-022-08974-8
_version_ 1784673046746890240
author Lee, Joonsang
Warner, Elisa
Shaikhouni, Salma
Bitzer, Markus
Kretzler, Matthias
Gipson, Debbie
Pennathur, Subramaniam
Bellovich, Keith
Bhat, Zeenat
Gadegbeku, Crystal
Massengill, Susan
Perumal, Kalyani
Saha, Jharna
Yang, Yingbao
Luo, Jinghui
Zhang, Xin
Mariani, Laura
Hodgin, Jeffrey B.
Rao, Arvind
author_facet Lee, Joonsang
Warner, Elisa
Shaikhouni, Salma
Bitzer, Markus
Kretzler, Matthias
Gipson, Debbie
Pennathur, Subramaniam
Bellovich, Keith
Bhat, Zeenat
Gadegbeku, Crystal
Massengill, Susan
Perumal, Kalyani
Saha, Jharna
Yang, Yingbao
Luo, Jinghui
Zhang, Xin
Mariani, Laura
Hodgin, Jeffrey B.
Rao, Arvind
author_sort Lee, Joonsang
collection PubMed
description Pathologists use visual classification to assess patient kidney biopsy samples when diagnosing the underlying cause of kidney disease. However, the assessment is qualitative, or semi-quantitative at best, and reproducibility is challenging. To discover previously unknown features which predict patient outcomes and overcome substantial interobserver variability, we developed an unsupervised bag-of-words model. Our study applied to the C-PROBE cohort of patients with chronic kidney disease (CKD). 107,471 histopathology images were obtained from 161 biopsy cores and identified important morphological features in biopsy tissue that are highly predictive of the presence of CKD both at the time of biopsy and in one year. To evaluate the performance of our model, we estimated the AUC and its 95% confidence interval. We show that this method is reliable and reproducible and can achieve 0.93 AUC at predicting glomerular filtration rate at the time of biopsy as well as predicting a loss of function at one year. Additionally, with this method, we ranked the identified morphological features according to their importance as diagnostic markers for chronic kidney disease. In this study, we have demonstrated the feasibility of using an unsupervised machine learning method without human input in order to predict the level of kidney function in CKD. The results from our study indicate that the visual dictionary, or visual image pattern, obtained from unsupervised machine learning can predict outcomes using machine-derived values that correspond to both known and unknown clinically relevant features.
format Online
Article
Text
id pubmed-8941143
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-89411432022-03-28 Unsupervised machine learning for identifying important visual features through bag-of-words using histopathology data from chronic kidney disease Lee, Joonsang Warner, Elisa Shaikhouni, Salma Bitzer, Markus Kretzler, Matthias Gipson, Debbie Pennathur, Subramaniam Bellovich, Keith Bhat, Zeenat Gadegbeku, Crystal Massengill, Susan Perumal, Kalyani Saha, Jharna Yang, Yingbao Luo, Jinghui Zhang, Xin Mariani, Laura Hodgin, Jeffrey B. Rao, Arvind Sci Rep Article Pathologists use visual classification to assess patient kidney biopsy samples when diagnosing the underlying cause of kidney disease. However, the assessment is qualitative, or semi-quantitative at best, and reproducibility is challenging. To discover previously unknown features which predict patient outcomes and overcome substantial interobserver variability, we developed an unsupervised bag-of-words model. Our study applied to the C-PROBE cohort of patients with chronic kidney disease (CKD). 107,471 histopathology images were obtained from 161 biopsy cores and identified important morphological features in biopsy tissue that are highly predictive of the presence of CKD both at the time of biopsy and in one year. To evaluate the performance of our model, we estimated the AUC and its 95% confidence interval. We show that this method is reliable and reproducible and can achieve 0.93 AUC at predicting glomerular filtration rate at the time of biopsy as well as predicting a loss of function at one year. Additionally, with this method, we ranked the identified morphological features according to their importance as diagnostic markers for chronic kidney disease. In this study, we have demonstrated the feasibility of using an unsupervised machine learning method without human input in order to predict the level of kidney function in CKD. The results from our study indicate that the visual dictionary, or visual image pattern, obtained from unsupervised machine learning can predict outcomes using machine-derived values that correspond to both known and unknown clinically relevant features. Nature Publishing Group UK 2022-03-22 /pmc/articles/PMC8941143/ /pubmed/35318420 http://dx.doi.org/10.1038/s41598-022-08974-8 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Lee, Joonsang
Warner, Elisa
Shaikhouni, Salma
Bitzer, Markus
Kretzler, Matthias
Gipson, Debbie
Pennathur, Subramaniam
Bellovich, Keith
Bhat, Zeenat
Gadegbeku, Crystal
Massengill, Susan
Perumal, Kalyani
Saha, Jharna
Yang, Yingbao
Luo, Jinghui
Zhang, Xin
Mariani, Laura
Hodgin, Jeffrey B.
Rao, Arvind
Unsupervised machine learning for identifying important visual features through bag-of-words using histopathology data from chronic kidney disease
title Unsupervised machine learning for identifying important visual features through bag-of-words using histopathology data from chronic kidney disease
title_full Unsupervised machine learning for identifying important visual features through bag-of-words using histopathology data from chronic kidney disease
title_fullStr Unsupervised machine learning for identifying important visual features through bag-of-words using histopathology data from chronic kidney disease
title_full_unstemmed Unsupervised machine learning for identifying important visual features through bag-of-words using histopathology data from chronic kidney disease
title_short Unsupervised machine learning for identifying important visual features through bag-of-words using histopathology data from chronic kidney disease
title_sort unsupervised machine learning for identifying important visual features through bag-of-words using histopathology data from chronic kidney disease
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8941143/
https://www.ncbi.nlm.nih.gov/pubmed/35318420
http://dx.doi.org/10.1038/s41598-022-08974-8
work_keys_str_mv AT leejoonsang unsupervisedmachinelearningforidentifyingimportantvisualfeaturesthroughbagofwordsusinghistopathologydatafromchronickidneydisease
AT warnerelisa unsupervisedmachinelearningforidentifyingimportantvisualfeaturesthroughbagofwordsusinghistopathologydatafromchronickidneydisease
AT shaikhounisalma unsupervisedmachinelearningforidentifyingimportantvisualfeaturesthroughbagofwordsusinghistopathologydatafromchronickidneydisease
AT bitzermarkus unsupervisedmachinelearningforidentifyingimportantvisualfeaturesthroughbagofwordsusinghistopathologydatafromchronickidneydisease
AT kretzlermatthias unsupervisedmachinelearningforidentifyingimportantvisualfeaturesthroughbagofwordsusinghistopathologydatafromchronickidneydisease
AT gipsondebbie unsupervisedmachinelearningforidentifyingimportantvisualfeaturesthroughbagofwordsusinghistopathologydatafromchronickidneydisease
AT pennathursubramaniam unsupervisedmachinelearningforidentifyingimportantvisualfeaturesthroughbagofwordsusinghistopathologydatafromchronickidneydisease
AT bellovichkeith unsupervisedmachinelearningforidentifyingimportantvisualfeaturesthroughbagofwordsusinghistopathologydatafromchronickidneydisease
AT bhatzeenat unsupervisedmachinelearningforidentifyingimportantvisualfeaturesthroughbagofwordsusinghistopathologydatafromchronickidneydisease
AT gadegbekucrystal unsupervisedmachinelearningforidentifyingimportantvisualfeaturesthroughbagofwordsusinghistopathologydatafromchronickidneydisease
AT massengillsusan unsupervisedmachinelearningforidentifyingimportantvisualfeaturesthroughbagofwordsusinghistopathologydatafromchronickidneydisease
AT perumalkalyani unsupervisedmachinelearningforidentifyingimportantvisualfeaturesthroughbagofwordsusinghistopathologydatafromchronickidneydisease
AT sahajharna unsupervisedmachinelearningforidentifyingimportantvisualfeaturesthroughbagofwordsusinghistopathologydatafromchronickidneydisease
AT yangyingbao unsupervisedmachinelearningforidentifyingimportantvisualfeaturesthroughbagofwordsusinghistopathologydatafromchronickidneydisease
AT luojinghui unsupervisedmachinelearningforidentifyingimportantvisualfeaturesthroughbagofwordsusinghistopathologydatafromchronickidneydisease
AT zhangxin unsupervisedmachinelearningforidentifyingimportantvisualfeaturesthroughbagofwordsusinghistopathologydatafromchronickidneydisease
AT marianilaura unsupervisedmachinelearningforidentifyingimportantvisualfeaturesthroughbagofwordsusinghistopathologydatafromchronickidneydisease
AT hodginjeffreyb unsupervisedmachinelearningforidentifyingimportantvisualfeaturesthroughbagofwordsusinghistopathologydatafromchronickidneydisease
AT raoarvind unsupervisedmachinelearningforidentifyingimportantvisualfeaturesthroughbagofwordsusinghistopathologydatafromchronickidneydisease
AT unsupervisedmachinelearningforidentifyingimportantvisualfeaturesthroughbagofwordsusinghistopathologydatafromchronickidneydisease