Cargando…
Content-based image retrieval of digitized histopathology in boosted spectrally embedded spaces
CONTEXT: Content-based image retrieval (CBIR) systems allow for retrieval of images from within a database that are similar in visual content to a query image. This is useful for digital pathology, where text-based descriptors alone might be inadequate to accurately describe image content. By repres...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Medknow Publications & Media Pvt Ltd
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4498317/ https://www.ncbi.nlm.nih.gov/pubmed/26167385 http://dx.doi.org/10.4103/2153-3539.159441 |
_version_ | 1782380605271441408 |
---|---|
author | Sridhar, Akshay Doyle, Scott Madabhushi, Anant |
author_facet | Sridhar, Akshay Doyle, Scott Madabhushi, Anant |
author_sort | Sridhar, Akshay |
collection | PubMed |
description | CONTEXT: Content-based image retrieval (CBIR) systems allow for retrieval of images from within a database that are similar in visual content to a query image. This is useful for digital pathology, where text-based descriptors alone might be inadequate to accurately describe image content. By representing images via a set of quantitative image descriptors, the similarity between a query image with respect to archived, annotated images in a database can be computed and the most similar images retrieved. Recently, non-linear dimensionality reduction methods have become popular for embedding high-dimensional data into a reduced-dimensional space while preserving local object adjacencies, thereby allowing for object similarity to be determined more accurately in the reduced-dimensional space. However, most dimensionality reduction methods implicitly assume, in computing the reduced-dimensional representation, that all features are equally important. AIMS: In this paper we present boosted spectral embedding(BoSE), which utilizes a boosted distance metric to selectively weight individual features (based on training data) to subsequently map the data into a reduced-dimensional space. SETTINGS AND DESIGN: BoSE is evaluated against spectral embedding (SE) (which employs equal feature weighting) in the context of CBIR of digitized prostate and breast cancer histopathology images. MATERIALS AND METHODS: The following datasets, which were comprised of a total of 154 hematoxylin and eosin stained histopathology images, were used: (1) Prostate cancer histopathology (benign vs. malignant), (2) estrogen receptor (ER) + breast cancer histopathology (low vs. high grade), and (3) HER2+ breast cancer histopathology (low vs. high levels of lymphocytic infiltration). STATISTICAL ANALYSIS USED: We plotted and calculated the area under precision-recall curves (AUPRC) and calculated classification accuracy using the Random Forest classifier. RESULTS: BoSE outperformed SE both in terms of CBIR-based (area under the precision-recall curve) and classifier-based (classification accuracy) on average across all of the dimensions tested for all three datasets: (1) Prostate cancer histopathology (AUPRC: BoSE = 0.79, SE = 0.63; Accuracy: BoSE = 0.93, SE = 0.80), (2) ER + breast cancer histopathology (AUPRC: BoSE = 0.79, SE = 0.68; Accuracy: BoSE = 0.96, SE = 0.96), and (3) HER2+ breast cancer histopathology (AUPRC: BoSE = 0.54, SE = 0.44; Accuracy: BoSE = 0.93, SE = 0.91). CONCLUSION: Our results suggest that BoSE could serve as an important tool for CBIR and classification of high-dimensional biomedical data. |
format | Online Article Text |
id | pubmed-4498317 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Medknow Publications & Media Pvt Ltd |
record_format | MEDLINE/PubMed |
spelling | pubmed-44983172015-07-12 Content-based image retrieval of digitized histopathology in boosted spectrally embedded spaces Sridhar, Akshay Doyle, Scott Madabhushi, Anant J Pathol Inform Original Article CONTEXT: Content-based image retrieval (CBIR) systems allow for retrieval of images from within a database that are similar in visual content to a query image. This is useful for digital pathology, where text-based descriptors alone might be inadequate to accurately describe image content. By representing images via a set of quantitative image descriptors, the similarity between a query image with respect to archived, annotated images in a database can be computed and the most similar images retrieved. Recently, non-linear dimensionality reduction methods have become popular for embedding high-dimensional data into a reduced-dimensional space while preserving local object adjacencies, thereby allowing for object similarity to be determined more accurately in the reduced-dimensional space. However, most dimensionality reduction methods implicitly assume, in computing the reduced-dimensional representation, that all features are equally important. AIMS: In this paper we present boosted spectral embedding(BoSE), which utilizes a boosted distance metric to selectively weight individual features (based on training data) to subsequently map the data into a reduced-dimensional space. SETTINGS AND DESIGN: BoSE is evaluated against spectral embedding (SE) (which employs equal feature weighting) in the context of CBIR of digitized prostate and breast cancer histopathology images. MATERIALS AND METHODS: The following datasets, which were comprised of a total of 154 hematoxylin and eosin stained histopathology images, were used: (1) Prostate cancer histopathology (benign vs. malignant), (2) estrogen receptor (ER) + breast cancer histopathology (low vs. high grade), and (3) HER2+ breast cancer histopathology (low vs. high levels of lymphocytic infiltration). STATISTICAL ANALYSIS USED: We plotted and calculated the area under precision-recall curves (AUPRC) and calculated classification accuracy using the Random Forest classifier. RESULTS: BoSE outperformed SE both in terms of CBIR-based (area under the precision-recall curve) and classifier-based (classification accuracy) on average across all of the dimensions tested for all three datasets: (1) Prostate cancer histopathology (AUPRC: BoSE = 0.79, SE = 0.63; Accuracy: BoSE = 0.93, SE = 0.80), (2) ER + breast cancer histopathology (AUPRC: BoSE = 0.79, SE = 0.68; Accuracy: BoSE = 0.96, SE = 0.96), and (3) HER2+ breast cancer histopathology (AUPRC: BoSE = 0.54, SE = 0.44; Accuracy: BoSE = 0.93, SE = 0.91). CONCLUSION: Our results suggest that BoSE could serve as an important tool for CBIR and classification of high-dimensional biomedical data. Medknow Publications & Media Pvt Ltd 2015-06-29 /pmc/articles/PMC4498317/ /pubmed/26167385 http://dx.doi.org/10.4103/2153-3539.159441 Text en Copyright: © 2015 Sridhar A. http://creativecommons.org/licenses/by-nc-sa/3.0 This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Original Article Sridhar, Akshay Doyle, Scott Madabhushi, Anant Content-based image retrieval of digitized histopathology in boosted spectrally embedded spaces |
title | Content-based image retrieval of digitized histopathology in boosted spectrally embedded spaces |
title_full | Content-based image retrieval of digitized histopathology in boosted spectrally embedded spaces |
title_fullStr | Content-based image retrieval of digitized histopathology in boosted spectrally embedded spaces |
title_full_unstemmed | Content-based image retrieval of digitized histopathology in boosted spectrally embedded spaces |
title_short | Content-based image retrieval of digitized histopathology in boosted spectrally embedded spaces |
title_sort | content-based image retrieval of digitized histopathology in boosted spectrally embedded spaces |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4498317/ https://www.ncbi.nlm.nih.gov/pubmed/26167385 http://dx.doi.org/10.4103/2153-3539.159441 |
work_keys_str_mv | AT sridharakshay contentbasedimageretrievalofdigitizedhistopathologyinboostedspectrallyembeddedspaces AT doylescott contentbasedimageretrievalofdigitizedhistopathologyinboostedspectrallyembeddedspaces AT madabhushianant contentbasedimageretrievalofdigitizedhistopathologyinboostedspectrallyembeddedspaces |