Cargando…

Content-based image retrieval of digitized histopathology in boosted spectrally embedded spaces

CONTEXT: Content-based image retrieval (CBIR) systems allow for retrieval of images from within a database that are similar in visual content to a query image. This is useful for digital pathology, where text-based descriptors alone might be inadequate to accurately describe image content. By repres...

Descripción completa

Detalles Bibliográficos
Autores principales: Sridhar, Akshay, Doyle, Scott, Madabhushi, Anant
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Medknow Publications & Media Pvt Ltd 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4498317/
https://www.ncbi.nlm.nih.gov/pubmed/26167385
http://dx.doi.org/10.4103/2153-3539.159441
_version_ 1782380605271441408
author Sridhar, Akshay
Doyle, Scott
Madabhushi, Anant
author_facet Sridhar, Akshay
Doyle, Scott
Madabhushi, Anant
author_sort Sridhar, Akshay
collection PubMed
description CONTEXT: Content-based image retrieval (CBIR) systems allow for retrieval of images from within a database that are similar in visual content to a query image. This is useful for digital pathology, where text-based descriptors alone might be inadequate to accurately describe image content. By representing images via a set of quantitative image descriptors, the similarity between a query image with respect to archived, annotated images in a database can be computed and the most similar images retrieved. Recently, non-linear dimensionality reduction methods have become popular for embedding high-dimensional data into a reduced-dimensional space while preserving local object adjacencies, thereby allowing for object similarity to be determined more accurately in the reduced-dimensional space. However, most dimensionality reduction methods implicitly assume, in computing the reduced-dimensional representation, that all features are equally important. AIMS: In this paper we present boosted spectral embedding(BoSE), which utilizes a boosted distance metric to selectively weight individual features (based on training data) to subsequently map the data into a reduced-dimensional space. SETTINGS AND DESIGN: BoSE is evaluated against spectral embedding (SE) (which employs equal feature weighting) in the context of CBIR of digitized prostate and breast cancer histopathology images. MATERIALS AND METHODS: The following datasets, which were comprised of a total of 154 hematoxylin and eosin stained histopathology images, were used: (1) Prostate cancer histopathology (benign vs. malignant), (2) estrogen receptor (ER) + breast cancer histopathology (low vs. high grade), and (3) HER2+ breast cancer histopathology (low vs. high levels of lymphocytic infiltration). STATISTICAL ANALYSIS USED: We plotted and calculated the area under precision-recall curves (AUPRC) and calculated classification accuracy using the Random Forest classifier. RESULTS: BoSE outperformed SE both in terms of CBIR-based (area under the precision-recall curve) and classifier-based (classification accuracy) on average across all of the dimensions tested for all three datasets: (1) Prostate cancer histopathology (AUPRC: BoSE = 0.79, SE = 0.63; Accuracy: BoSE = 0.93, SE = 0.80), (2) ER + breast cancer histopathology (AUPRC: BoSE = 0.79, SE = 0.68; Accuracy: BoSE = 0.96, SE = 0.96), and (3) HER2+ breast cancer histopathology (AUPRC: BoSE = 0.54, SE = 0.44; Accuracy: BoSE = 0.93, SE = 0.91). CONCLUSION: Our results suggest that BoSE could serve as an important tool for CBIR and classification of high-dimensional biomedical data.
format Online
Article
Text
id pubmed-4498317
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Medknow Publications & Media Pvt Ltd
record_format MEDLINE/PubMed
spelling pubmed-44983172015-07-12 Content-based image retrieval of digitized histopathology in boosted spectrally embedded spaces Sridhar, Akshay Doyle, Scott Madabhushi, Anant J Pathol Inform Original Article CONTEXT: Content-based image retrieval (CBIR) systems allow for retrieval of images from within a database that are similar in visual content to a query image. This is useful for digital pathology, where text-based descriptors alone might be inadequate to accurately describe image content. By representing images via a set of quantitative image descriptors, the similarity between a query image with respect to archived, annotated images in a database can be computed and the most similar images retrieved. Recently, non-linear dimensionality reduction methods have become popular for embedding high-dimensional data into a reduced-dimensional space while preserving local object adjacencies, thereby allowing for object similarity to be determined more accurately in the reduced-dimensional space. However, most dimensionality reduction methods implicitly assume, in computing the reduced-dimensional representation, that all features are equally important. AIMS: In this paper we present boosted spectral embedding(BoSE), which utilizes a boosted distance metric to selectively weight individual features (based on training data) to subsequently map the data into a reduced-dimensional space. SETTINGS AND DESIGN: BoSE is evaluated against spectral embedding (SE) (which employs equal feature weighting) in the context of CBIR of digitized prostate and breast cancer histopathology images. MATERIALS AND METHODS: The following datasets, which were comprised of a total of 154 hematoxylin and eosin stained histopathology images, were used: (1) Prostate cancer histopathology (benign vs. malignant), (2) estrogen receptor (ER) + breast cancer histopathology (low vs. high grade), and (3) HER2+ breast cancer histopathology (low vs. high levels of lymphocytic infiltration). STATISTICAL ANALYSIS USED: We plotted and calculated the area under precision-recall curves (AUPRC) and calculated classification accuracy using the Random Forest classifier. RESULTS: BoSE outperformed SE both in terms of CBIR-based (area under the precision-recall curve) and classifier-based (classification accuracy) on average across all of the dimensions tested for all three datasets: (1) Prostate cancer histopathology (AUPRC: BoSE = 0.79, SE = 0.63; Accuracy: BoSE = 0.93, SE = 0.80), (2) ER + breast cancer histopathology (AUPRC: BoSE = 0.79, SE = 0.68; Accuracy: BoSE = 0.96, SE = 0.96), and (3) HER2+ breast cancer histopathology (AUPRC: BoSE = 0.54, SE = 0.44; Accuracy: BoSE = 0.93, SE = 0.91). CONCLUSION: Our results suggest that BoSE could serve as an important tool for CBIR and classification of high-dimensional biomedical data. Medknow Publications & Media Pvt Ltd 2015-06-29 /pmc/articles/PMC4498317/ /pubmed/26167385 http://dx.doi.org/10.4103/2153-3539.159441 Text en Copyright: © 2015 Sridhar A. http://creativecommons.org/licenses/by-nc-sa/3.0 This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Original Article
Sridhar, Akshay
Doyle, Scott
Madabhushi, Anant
Content-based image retrieval of digitized histopathology in boosted spectrally embedded spaces
title Content-based image retrieval of digitized histopathology in boosted spectrally embedded spaces
title_full Content-based image retrieval of digitized histopathology in boosted spectrally embedded spaces
title_fullStr Content-based image retrieval of digitized histopathology in boosted spectrally embedded spaces
title_full_unstemmed Content-based image retrieval of digitized histopathology in boosted spectrally embedded spaces
title_short Content-based image retrieval of digitized histopathology in boosted spectrally embedded spaces
title_sort content-based image retrieval of digitized histopathology in boosted spectrally embedded spaces
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4498317/
https://www.ncbi.nlm.nih.gov/pubmed/26167385
http://dx.doi.org/10.4103/2153-3539.159441
work_keys_str_mv AT sridharakshay contentbasedimageretrievalofdigitizedhistopathologyinboostedspectrallyembeddedspaces
AT doylescott contentbasedimageretrievalofdigitizedhistopathologyinboostedspectrallyembeddedspaces
AT madabhushianant contentbasedimageretrievalofdigitizedhistopathologyinboostedspectrallyembeddedspaces