Cargando…

Scalable analysis of Big pathology image data cohorts using efficient methods and high-performance computing strategies

BACKGROUND: We describe a suite of tools and methods that form a core set of capabilities for researchers and clinical investigators to evaluate multiple analytical pipelines and quantify sensitivity and variability of the results while conducting large-scale studies in investigative pathology and o...

Descripción completa

Detalles Bibliográficos
Autores principales: Kurc, Tahsin, Qi, Xin, Wang, Daihou, Wang, Fusheng, Teodoro, George, Cooper, Lee, Nalisnik, Michael, Yang, Lin, Saltz, Joel, Foran, David J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4667532/
https://www.ncbi.nlm.nih.gov/pubmed/26627175
http://dx.doi.org/10.1186/s12859-015-0831-6
_version_ 1782403848771469312
author Kurc, Tahsin
Qi, Xin
Wang, Daihou
Wang, Fusheng
Teodoro, George
Cooper, Lee
Nalisnik, Michael
Yang, Lin
Saltz, Joel
Foran, David J.
author_facet Kurc, Tahsin
Qi, Xin
Wang, Daihou
Wang, Fusheng
Teodoro, George
Cooper, Lee
Nalisnik, Michael
Yang, Lin
Saltz, Joel
Foran, David J.
author_sort Kurc, Tahsin
collection PubMed
description BACKGROUND: We describe a suite of tools and methods that form a core set of capabilities for researchers and clinical investigators to evaluate multiple analytical pipelines and quantify sensitivity and variability of the results while conducting large-scale studies in investigative pathology and oncology. The overarching objective of the current investigation is to address the challenges of large data sizes and high computational demands. RESULTS: The proposed tools and methods take advantage of state-of-the-art parallel machines and efficient content-based image searching strategies. The content based image retrieval (CBIR) algorithms can quickly detect and retrieve image patches similar to a query patch using a hierarchical analysis approach. The analysis component based on high performance computing can carry out consensus clustering on 500,000 data points using a large shared memory system. CONCLUSIONS: Our work demonstrates efficient CBIR algorithms and high performance computing can be leveraged for efficient analysis of large microscopy images to meet the challenges of clinically salient applications in pathology. These technologies enable researchers and clinical investigators to make more effective use of the rich informational content contained within digitized microscopy specimens.
format Online
Article
Text
id pubmed-4667532
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-46675322015-12-03 Scalable analysis of Big pathology image data cohorts using efficient methods and high-performance computing strategies Kurc, Tahsin Qi, Xin Wang, Daihou Wang, Fusheng Teodoro, George Cooper, Lee Nalisnik, Michael Yang, Lin Saltz, Joel Foran, David J. BMC Bioinformatics Research Article BACKGROUND: We describe a suite of tools and methods that form a core set of capabilities for researchers and clinical investigators to evaluate multiple analytical pipelines and quantify sensitivity and variability of the results while conducting large-scale studies in investigative pathology and oncology. The overarching objective of the current investigation is to address the challenges of large data sizes and high computational demands. RESULTS: The proposed tools and methods take advantage of state-of-the-art parallel machines and efficient content-based image searching strategies. The content based image retrieval (CBIR) algorithms can quickly detect and retrieve image patches similar to a query patch using a hierarchical analysis approach. The analysis component based on high performance computing can carry out consensus clustering on 500,000 data points using a large shared memory system. CONCLUSIONS: Our work demonstrates efficient CBIR algorithms and high performance computing can be leveraged for efficient analysis of large microscopy images to meet the challenges of clinically salient applications in pathology. These technologies enable researchers and clinical investigators to make more effective use of the rich informational content contained within digitized microscopy specimens. BioMed Central 2015-12-01 /pmc/articles/PMC4667532/ /pubmed/26627175 http://dx.doi.org/10.1186/s12859-015-0831-6 Text en © Kurc et al. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Kurc, Tahsin
Qi, Xin
Wang, Daihou
Wang, Fusheng
Teodoro, George
Cooper, Lee
Nalisnik, Michael
Yang, Lin
Saltz, Joel
Foran, David J.
Scalable analysis of Big pathology image data cohorts using efficient methods and high-performance computing strategies
title Scalable analysis of Big pathology image data cohorts using efficient methods and high-performance computing strategies
title_full Scalable analysis of Big pathology image data cohorts using efficient methods and high-performance computing strategies
title_fullStr Scalable analysis of Big pathology image data cohorts using efficient methods and high-performance computing strategies
title_full_unstemmed Scalable analysis of Big pathology image data cohorts using efficient methods and high-performance computing strategies
title_short Scalable analysis of Big pathology image data cohorts using efficient methods and high-performance computing strategies
title_sort scalable analysis of big pathology image data cohorts using efficient methods and high-performance computing strategies
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4667532/
https://www.ncbi.nlm.nih.gov/pubmed/26627175
http://dx.doi.org/10.1186/s12859-015-0831-6
work_keys_str_mv AT kurctahsin scalableanalysisofbigpathologyimagedatacohortsusingefficientmethodsandhighperformancecomputingstrategies
AT qixin scalableanalysisofbigpathologyimagedatacohortsusingefficientmethodsandhighperformancecomputingstrategies
AT wangdaihou scalableanalysisofbigpathologyimagedatacohortsusingefficientmethodsandhighperformancecomputingstrategies
AT wangfusheng scalableanalysisofbigpathologyimagedatacohortsusingefficientmethodsandhighperformancecomputingstrategies
AT teodorogeorge scalableanalysisofbigpathologyimagedatacohortsusingefficientmethodsandhighperformancecomputingstrategies
AT cooperlee scalableanalysisofbigpathologyimagedatacohortsusingefficientmethodsandhighperformancecomputingstrategies
AT nalisnikmichael scalableanalysisofbigpathologyimagedatacohortsusingefficientmethodsandhighperformancecomputingstrategies
AT yanglin scalableanalysisofbigpathologyimagedatacohortsusingefficientmethodsandhighperformancecomputingstrategies
AT saltzjoel scalableanalysisofbigpathologyimagedatacohortsusingefficientmethodsandhighperformancecomputingstrategies
AT forandavidj scalableanalysisofbigpathologyimagedatacohortsusingefficientmethodsandhighperformancecomputingstrategies