Cargando…

Categorizing biomedicine images using novel image features and sparse coding representation

BACKGROUND: Images embedded in biomedical publications carry rich information that often concisely summarize key hypotheses adopted, methods employed, or results obtained in a published study. Therefore, they offer valuable clues for understanding main content in a biomedical publication. Prior stud...

Descripción completa

Detalles Bibliográficos
Autores principales:	Sheng, Jianqiang, Xu, Songhua, Luo, Xiaonan
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2013
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4109834/ https://www.ncbi.nlm.nih.gov/pubmed/24565470 http://dx.doi.org/10.1186/1755-8794-6-S3-S8

_version_	1782327914590633984
author	Sheng, Jianqiang Xu, Songhua Luo, Xiaonan
author_facet	Sheng, Jianqiang Xu, Songhua Luo, Xiaonan
author_sort	Sheng, Jianqiang
collection	PubMed
description	BACKGROUND: Images embedded in biomedical publications carry rich information that often concisely summarize key hypotheses adopted, methods employed, or results obtained in a published study. Therefore, they offer valuable clues for understanding main content in a biomedical publication. Prior studies have pointed out the potential of mining images embedded in biomedical publications for automatically understanding and retrieving such images' associated source documents. Within the broad area of biomedical image processing, categorizing biomedical images is a fundamental step for building many advanced image analysis, retrieval, and mining applications. Similar to any automatic categorization effort, discriminative image features can provide the most crucial aid in the process. METHOD: We observe that many images embedded in biomedical publications carry versatile annotation text. Based on the locations of and the spatial relationships between these text elements in an image, we thus propose some novel image features for image categorization purpose, which quantitatively characterize the spatial positions and distributions of text elements inside a biomedical image. We further adopt a sparse coding representation (SCR) based technique to categorize images embedded in biomedical publications by leveraging our newly proposed image features. RESULTS: we randomly selected 990 images of the JPG format for use in our experiments where 310 images were used as training samples and the rest were used as the testing cases. We first segmented 310 sample images following the our proposed procedure. This step produced a total of 1035 sub-images. We then manually labeled all these sub-images according to the two-level hierarchical image taxonomy proposed by [1]. Among our annotation results, 316 are microscopy images, 126 are gel electrophoresis images, 135 are line charts, 156 are bar charts, 52 are spot charts, 25 are tables, 70 are flow charts, and the remaining 155 images are of the type "others". A serial of experimental results are obtained. Firstly, each image categorizing results is presented, and next image categorizing performance indexes such as precision, recall, F-score, are all listed. Different features which include conventional image features and our proposed novel features indicate different categorizing performance, and the results are demonstrated. Thirdly, we conduct an accuracy comparison between support vector machine classification method and our proposed sparse representation classification method. At last, our proposed approach is compared with three peer classification method and experimental results verify our impressively improved performance. CONCLUSIONS: Compared with conventional image features that do not exploit characteristics regarding text positions and distributions inside images embedded in biomedical publications, our proposed image features coupled with the SR based representation model exhibit superior performance for classifying biomedical images as demonstrated in our comparative benchmark study.
format	Online Article Text
id	pubmed-4109834
institution	National Center for Biotechnology Information
language	English
publishDate	2013
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-41098342014-08-04 Categorizing biomedicine images using novel image features and sparse coding representation Sheng, Jianqiang Xu, Songhua Luo, Xiaonan BMC Med Genomics Research BACKGROUND: Images embedded in biomedical publications carry rich information that often concisely summarize key hypotheses adopted, methods employed, or results obtained in a published study. Therefore, they offer valuable clues for understanding main content in a biomedical publication. Prior studies have pointed out the potential of mining images embedded in biomedical publications for automatically understanding and retrieving such images' associated source documents. Within the broad area of biomedical image processing, categorizing biomedical images is a fundamental step for building many advanced image analysis, retrieval, and mining applications. Similar to any automatic categorization effort, discriminative image features can provide the most crucial aid in the process. METHOD: We observe that many images embedded in biomedical publications carry versatile annotation text. Based on the locations of and the spatial relationships between these text elements in an image, we thus propose some novel image features for image categorization purpose, which quantitatively characterize the spatial positions and distributions of text elements inside a biomedical image. We further adopt a sparse coding representation (SCR) based technique to categorize images embedded in biomedical publications by leveraging our newly proposed image features. RESULTS: we randomly selected 990 images of the JPG format for use in our experiments where 310 images were used as training samples and the rest were used as the testing cases. We first segmented 310 sample images following the our proposed procedure. This step produced a total of 1035 sub-images. We then manually labeled all these sub-images according to the two-level hierarchical image taxonomy proposed by [1]. Among our annotation results, 316 are microscopy images, 126 are gel electrophoresis images, 135 are line charts, 156 are bar charts, 52 are spot charts, 25 are tables, 70 are flow charts, and the remaining 155 images are of the type "others". A serial of experimental results are obtained. Firstly, each image categorizing results is presented, and next image categorizing performance indexes such as precision, recall, F-score, are all listed. Different features which include conventional image features and our proposed novel features indicate different categorizing performance, and the results are demonstrated. Thirdly, we conduct an accuracy comparison between support vector machine classification method and our proposed sparse representation classification method. At last, our proposed approach is compared with three peer classification method and experimental results verify our impressively improved performance. CONCLUSIONS: Compared with conventional image features that do not exploit characteristics regarding text positions and distributions inside images embedded in biomedical publications, our proposed image features coupled with the SR based representation model exhibit superior performance for classifying biomedical images as demonstrated in our comparative benchmark study. BioMed Central 2013-11-11 /pmc/articles/PMC4109834/ /pubmed/24565470 http://dx.doi.org/10.1186/1755-8794-6-S3-S8 Text en Copyright © 2013 Sheng et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Research Sheng, Jianqiang Xu, Songhua Luo, Xiaonan Categorizing biomedicine images using novel image features and sparse coding representation
title	Categorizing biomedicine images using novel image features and sparse coding representation
title_full	Categorizing biomedicine images using novel image features and sparse coding representation
title_fullStr	Categorizing biomedicine images using novel image features and sparse coding representation
title_full_unstemmed	Categorizing biomedicine images using novel image features and sparse coding representation
title_short	Categorizing biomedicine images using novel image features and sparse coding representation
title_sort	categorizing biomedicine images using novel image features and sparse coding representation
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4109834/ https://www.ncbi.nlm.nih.gov/pubmed/24565470 http://dx.doi.org/10.1186/1755-8794-6-S3-S8
work_keys_str_mv	AT shengjianqiang categorizingbiomedicineimagesusingnovelimagefeaturesandsparsecodingrepresentation AT xusonghua categorizingbiomedicineimagesusingnovelimagefeaturesandsparsecodingrepresentation AT luoxiaonan categorizingbiomedicineimagesusingnovelimagefeaturesandsparsecodingrepresentation

Categorizing biomedicine images using novel image features and sparse coding representation

Ejemplares similares