Cargando…

A multiresolution approach to automated classification of protein subcellular location images

BACKGROUND: Fluorescence microscopy is widely used to determine the subcellular location of proteins. Efforts to determine location on a proteome-wide basis create a need for automated methods to analyze the resulting images. Over the past ten years, the feasibility of using machine learning methods...

Descripción completa

Detalles Bibliográficos
Autores principales: Chebira, Amina, Barbotin, Yann, Jackson, Charles, Merryman, Thomas, Srinivasa, Gowri, Murphy, Robert F, Kovačević, Jelena
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1933440/
https://www.ncbi.nlm.nih.gov/pubmed/17578580
http://dx.doi.org/10.1186/1471-2105-8-210
_version_ 1782134328504877056
author Chebira, Amina
Barbotin, Yann
Jackson, Charles
Merryman, Thomas
Srinivasa, Gowri
Murphy, Robert F
Kovačević, Jelena
author_facet Chebira, Amina
Barbotin, Yann
Jackson, Charles
Merryman, Thomas
Srinivasa, Gowri
Murphy, Robert F
Kovačević, Jelena
author_sort Chebira, Amina
collection PubMed
description BACKGROUND: Fluorescence microscopy is widely used to determine the subcellular location of proteins. Efforts to determine location on a proteome-wide basis create a need for automated methods to analyze the resulting images. Over the past ten years, the feasibility of using machine learning methods to recognize all major subcellular location patterns has been convincingly demonstrated, using diverse feature sets and classifiers. On a well-studied data set of 2D HeLa single-cell images, the best performance to date, 91.5%, was obtained by including a set of multiresolution features. This demonstrates the value of multiresolution approaches to this important problem. RESULTS: We report here a novel approach for the classification of subcellular location patterns by classifying in multiresolution subspaces. Our system is able to work with any feature set and any classifier. It consists of multiresolution (MR) decomposition, followed by feature computation and classification in each MR subspace, yielding local decisions that are then combined into a global decision. With 26 texture features alone and a neural network classifier, we obtained an increase in accuracy on the 2D HeLa data set to 95.3%. CONCLUSION: We demonstrate that the space-frequency localized information in the multiresolution subspaces adds significantly to the discriminative power of the system. Moreover, we show that a vastly reduced set of features is sufficient, consisting of our novel modified Haralick texture features. Our proposed system is general, allowing for any combinations of sets of features and any combination of classifiers.
format Text
id pubmed-1933440
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-19334402007-07-26 A multiresolution approach to automated classification of protein subcellular location images Chebira, Amina Barbotin, Yann Jackson, Charles Merryman, Thomas Srinivasa, Gowri Murphy, Robert F Kovačević, Jelena BMC Bioinformatics Research Article BACKGROUND: Fluorescence microscopy is widely used to determine the subcellular location of proteins. Efforts to determine location on a proteome-wide basis create a need for automated methods to analyze the resulting images. Over the past ten years, the feasibility of using machine learning methods to recognize all major subcellular location patterns has been convincingly demonstrated, using diverse feature sets and classifiers. On a well-studied data set of 2D HeLa single-cell images, the best performance to date, 91.5%, was obtained by including a set of multiresolution features. This demonstrates the value of multiresolution approaches to this important problem. RESULTS: We report here a novel approach for the classification of subcellular location patterns by classifying in multiresolution subspaces. Our system is able to work with any feature set and any classifier. It consists of multiresolution (MR) decomposition, followed by feature computation and classification in each MR subspace, yielding local decisions that are then combined into a global decision. With 26 texture features alone and a neural network classifier, we obtained an increase in accuracy on the 2D HeLa data set to 95.3%. CONCLUSION: We demonstrate that the space-frequency localized information in the multiresolution subspaces adds significantly to the discriminative power of the system. Moreover, we show that a vastly reduced set of features is sufficient, consisting of our novel modified Haralick texture features. Our proposed system is general, allowing for any combinations of sets of features and any combination of classifiers. BioMed Central 2007-06-19 /pmc/articles/PMC1933440/ /pubmed/17578580 http://dx.doi.org/10.1186/1471-2105-8-210 Text en Copyright © 2007 Chebira et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Chebira, Amina
Barbotin, Yann
Jackson, Charles
Merryman, Thomas
Srinivasa, Gowri
Murphy, Robert F
Kovačević, Jelena
A multiresolution approach to automated classification of protein subcellular location images
title A multiresolution approach to automated classification of protein subcellular location images
title_full A multiresolution approach to automated classification of protein subcellular location images
title_fullStr A multiresolution approach to automated classification of protein subcellular location images
title_full_unstemmed A multiresolution approach to automated classification of protein subcellular location images
title_short A multiresolution approach to automated classification of protein subcellular location images
title_sort multiresolution approach to automated classification of protein subcellular location images
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1933440/
https://www.ncbi.nlm.nih.gov/pubmed/17578580
http://dx.doi.org/10.1186/1471-2105-8-210
work_keys_str_mv AT chebiraamina amultiresolutionapproachtoautomatedclassificationofproteinsubcellularlocationimages
AT barbotinyann amultiresolutionapproachtoautomatedclassificationofproteinsubcellularlocationimages
AT jacksoncharles amultiresolutionapproachtoautomatedclassificationofproteinsubcellularlocationimages
AT merrymanthomas amultiresolutionapproachtoautomatedclassificationofproteinsubcellularlocationimages
AT srinivasagowri amultiresolutionapproachtoautomatedclassificationofproteinsubcellularlocationimages
AT murphyrobertf amultiresolutionapproachtoautomatedclassificationofproteinsubcellularlocationimages
AT kovacevicjelena amultiresolutionapproachtoautomatedclassificationofproteinsubcellularlocationimages
AT chebiraamina multiresolutionapproachtoautomatedclassificationofproteinsubcellularlocationimages
AT barbotinyann multiresolutionapproachtoautomatedclassificationofproteinsubcellularlocationimages
AT jacksoncharles multiresolutionapproachtoautomatedclassificationofproteinsubcellularlocationimages
AT merrymanthomas multiresolutionapproachtoautomatedclassificationofproteinsubcellularlocationimages
AT srinivasagowri multiresolutionapproachtoautomatedclassificationofproteinsubcellularlocationimages
AT murphyrobertf multiresolutionapproachtoautomatedclassificationofproteinsubcellularlocationimages
AT kovacevicjelena multiresolutionapproachtoautomatedclassificationofproteinsubcellularlocationimages