Cargando…

Fast automated cell phenotype image classification

BACKGROUND: The genomic revolution has led to rapid growth in sequencing of genes and proteins, and attention is now turning to the function of the encoded proteins. In this respect, microscope imaging of a protein's sub-cellular localisation is proving invaluable, and recent advances in automa...

Descripción completa

Detalles Bibliográficos
Autores principales:	Hamilton, Nicholas A, Pantelic, Radosav S, Hanson, Kelly, Teasdale, Rohan D
Formato:	Texto
Lenguaje:	English
Publicado:	BioMed Central 2007
Materias:	Methodology Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1847687/ https://www.ncbi.nlm.nih.gov/pubmed/17394669 http://dx.doi.org/10.1186/1471-2105-8-110

_version_	1782132911078637568
author	Hamilton, Nicholas A Pantelic, Radosav S Hanson, Kelly Teasdale, Rohan D
author_facet	Hamilton, Nicholas A Pantelic, Radosav S Hanson, Kelly Teasdale, Rohan D
author_sort	Hamilton, Nicholas A
collection	PubMed
description	BACKGROUND: The genomic revolution has led to rapid growth in sequencing of genes and proteins, and attention is now turning to the function of the encoded proteins. In this respect, microscope imaging of a protein's sub-cellular localisation is proving invaluable, and recent advances in automated fluorescent microscopy allow protein localisations to be imaged in high throughput. Hence there is a need for large scale automated computational techniques to efficiently quantify, distinguish and classify sub-cellular images. While image statistics have proved highly successful in distinguishing localisation, commonly used measures suffer from being relatively slow to compute, and often require cells to be individually selected from experimental images, thus limiting both throughput and the range of potential applications. Here we introduce threshold adjacency statistics, the essence which is to threshold the image and to count the number of above threshold pixels with a given number of above threshold pixels adjacent. These novel measures are shown to distinguish and classify images of distinct sub-cellular localization with high speed and accuracy without image cropping. RESULTS: Threshold adjacency statistics are applied to classification of protein sub-cellular localization images. They are tested on two image sets (available for download), one for which fluorescently tagged proteins are endogenously expressed in 10 sub-cellular locations, and another for which proteins are transfected into 11 locations. For each image set, a support vector machine was trained and tested. Classification accuracies of 94.4% and 86.6% are obtained on the endogenous and transfected sets, respectively. Threshold adjacency statistics are found to provide comparable or higher accuracy than other commonly used statistics while being an order of magnitude faster to calculate. Further, threshold adjacency statistics in combination with Haralick measures give accuracies of 98.2% and 93.2% on the endogenous and transfected sets, respectively. CONCLUSION: Threshold adjacency statistics have the potential to greatly extend the scale and range of applications of image statistics in computational image analysis. They remove the need for cropping of individual cells from images, and are an order of magnitude faster to calculate than other commonly used statistics while providing comparable or better classification accuracy, both essential requirements for application to large-scale approaches.
format	Text
id	pubmed-1847687
institution	National Center for Biotechnology Information
language	English
publishDate	2007
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-18476872007-04-05 Fast automated cell phenotype image classification Hamilton, Nicholas A Pantelic, Radosav S Hanson, Kelly Teasdale, Rohan D BMC Bioinformatics Methodology Article BACKGROUND: The genomic revolution has led to rapid growth in sequencing of genes and proteins, and attention is now turning to the function of the encoded proteins. In this respect, microscope imaging of a protein's sub-cellular localisation is proving invaluable, and recent advances in automated fluorescent microscopy allow protein localisations to be imaged in high throughput. Hence there is a need for large scale automated computational techniques to efficiently quantify, distinguish and classify sub-cellular images. While image statistics have proved highly successful in distinguishing localisation, commonly used measures suffer from being relatively slow to compute, and often require cells to be individually selected from experimental images, thus limiting both throughput and the range of potential applications. Here we introduce threshold adjacency statistics, the essence which is to threshold the image and to count the number of above threshold pixels with a given number of above threshold pixels adjacent. These novel measures are shown to distinguish and classify images of distinct sub-cellular localization with high speed and accuracy without image cropping. RESULTS: Threshold adjacency statistics are applied to classification of protein sub-cellular localization images. They are tested on two image sets (available for download), one for which fluorescently tagged proteins are endogenously expressed in 10 sub-cellular locations, and another for which proteins are transfected into 11 locations. For each image set, a support vector machine was trained and tested. Classification accuracies of 94.4% and 86.6% are obtained on the endogenous and transfected sets, respectively. Threshold adjacency statistics are found to provide comparable or higher accuracy than other commonly used statistics while being an order of magnitude faster to calculate. Further, threshold adjacency statistics in combination with Haralick measures give accuracies of 98.2% and 93.2% on the endogenous and transfected sets, respectively. CONCLUSION: Threshold adjacency statistics have the potential to greatly extend the scale and range of applications of image statistics in computational image analysis. They remove the need for cropping of individual cells from images, and are an order of magnitude faster to calculate than other commonly used statistics while providing comparable or better classification accuracy, both essential requirements for application to large-scale approaches. BioMed Central 2007-03-30 /pmc/articles/PMC1847687/ /pubmed/17394669 http://dx.doi.org/10.1186/1471-2105-8-110 Text en Copyright © 2007 Hamilton et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Methodology Article Hamilton, Nicholas A Pantelic, Radosav S Hanson, Kelly Teasdale, Rohan D Fast automated cell phenotype image classification
title	Fast automated cell phenotype image classification
title_full	Fast automated cell phenotype image classification
title_fullStr	Fast automated cell phenotype image classification
title_full_unstemmed	Fast automated cell phenotype image classification
title_short	Fast automated cell phenotype image classification
title_sort	fast automated cell phenotype image classification
topic	Methodology Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1847687/ https://www.ncbi.nlm.nih.gov/pubmed/17394669 http://dx.doi.org/10.1186/1471-2105-8-110
work_keys_str_mv	AT hamiltonnicholasa fastautomatedcellphenotypeimageclassification AT pantelicradosavs fastautomatedcellphenotypeimageclassification AT hansonkelly fastautomatedcellphenotypeimageclassification AT teasdalerohand fastautomatedcellphenotypeimageclassification

Fast automated cell phenotype image classification

Ejemplares similares