Cargando…

Using machine learning to speed up manual image annotation: application to a 3D imaging protocol for measuring single cell gene expression in the developing C. elegans embryo

BACKGROUND: Image analysis is an essential component in many biological experiments that study gene expression, cell cycle progression, and protein localization. A protocol for tracking the expression of individual C. elegans genes was developed that collects image samples of a developing embryo by...

Descripción completa

Detalles Bibliográficos
Autores principales: Aydin, Zafer, Murray, John I, Waterston, Robert H, Noble, William S
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2838868/
https://www.ncbi.nlm.nih.gov/pubmed/20146825
http://dx.doi.org/10.1186/1471-2105-11-84
_version_ 1782178904626167808
author Aydin, Zafer
Murray, John I
Waterston, Robert H
Noble, William S
author_facet Aydin, Zafer
Murray, John I
Waterston, Robert H
Noble, William S
author_sort Aydin, Zafer
collection PubMed
description BACKGROUND: Image analysis is an essential component in many biological experiments that study gene expression, cell cycle progression, and protein localization. A protocol for tracking the expression of individual C. elegans genes was developed that collects image samples of a developing embryo by 3-D time lapse microscopy. In this protocol, a program called StarryNite performs the automatic recognition of fluorescently labeled cells and traces their lineage. However, due to the amount of noise present in the data and due to the challenges introduced by increasing number of cells in later stages of development, this program is not error free. In the current version, the error correction (i.e., editing) is performed manually using a graphical interface tool named AceTree, which is specifically developed for this task. For a single experiment, this manual annotation task takes several hours. RESULTS: In this paper, we reduce the time required to correct errors made by StarryNite. We target one of the most frequent error types (movements annotated as divisions) and train a support vector machine (SVM) classifier to decide whether a division call made by StarryNite is correct or not. We show, via cross-validation experiments on several benchmark data sets, that the SVM successfully identifies this type of error significantly. A new version of StarryNite that includes the trained SVM classifier is available at http://starrynite.sourceforge.net. CONCLUSIONS: We demonstrate the utility of a machine learning approach to error annotation for StarryNite. In the process, we also provide some general methodologies for developing and validating a classifier with respect to a given pattern recognition task.
format Text
id pubmed-2838868
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28388682010-03-16 Using machine learning to speed up manual image annotation: application to a 3D imaging protocol for measuring single cell gene expression in the developing C. elegans embryo Aydin, Zafer Murray, John I Waterston, Robert H Noble, William S BMC Bioinformatics Research article BACKGROUND: Image analysis is an essential component in many biological experiments that study gene expression, cell cycle progression, and protein localization. A protocol for tracking the expression of individual C. elegans genes was developed that collects image samples of a developing embryo by 3-D time lapse microscopy. In this protocol, a program called StarryNite performs the automatic recognition of fluorescently labeled cells and traces their lineage. However, due to the amount of noise present in the data and due to the challenges introduced by increasing number of cells in later stages of development, this program is not error free. In the current version, the error correction (i.e., editing) is performed manually using a graphical interface tool named AceTree, which is specifically developed for this task. For a single experiment, this manual annotation task takes several hours. RESULTS: In this paper, we reduce the time required to correct errors made by StarryNite. We target one of the most frequent error types (movements annotated as divisions) and train a support vector machine (SVM) classifier to decide whether a division call made by StarryNite is correct or not. We show, via cross-validation experiments on several benchmark data sets, that the SVM successfully identifies this type of error significantly. A new version of StarryNite that includes the trained SVM classifier is available at http://starrynite.sourceforge.net. CONCLUSIONS: We demonstrate the utility of a machine learning approach to error annotation for StarryNite. In the process, we also provide some general methodologies for developing and validating a classifier with respect to a given pattern recognition task. BioMed Central 2010-02-11 /pmc/articles/PMC2838868/ /pubmed/20146825 http://dx.doi.org/10.1186/1471-2105-11-84 Text en Copyright ©2010 Aydin et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research article
Aydin, Zafer
Murray, John I
Waterston, Robert H
Noble, William S
Using machine learning to speed up manual image annotation: application to a 3D imaging protocol for measuring single cell gene expression in the developing C. elegans embryo
title Using machine learning to speed up manual image annotation: application to a 3D imaging protocol for measuring single cell gene expression in the developing C. elegans embryo
title_full Using machine learning to speed up manual image annotation: application to a 3D imaging protocol for measuring single cell gene expression in the developing C. elegans embryo
title_fullStr Using machine learning to speed up manual image annotation: application to a 3D imaging protocol for measuring single cell gene expression in the developing C. elegans embryo
title_full_unstemmed Using machine learning to speed up manual image annotation: application to a 3D imaging protocol for measuring single cell gene expression in the developing C. elegans embryo
title_short Using machine learning to speed up manual image annotation: application to a 3D imaging protocol for measuring single cell gene expression in the developing C. elegans embryo
title_sort using machine learning to speed up manual image annotation: application to a 3d imaging protocol for measuring single cell gene expression in the developing c. elegans embryo
topic Research article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2838868/
https://www.ncbi.nlm.nih.gov/pubmed/20146825
http://dx.doi.org/10.1186/1471-2105-11-84
work_keys_str_mv AT aydinzafer usingmachinelearningtospeedupmanualimageannotationapplicationtoa3dimagingprotocolformeasuringsinglecellgeneexpressioninthedevelopingcelegansembryo
AT murrayjohni usingmachinelearningtospeedupmanualimageannotationapplicationtoa3dimagingprotocolformeasuringsinglecellgeneexpressioninthedevelopingcelegansembryo
AT waterstonroberth usingmachinelearningtospeedupmanualimageannotationapplicationtoa3dimagingprotocolformeasuringsinglecellgeneexpressioninthedevelopingcelegansembryo
AT noblewilliams usingmachinelearningtospeedupmanualimageannotationapplicationtoa3dimagingprotocolformeasuringsinglecellgeneexpressioninthedevelopingcelegansembryo