Cargando…

Aro: a machine learning approach to identifying single molecules and estimating classification error in fluorescence microscopy images

BACKGROUND: Recent techniques for tagging and visualizing single molecules in fixed or living organisms and cell lines have been revolutionizing our understanding of the spatial and temporal dynamics of fundamental biological processes. However, fluorescence microscopy images are often noisy, and it...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Allison Chia-Yi, Rifkin, Scott A
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4450985/
https://www.ncbi.nlm.nih.gov/pubmed/25880543
http://dx.doi.org/10.1186/s12859-015-0534-z
_version_ 1782374078569512960
author Wu, Allison Chia-Yi
Rifkin, Scott A
author_facet Wu, Allison Chia-Yi
Rifkin, Scott A
author_sort Wu, Allison Chia-Yi
collection PubMed
description BACKGROUND: Recent techniques for tagging and visualizing single molecules in fixed or living organisms and cell lines have been revolutionizing our understanding of the spatial and temporal dynamics of fundamental biological processes. However, fluorescence microscopy images are often noisy, and it can be difficult to distinguish a fluorescently labeled single molecule from background speckle. RESULTS: We present a computational pipeline to distinguish the true signal of fluorescently labeled molecules from background fluorescence and noise. We test our technique using the challenging case of wide-field, epifluorescence microscope image stacks from single molecule fluorescence in situ experiments on nematode embryos where there can be substantial out-of-focus light and structured noise. The software recognizes and classifies individual mRNA spots by measuring several features of local intensity maxima and classifying them with a supervised random forest classifier. A key innovation of this software is that, by estimating the probability that each local maximum is a true spot in a statistically principled way, it makes it possible to estimate the error introduced by image classification. This can be used to assess the quality of the data and to estimate a confidence interval for the molecule count estimate, all of which are important for quantitative interpretations of the results of single-molecule experiments. CONCLUSIONS: The software classifies spots in these images well, with >95% AUROC on realistic artificial data and outperforms other commonly used techniques on challenging real data. Its interval estimates provide a unique measure of the quality of an image and confidence in the classification. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0534-z) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4450985
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-44509852015-06-02 Aro: a machine learning approach to identifying single molecules and estimating classification error in fluorescence microscopy images Wu, Allison Chia-Yi Rifkin, Scott A BMC Bioinformatics Software BACKGROUND: Recent techniques for tagging and visualizing single molecules in fixed or living organisms and cell lines have been revolutionizing our understanding of the spatial and temporal dynamics of fundamental biological processes. However, fluorescence microscopy images are often noisy, and it can be difficult to distinguish a fluorescently labeled single molecule from background speckle. RESULTS: We present a computational pipeline to distinguish the true signal of fluorescently labeled molecules from background fluorescence and noise. We test our technique using the challenging case of wide-field, epifluorescence microscope image stacks from single molecule fluorescence in situ experiments on nematode embryos where there can be substantial out-of-focus light and structured noise. The software recognizes and classifies individual mRNA spots by measuring several features of local intensity maxima and classifying them with a supervised random forest classifier. A key innovation of this software is that, by estimating the probability that each local maximum is a true spot in a statistically principled way, it makes it possible to estimate the error introduced by image classification. This can be used to assess the quality of the data and to estimate a confidence interval for the molecule count estimate, all of which are important for quantitative interpretations of the results of single-molecule experiments. CONCLUSIONS: The software classifies spots in these images well, with >95% AUROC on realistic artificial data and outperforms other commonly used techniques on challenging real data. Its interval estimates provide a unique measure of the quality of an image and confidence in the classification. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0534-z) contains supplementary material, which is available to authorized users. BioMed Central 2015-03-27 /pmc/articles/PMC4450985/ /pubmed/25880543 http://dx.doi.org/10.1186/s12859-015-0534-z Text en © Wu and Rifkin; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Wu, Allison Chia-Yi
Rifkin, Scott A
Aro: a machine learning approach to identifying single molecules and estimating classification error in fluorescence microscopy images
title Aro: a machine learning approach to identifying single molecules and estimating classification error in fluorescence microscopy images
title_full Aro: a machine learning approach to identifying single molecules and estimating classification error in fluorescence microscopy images
title_fullStr Aro: a machine learning approach to identifying single molecules and estimating classification error in fluorescence microscopy images
title_full_unstemmed Aro: a machine learning approach to identifying single molecules and estimating classification error in fluorescence microscopy images
title_short Aro: a machine learning approach to identifying single molecules and estimating classification error in fluorescence microscopy images
title_sort aro: a machine learning approach to identifying single molecules and estimating classification error in fluorescence microscopy images
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4450985/
https://www.ncbi.nlm.nih.gov/pubmed/25880543
http://dx.doi.org/10.1186/s12859-015-0534-z
work_keys_str_mv AT wuallisonchiayi aroamachinelearningapproachtoidentifyingsinglemoleculesandestimatingclassificationerrorinfluorescencemicroscopyimages
AT rifkinscotta aroamachinelearningapproachtoidentifyingsinglemoleculesandestimatingclassificationerrorinfluorescencemicroscopyimages