Cargando…
Aro: a machine learning approach to identifying single molecules and estimating classification error in fluorescence microscopy images
BACKGROUND: Recent techniques for tagging and visualizing single molecules in fixed or living organisms and cell lines have been revolutionizing our understanding of the spatial and temporal dynamics of fundamental biological processes. However, fluorescence microscopy images are often noisy, and it...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4450985/ https://www.ncbi.nlm.nih.gov/pubmed/25880543 http://dx.doi.org/10.1186/s12859-015-0534-z |
_version_ | 1782374078569512960 |
---|---|
author | Wu, Allison Chia-Yi Rifkin, Scott A |
author_facet | Wu, Allison Chia-Yi Rifkin, Scott A |
author_sort | Wu, Allison Chia-Yi |
collection | PubMed |
description | BACKGROUND: Recent techniques for tagging and visualizing single molecules in fixed or living organisms and cell lines have been revolutionizing our understanding of the spatial and temporal dynamics of fundamental biological processes. However, fluorescence microscopy images are often noisy, and it can be difficult to distinguish a fluorescently labeled single molecule from background speckle. RESULTS: We present a computational pipeline to distinguish the true signal of fluorescently labeled molecules from background fluorescence and noise. We test our technique using the challenging case of wide-field, epifluorescence microscope image stacks from single molecule fluorescence in situ experiments on nematode embryos where there can be substantial out-of-focus light and structured noise. The software recognizes and classifies individual mRNA spots by measuring several features of local intensity maxima and classifying them with a supervised random forest classifier. A key innovation of this software is that, by estimating the probability that each local maximum is a true spot in a statistically principled way, it makes it possible to estimate the error introduced by image classification. This can be used to assess the quality of the data and to estimate a confidence interval for the molecule count estimate, all of which are important for quantitative interpretations of the results of single-molecule experiments. CONCLUSIONS: The software classifies spots in these images well, with >95% AUROC on realistic artificial data and outperforms other commonly used techniques on challenging real data. Its interval estimates provide a unique measure of the quality of an image and confidence in the classification. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0534-z) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4450985 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-44509852015-06-02 Aro: a machine learning approach to identifying single molecules and estimating classification error in fluorescence microscopy images Wu, Allison Chia-Yi Rifkin, Scott A BMC Bioinformatics Software BACKGROUND: Recent techniques for tagging and visualizing single molecules in fixed or living organisms and cell lines have been revolutionizing our understanding of the spatial and temporal dynamics of fundamental biological processes. However, fluorescence microscopy images are often noisy, and it can be difficult to distinguish a fluorescently labeled single molecule from background speckle. RESULTS: We present a computational pipeline to distinguish the true signal of fluorescently labeled molecules from background fluorescence and noise. We test our technique using the challenging case of wide-field, epifluorescence microscope image stacks from single molecule fluorescence in situ experiments on nematode embryos where there can be substantial out-of-focus light and structured noise. The software recognizes and classifies individual mRNA spots by measuring several features of local intensity maxima and classifying them with a supervised random forest classifier. A key innovation of this software is that, by estimating the probability that each local maximum is a true spot in a statistically principled way, it makes it possible to estimate the error introduced by image classification. This can be used to assess the quality of the data and to estimate a confidence interval for the molecule count estimate, all of which are important for quantitative interpretations of the results of single-molecule experiments. CONCLUSIONS: The software classifies spots in these images well, with >95% AUROC on realistic artificial data and outperforms other commonly used techniques on challenging real data. Its interval estimates provide a unique measure of the quality of an image and confidence in the classification. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0534-z) contains supplementary material, which is available to authorized users. BioMed Central 2015-03-27 /pmc/articles/PMC4450985/ /pubmed/25880543 http://dx.doi.org/10.1186/s12859-015-0534-z Text en © Wu and Rifkin; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Software Wu, Allison Chia-Yi Rifkin, Scott A Aro: a machine learning approach to identifying single molecules and estimating classification error in fluorescence microscopy images |
title | Aro: a machine learning approach to identifying single molecules and estimating classification error in fluorescence microscopy images |
title_full | Aro: a machine learning approach to identifying single molecules and estimating classification error in fluorescence microscopy images |
title_fullStr | Aro: a machine learning approach to identifying single molecules and estimating classification error in fluorescence microscopy images |
title_full_unstemmed | Aro: a machine learning approach to identifying single molecules and estimating classification error in fluorescence microscopy images |
title_short | Aro: a machine learning approach to identifying single molecules and estimating classification error in fluorescence microscopy images |
title_sort | aro: a machine learning approach to identifying single molecules and estimating classification error in fluorescence microscopy images |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4450985/ https://www.ncbi.nlm.nih.gov/pubmed/25880543 http://dx.doi.org/10.1186/s12859-015-0534-z |
work_keys_str_mv | AT wuallisonchiayi aroamachinelearningapproachtoidentifyingsinglemoleculesandestimatingclassificationerrorinfluorescencemicroscopyimages AT rifkinscotta aroamachinelearningapproachtoidentifyingsinglemoleculesandestimatingclassificationerrorinfluorescencemicroscopyimages |