Cargando…

An open-source solution for advanced imaging flow cytometry data analysis using machine learning

Imaging flow cytometry (IFC) enables the high throughput collection of morphological and spatial information from hundreds of thousands of single cells. This high content, information rich image data can in theory resolve important biological differences among complex, often heterogeneous biological...

Descripción completa

Detalles Bibliográficos
Autores principales: Hennig, Holger, Rees, Paul, Blasi, Thomas, Kamentsky, Lee, Hung, Jane, Dao, David, Carpenter, Anne E., Filby, Andrew
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Academic Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5231320/
https://www.ncbi.nlm.nih.gov/pubmed/27594698
http://dx.doi.org/10.1016/j.ymeth.2016.08.018
_version_ 1782494463956877312
author Hennig, Holger
Rees, Paul
Blasi, Thomas
Kamentsky, Lee
Hung, Jane
Dao, David
Carpenter, Anne E.
Filby, Andrew
author_facet Hennig, Holger
Rees, Paul
Blasi, Thomas
Kamentsky, Lee
Hung, Jane
Dao, David
Carpenter, Anne E.
Filby, Andrew
author_sort Hennig, Holger
collection PubMed
description Imaging flow cytometry (IFC) enables the high throughput collection of morphological and spatial information from hundreds of thousands of single cells. This high content, information rich image data can in theory resolve important biological differences among complex, often heterogeneous biological samples. However, data analysis is often performed in a highly manual and subjective manner using very limited image analysis techniques in combination with conventional flow cytometry gating strategies. This approach is not scalable to the hundreds of available image-based features per cell and thus makes use of only a fraction of the spatial and morphometric information. As a result, the quality, reproducibility and rigour of results are limited by the skill, experience and ingenuity of the data analyst. Here, we describe a pipeline using open-source software that leverages the rich information in digital imagery using machine learning algorithms. Compensated and corrected raw image files (.rif) data files from an imaging flow cytometer (the proprietary .cif file format) are imported into the open-source software CellProfiler, where an image processing pipeline identifies cells and subcellular compartments allowing hundreds of morphological features to be measured. This high-dimensional data can then be analysed using cutting-edge machine learning and clustering approaches using “user-friendly” platforms such as CellProfiler Analyst. Researchers can train an automated cell classifier to recognize different cell types, cell cycle phases, drug treatment/control conditions, etc., using supervised machine learning. This workflow should enable the scientific community to leverage the full analytical power of IFC-derived data sets. It will help to reveal otherwise unappreciated populations of cells based on features that may be hidden to the human eye that include subtle measured differences in label free detection channels such as bright-field and dark-field imagery.
format Online
Article
Text
id pubmed-5231320
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Academic Press
record_format MEDLINE/PubMed
spelling pubmed-52313202017-01-23 An open-source solution for advanced imaging flow cytometry data analysis using machine learning Hennig, Holger Rees, Paul Blasi, Thomas Kamentsky, Lee Hung, Jane Dao, David Carpenter, Anne E. Filby, Andrew Methods Article Imaging flow cytometry (IFC) enables the high throughput collection of morphological and spatial information from hundreds of thousands of single cells. This high content, information rich image data can in theory resolve important biological differences among complex, often heterogeneous biological samples. However, data analysis is often performed in a highly manual and subjective manner using very limited image analysis techniques in combination with conventional flow cytometry gating strategies. This approach is not scalable to the hundreds of available image-based features per cell and thus makes use of only a fraction of the spatial and morphometric information. As a result, the quality, reproducibility and rigour of results are limited by the skill, experience and ingenuity of the data analyst. Here, we describe a pipeline using open-source software that leverages the rich information in digital imagery using machine learning algorithms. Compensated and corrected raw image files (.rif) data files from an imaging flow cytometer (the proprietary .cif file format) are imported into the open-source software CellProfiler, where an image processing pipeline identifies cells and subcellular compartments allowing hundreds of morphological features to be measured. This high-dimensional data can then be analysed using cutting-edge machine learning and clustering approaches using “user-friendly” platforms such as CellProfiler Analyst. Researchers can train an automated cell classifier to recognize different cell types, cell cycle phases, drug treatment/control conditions, etc., using supervised machine learning. This workflow should enable the scientific community to leverage the full analytical power of IFC-derived data sets. It will help to reveal otherwise unappreciated populations of cells based on features that may be hidden to the human eye that include subtle measured differences in label free detection channels such as bright-field and dark-field imagery. Academic Press 2017-01-01 /pmc/articles/PMC5231320/ /pubmed/27594698 http://dx.doi.org/10.1016/j.ymeth.2016.08.018 Text en © 2016 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Hennig, Holger
Rees, Paul
Blasi, Thomas
Kamentsky, Lee
Hung, Jane
Dao, David
Carpenter, Anne E.
Filby, Andrew
An open-source solution for advanced imaging flow cytometry data analysis using machine learning
title An open-source solution for advanced imaging flow cytometry data analysis using machine learning
title_full An open-source solution for advanced imaging flow cytometry data analysis using machine learning
title_fullStr An open-source solution for advanced imaging flow cytometry data analysis using machine learning
title_full_unstemmed An open-source solution for advanced imaging flow cytometry data analysis using machine learning
title_short An open-source solution for advanced imaging flow cytometry data analysis using machine learning
title_sort open-source solution for advanced imaging flow cytometry data analysis using machine learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5231320/
https://www.ncbi.nlm.nih.gov/pubmed/27594698
http://dx.doi.org/10.1016/j.ymeth.2016.08.018
work_keys_str_mv AT hennigholger anopensourcesolutionforadvancedimagingflowcytometrydataanalysisusingmachinelearning
AT reespaul anopensourcesolutionforadvancedimagingflowcytometrydataanalysisusingmachinelearning
AT blasithomas anopensourcesolutionforadvancedimagingflowcytometrydataanalysisusingmachinelearning
AT kamentskylee anopensourcesolutionforadvancedimagingflowcytometrydataanalysisusingmachinelearning
AT hungjane anopensourcesolutionforadvancedimagingflowcytometrydataanalysisusingmachinelearning
AT daodavid anopensourcesolutionforadvancedimagingflowcytometrydataanalysisusingmachinelearning
AT carpenterannee anopensourcesolutionforadvancedimagingflowcytometrydataanalysisusingmachinelearning
AT filbyandrew anopensourcesolutionforadvancedimagingflowcytometrydataanalysisusingmachinelearning
AT hennigholger opensourcesolutionforadvancedimagingflowcytometrydataanalysisusingmachinelearning
AT reespaul opensourcesolutionforadvancedimagingflowcytometrydataanalysisusingmachinelearning
AT blasithomas opensourcesolutionforadvancedimagingflowcytometrydataanalysisusingmachinelearning
AT kamentskylee opensourcesolutionforadvancedimagingflowcytometrydataanalysisusingmachinelearning
AT hungjane opensourcesolutionforadvancedimagingflowcytometrydataanalysisusingmachinelearning
AT daodavid opensourcesolutionforadvancedimagingflowcytometrydataanalysisusingmachinelearning
AT carpenterannee opensourcesolutionforadvancedimagingflowcytometrydataanalysisusingmachinelearning
AT filbyandrew opensourcesolutionforadvancedimagingflowcytometrydataanalysisusingmachinelearning