Cell morphology-based machine learning models for human cell state classification

Herein, we implement and assess machine learning architectures to identify models that differentiate healthy from apoptotic cells using exclusively forward (FSC) and side (SSC) scatter flow cytometry information. To generate training data, colorectal cancer HCT116 cells were subjected to miR-34a tr...

Bibliographic Details
Main authors: Li, Yi, Nowak, Chance M., Pham, Uyen, Nguyen, Khai, Bleris, Leonidas
Format: Online Article Text
Language: English
Published: Nature Publishing Group UK 2021
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8155075/
https://www.ncbi.nlm.nih.gov/pubmed/34039992
http://dx.doi.org/10.1038/s41540-021-00180-y
_version_ 1783699133735895040
author Li, Yi
Nowak, Chance M.
Pham, Uyen
Nguyen, Khai
Bleris, Leonidas
author_sort Li, Yi
collection PubMed
description Herein, we implement and assess machine learning architectures to identify models that differentiate healthy from apoptotic cells using exclusively forward (FSC) and side (SSC) scatter flow cytometry information. To generate training data, colorectal cancer HCT116 cells were subjected to miR-34a treatment and then classified using a conventional Annexin V/propidium iodide (PI)-staining assay. The apoptotic cells were defined as Annexin V-positive cells, which include early and late apoptotic cells, necrotic cells, as well as other dying or dead cells. In addition to the fluorescent signal, we collected cell size and granularity information from the FSC and SSC parameters. Both parameters are subdivided into area, height, and width, thus providing a total of six numerical features that informed and trained our models. A collection of logistic regression, random forest, k-nearest neighbor, multilayer perceptron, and support vector machine models was trained and tested for classification performance in predicting cell states using only the six aforementioned numerical features. Out of 1046 candidate models, a multilayer perceptron was chosen with 0.91 live precision, 0.93 live recall, 0.92 live F-score, and 0.97 live area under the ROC curve when applied to standardized data. We discuss and highlight differences in classifier performance and compare the results to the standard practice of forward and side scatter gating, typically performed to select cells based on size and/or complexity. We demonstrate that our model, a ready-to-use module for any flow cytometry-based analysis, can provide automated, reliable, and stain-free classification of healthy and apoptotic cells using exclusively size and granularity information.
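The description above mentions standardized data and per-class precision, recall, and F figures for the live class. The following is a minimal sketch of those two steps in plain Python, not the authors' code. It assumes that "standardized" means a per-feature z-score over the six scatter features (area, height, and width for FSC and SSC) and that the reported F figure is the F1 score (the harmonic mean of precision and recall). The function names and the "live"/"apoptotic" label strings are hypothetical.

```python
from statistics import mean, pstdev


def standardize(rows):
    """Z-score each feature column: (x - column mean) / column std.

    Each row is assumed to hold the six scatter features
    (FSC area/height/width, SSC area/height/width) for one cell.
    """
    columns = list(zip(*rows))
    means = [mean(col) for col in columns]
    # Fall back to 1.0 for a constant column to avoid dividing by zero.
    stds = [pstdev(col) or 1.0 for col in columns]
    return [
        [(x - m) / s for x, m, s in zip(row, means, stds)]
        for row in rows
    ]


def f1_score(precision, recall):
    """Harmonic mean of precision and recall (assumed meaning of the F figure)."""
    total = precision + recall
    return 2 * precision * recall / total if total else 0.0


def class_metrics(y_true, y_pred, positive="live"):
    """Precision, recall, and F1 with `positive` treated as the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall, f1_score(precision, recall)


# Consistency check on the reported live-class figures:
# precision 0.91 and recall 0.93 give an F1 that rounds to 0.92.
print(round(f1_score(0.91, 0.93), 2))  # 0.92
```

Under the F1 assumption, the reported live precision (0.91) and live recall (0.93) are consistent with the reported F figure of 0.92.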
format Online
Article
Text
id pubmed-8155075
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-8155075 2021-06-10 Cell morphology-based machine learning models for human cell state classification Li, Yi Nowak, Chance M. Pham, Uyen Nguyen, Khai Bleris, Leonidas NPJ Syst Biol Appl Article
Nature Publishing Group UK 2021-05-26 /pmc/articles/PMC8155075/ /pubmed/34039992 http://dx.doi.org/10.1038/s41540-021-00180-y Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/ Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
title Cell morphology-based machine learning models for human cell state classification
topic Article