Cargando…

A novel machine learning based approach for iPS progenitor cell identification

Identification of induced pluripotent stem (iPS) progenitor cells, the iPS forming cells in early stage of reprogramming, could provide valuable information for studying the origin and underlying mechanism of iPS cells. However, it is very difficult to identify experimentally since there are no biom...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Haishan, Shao, Ximing, Peng, Yin, Teng, Yanning, Saravanan, Konda Mani, Zhang, Huiling, Li, Hongchang, Wei, Yanjie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6932749/
https://www.ncbi.nlm.nih.gov/pubmed/31877128
http://dx.doi.org/10.1371/journal.pcbi.1007351
_version_ 1783483067082473472
author Zhang, Haishan
Shao, Ximing
Peng, Yin
Teng, Yanning
Saravanan, Konda Mani
Zhang, Huiling
Li, Hongchang
Wei, Yanjie
author_facet Zhang, Haishan
Shao, Ximing
Peng, Yin
Teng, Yanning
Saravanan, Konda Mani
Zhang, Huiling
Li, Hongchang
Wei, Yanjie
author_sort Zhang, Haishan
collection PubMed
description Identification of induced pluripotent stem (iPS) progenitor cells, the iPS forming cells in early stage of reprogramming, could provide valuable information for studying the origin and underlying mechanism of iPS cells. However, it is very difficult to identify experimentally since there are no biomarkers known for early progenitor cells, and only about 6 days after reprogramming initiation, iPS cells can be experimentally determined via fluorescent probes. What is more, the ratio of progenitor cells during early reprograming period is below 5%, which is too low to capture experimentally in the early stage. In this paper, we propose a novel computational approach for the identification of iPS progenitor cells based on machine learning and microscopic image analysis. Firstly, we record the reprogramming process using a live cell imaging system after 48 hours of infection with retroviruses expressing Oct4, Sox2 and Klf4, later iPS progenitor cells and normal murine embryonic fibroblasts (MEFs) within 3 to 5 days after infection are labeled by retrospectively tracing the time-lapse microscopic image. We then calculate 11 types of cell morphological and motion features such as area, speed, etc., and select best time windows for modeling and perform feature selection. Finally, a prediction model using XGBoost is built based on the selected six types of features and best time windows. Our model allows several missing values/frames in the sample datasets, thus it is applicable to a wide range of scenarios. Cross-validation, holdout validation and independent test experiments show that the minimum precision is above 52%, that is, the ratio of predicted progenitor cells within 3 to 5 days after viral infection is above 52%. The results also confirm that the morphology and motion pattern of iPS progenitor cells is different from that of normal MEFs, which helps with the machine learning methods for iPS progenitor cell identification.
format Online
Article
Text
id pubmed-6932749
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-69327492020-01-07 A novel machine learning based approach for iPS progenitor cell identification Zhang, Haishan Shao, Ximing Peng, Yin Teng, Yanning Saravanan, Konda Mani Zhang, Huiling Li, Hongchang Wei, Yanjie PLoS Comput Biol Research Article Identification of induced pluripotent stem (iPS) progenitor cells, the iPS forming cells in early stage of reprogramming, could provide valuable information for studying the origin and underlying mechanism of iPS cells. However, it is very difficult to identify experimentally since there are no biomarkers known for early progenitor cells, and only about 6 days after reprogramming initiation, iPS cells can be experimentally determined via fluorescent probes. What is more, the ratio of progenitor cells during early reprograming period is below 5%, which is too low to capture experimentally in the early stage. In this paper, we propose a novel computational approach for the identification of iPS progenitor cells based on machine learning and microscopic image analysis. Firstly, we record the reprogramming process using a live cell imaging system after 48 hours of infection with retroviruses expressing Oct4, Sox2 and Klf4, later iPS progenitor cells and normal murine embryonic fibroblasts (MEFs) within 3 to 5 days after infection are labeled by retrospectively tracing the time-lapse microscopic image. We then calculate 11 types of cell morphological and motion features such as area, speed, etc., and select best time windows for modeling and perform feature selection. Finally, a prediction model using XGBoost is built based on the selected six types of features and best time windows. Our model allows several missing values/frames in the sample datasets, thus it is applicable to a wide range of scenarios. Cross-validation, holdout validation and independent test experiments show that the minimum precision is above 52%, that is, the ratio of predicted progenitor cells within 3 to 5 days after viral infection is above 52%. The results also confirm that the morphology and motion pattern of iPS progenitor cells is different from that of normal MEFs, which helps with the machine learning methods for iPS progenitor cell identification. Public Library of Science 2019-12-26 /pmc/articles/PMC6932749/ /pubmed/31877128 http://dx.doi.org/10.1371/journal.pcbi.1007351 Text en © 2019 Zhang et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Zhang, Haishan
Shao, Ximing
Peng, Yin
Teng, Yanning
Saravanan, Konda Mani
Zhang, Huiling
Li, Hongchang
Wei, Yanjie
A novel machine learning based approach for iPS progenitor cell identification
title A novel machine learning based approach for iPS progenitor cell identification
title_full A novel machine learning based approach for iPS progenitor cell identification
title_fullStr A novel machine learning based approach for iPS progenitor cell identification
title_full_unstemmed A novel machine learning based approach for iPS progenitor cell identification
title_short A novel machine learning based approach for iPS progenitor cell identification
title_sort novel machine learning based approach for ips progenitor cell identification
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6932749/
https://www.ncbi.nlm.nih.gov/pubmed/31877128
http://dx.doi.org/10.1371/journal.pcbi.1007351
work_keys_str_mv AT zhanghaishan anovelmachinelearningbasedapproachforipsprogenitorcellidentification
AT shaoximing anovelmachinelearningbasedapproachforipsprogenitorcellidentification
AT pengyin anovelmachinelearningbasedapproachforipsprogenitorcellidentification
AT tengyanning anovelmachinelearningbasedapproachforipsprogenitorcellidentification
AT saravanankondamani anovelmachinelearningbasedapproachforipsprogenitorcellidentification
AT zhanghuiling anovelmachinelearningbasedapproachforipsprogenitorcellidentification
AT lihongchang anovelmachinelearningbasedapproachforipsprogenitorcellidentification
AT weiyanjie anovelmachinelearningbasedapproachforipsprogenitorcellidentification
AT zhanghaishan novelmachinelearningbasedapproachforipsprogenitorcellidentification
AT shaoximing novelmachinelearningbasedapproachforipsprogenitorcellidentification
AT pengyin novelmachinelearningbasedapproachforipsprogenitorcellidentification
AT tengyanning novelmachinelearningbasedapproachforipsprogenitorcellidentification
AT saravanankondamani novelmachinelearningbasedapproachforipsprogenitorcellidentification
AT zhanghuiling novelmachinelearningbasedapproachforipsprogenitorcellidentification
AT lihongchang novelmachinelearningbasedapproachforipsprogenitorcellidentification
AT weiyanjie novelmachinelearningbasedapproachforipsprogenitorcellidentification