Cargando…

Machine learning identifies abnormal Ca(2+)transients in human induced pluripotent stem cell-derived cardiomyocytes

Human-induced pluripotent stem cell-derived cardiomyocytes (hiPSC-CMs) provide an excellent platform for potential clinical and research applications. Identifying abnormal Ca(2+) transients is crucial for evaluating cardiomyocyte function that requires labor-intensive manual effort. Therefore, we de...

Descripción completa

Detalles Bibliográficos
Autores principales: Hwang, Hyun, Liu, Rui, Maxwell, Joshua T., Yang, Jingjing, Xu, Chunhui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7550597/
https://www.ncbi.nlm.nih.gov/pubmed/33046816
http://dx.doi.org/10.1038/s41598-020-73801-x
Descripción
Sumario:Human-induced pluripotent stem cell-derived cardiomyocytes (hiPSC-CMs) provide an excellent platform for potential clinical and research applications. Identifying abnormal Ca(2+) transients is crucial for evaluating cardiomyocyte function that requires labor-intensive manual effort. Therefore, we develop an analytical pipeline for automatic assessment of Ca(2+) transient abnormality, by employing advanced machine learning methods together with an Analytical Algorithm. First, we adapt an existing Analytical Algorithm to identify Ca(2+) transient peaks and determine peak abnormality based on quantified peak characteristics. Second, we train a peak-level Support Vector Machine (SVM) classifier by using human-expert assessment of peak abnormality as outcome and profiled peak variables as predictive features. Third, we train another cell-level SVM classifier by using human-expert assessment of cell abnormality as outcome and quantified cell-level variables as predictive features. This cell-level SVM classifier can be used to assess additional Ca(2+) transient signals. By applying this pipeline to our Ca(2+) transient data, we trained a cell-level SVM classifier using 200 cells as training data, then tested its accuracy in an independent dataset of 54 cells. As a result, we obtained 88% training accuracy and 87% test accuracy. Further, we provide a free R package to implement our pipeline for high-throughput CM Ca(2+) analysis.