Cargando…

Analysis of cardiac single-cell RNA-sequencing data can be improved by the use of artificial-intelligence-based tools

Single-cell RNA sequencing (scRNAseq) enables researchers to identify and characterize populations and subpopulations of different cell types in hearts recovering from myocardial infarction (MI) by characterizing the transcriptomes in thousands of individual cells. However, the effectiveness of the...

Descripción completa

Detalles Bibliográficos
Autores principales: Nguyen, Thanh, Wei, Yuhua, Nakada, Yuji, Chen, Jake Y., Zhou, Yang, Walcott, Gregory, Zhang, Jianyi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10133286/
https://www.ncbi.nlm.nih.gov/pubmed/37100826
http://dx.doi.org/10.1038/s41598-023-32293-1
_version_ 1785031535530868736
author Nguyen, Thanh
Wei, Yuhua
Nakada, Yuji
Chen, Jake Y.
Zhou, Yang
Walcott, Gregory
Zhang, Jianyi
author_facet Nguyen, Thanh
Wei, Yuhua
Nakada, Yuji
Chen, Jake Y.
Zhou, Yang
Walcott, Gregory
Zhang, Jianyi
author_sort Nguyen, Thanh
collection PubMed
description Single-cell RNA sequencing (scRNAseq) enables researchers to identify and characterize populations and subpopulations of different cell types in hearts recovering from myocardial infarction (MI) by characterizing the transcriptomes in thousands of individual cells. However, the effectiveness of the currently available tools for processing and interpreting these immense datasets is limited. We incorporated three Artificial Intelligence (AI) techniques into a toolkit for evaluating scRNAseq data: AI Autoencoding separates data from different cell types and subpopulations of cell types (cluster analysis); AI Sparse Modeling identifies genes and signaling mechanisms that are differentially activated between subpopulations (pathway/gene set enrichment analysis), and AI Semisupervised Learning tracks the transformation of cells from one subpopulation into another (trajectory analysis). Autoencoding was often used in data denoising; yet, in our pipeline, Autoencoding was exclusively used for cell embedding and clustering. The performance of our AI scRNAseq toolkit and other highly cited non-AI tools was evaluated with three scRNAseq datasets obtained from the Gene Expression Omnibus database. Autoencoder was the only tool to identify differences between the cardiomyocyte subpopulations found in mice that underwent MI or sham-MI surgery on postnatal day (P) 1. Statistically significant differences between cardiomyocytes from P1-MI mice and mice that underwent MI on P8 were identified for six cell-cycle phases and five signaling pathways when the data were analyzed via Sparse Modeling, compared to just one cell-cycle phase and one pathway when the data were analyzed with non-AI techniques. Only Semisupervised Learning detected trajectories between the predominant cardiomyocyte clusters in hearts collected on P28 from pigs that underwent apical resection (AR) on P1, and on P30 from pigs that underwent AR on P1 and MI on P28. In another dataset, the pig scRNAseq data were collected after the injection of CCND2-overexpression Human-induced Pluripotent Stem Cell-derived cardiomyocytes ((CCND2)hiPSC) into injured P28 pig heart; only the AI-based technique could demonstrate that the host cardiomyocytes increase proliferating by through the HIPPO/YAP and MAPK signaling pathways. For the cluster, pathway/gene set enrichment, and trajectory analysis of scRNAseq datasets generated from studies of myocardial regeneration in mice and pigs, our AI-based toolkit identified results that non-AI techniques did not discover. These different results were validated and were important in explaining myocardial regeneration.
format Online
Article
Text
id pubmed-10133286
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-101332862023-04-28 Analysis of cardiac single-cell RNA-sequencing data can be improved by the use of artificial-intelligence-based tools Nguyen, Thanh Wei, Yuhua Nakada, Yuji Chen, Jake Y. Zhou, Yang Walcott, Gregory Zhang, Jianyi Sci Rep Article Single-cell RNA sequencing (scRNAseq) enables researchers to identify and characterize populations and subpopulations of different cell types in hearts recovering from myocardial infarction (MI) by characterizing the transcriptomes in thousands of individual cells. However, the effectiveness of the currently available tools for processing and interpreting these immense datasets is limited. We incorporated three Artificial Intelligence (AI) techniques into a toolkit for evaluating scRNAseq data: AI Autoencoding separates data from different cell types and subpopulations of cell types (cluster analysis); AI Sparse Modeling identifies genes and signaling mechanisms that are differentially activated between subpopulations (pathway/gene set enrichment analysis), and AI Semisupervised Learning tracks the transformation of cells from one subpopulation into another (trajectory analysis). Autoencoding was often used in data denoising; yet, in our pipeline, Autoencoding was exclusively used for cell embedding and clustering. The performance of our AI scRNAseq toolkit and other highly cited non-AI tools was evaluated with three scRNAseq datasets obtained from the Gene Expression Omnibus database. Autoencoder was the only tool to identify differences between the cardiomyocyte subpopulations found in mice that underwent MI or sham-MI surgery on postnatal day (P) 1. Statistically significant differences between cardiomyocytes from P1-MI mice and mice that underwent MI on P8 were identified for six cell-cycle phases and five signaling pathways when the data were analyzed via Sparse Modeling, compared to just one cell-cycle phase and one pathway when the data were analyzed with non-AI techniques. Only Semisupervised Learning detected trajectories between the predominant cardiomyocyte clusters in hearts collected on P28 from pigs that underwent apical resection (AR) on P1, and on P30 from pigs that underwent AR on P1 and MI on P28. In another dataset, the pig scRNAseq data were collected after the injection of CCND2-overexpression Human-induced Pluripotent Stem Cell-derived cardiomyocytes ((CCND2)hiPSC) into injured P28 pig heart; only the AI-based technique could demonstrate that the host cardiomyocytes increase proliferating by through the HIPPO/YAP and MAPK signaling pathways. For the cluster, pathway/gene set enrichment, and trajectory analysis of scRNAseq datasets generated from studies of myocardial regeneration in mice and pigs, our AI-based toolkit identified results that non-AI techniques did not discover. These different results were validated and were important in explaining myocardial regeneration. Nature Publishing Group UK 2023-04-26 /pmc/articles/PMC10133286/ /pubmed/37100826 http://dx.doi.org/10.1038/s41598-023-32293-1 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Nguyen, Thanh
Wei, Yuhua
Nakada, Yuji
Chen, Jake Y.
Zhou, Yang
Walcott, Gregory
Zhang, Jianyi
Analysis of cardiac single-cell RNA-sequencing data can be improved by the use of artificial-intelligence-based tools
title Analysis of cardiac single-cell RNA-sequencing data can be improved by the use of artificial-intelligence-based tools
title_full Analysis of cardiac single-cell RNA-sequencing data can be improved by the use of artificial-intelligence-based tools
title_fullStr Analysis of cardiac single-cell RNA-sequencing data can be improved by the use of artificial-intelligence-based tools
title_full_unstemmed Analysis of cardiac single-cell RNA-sequencing data can be improved by the use of artificial-intelligence-based tools
title_short Analysis of cardiac single-cell RNA-sequencing data can be improved by the use of artificial-intelligence-based tools
title_sort analysis of cardiac single-cell rna-sequencing data can be improved by the use of artificial-intelligence-based tools
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10133286/
https://www.ncbi.nlm.nih.gov/pubmed/37100826
http://dx.doi.org/10.1038/s41598-023-32293-1
work_keys_str_mv AT nguyenthanh analysisofcardiacsinglecellrnasequencingdatacanbeimprovedbytheuseofartificialintelligencebasedtools
AT weiyuhua analysisofcardiacsinglecellrnasequencingdatacanbeimprovedbytheuseofartificialintelligencebasedtools
AT nakadayuji analysisofcardiacsinglecellrnasequencingdatacanbeimprovedbytheuseofartificialintelligencebasedtools
AT chenjakey analysisofcardiacsinglecellrnasequencingdatacanbeimprovedbytheuseofartificialintelligencebasedtools
AT zhouyang analysisofcardiacsinglecellrnasequencingdatacanbeimprovedbytheuseofartificialintelligencebasedtools
AT walcottgregory analysisofcardiacsinglecellrnasequencingdatacanbeimprovedbytheuseofartificialintelligencebasedtools
AT zhangjianyi analysisofcardiacsinglecellrnasequencingdatacanbeimprovedbytheuseofartificialintelligencebasedtools