Cargando…
Analysis of cardiac single-cell RNA-sequencing data can be improved by the use of artificial-intelligence-based tools
Single-cell RNA sequencing (scRNAseq) enables researchers to identify and characterize populations and subpopulations of different cell types in hearts recovering from myocardial infarction (MI) by characterizing the transcriptomes in thousands of individual cells. However, the effectiveness of the...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10133286/ https://www.ncbi.nlm.nih.gov/pubmed/37100826 http://dx.doi.org/10.1038/s41598-023-32293-1 |
_version_ | 1785031535530868736 |
---|---|
author | Nguyen, Thanh Wei, Yuhua Nakada, Yuji Chen, Jake Y. Zhou, Yang Walcott, Gregory Zhang, Jianyi |
author_facet | Nguyen, Thanh Wei, Yuhua Nakada, Yuji Chen, Jake Y. Zhou, Yang Walcott, Gregory Zhang, Jianyi |
author_sort | Nguyen, Thanh |
collection | PubMed |
description | Single-cell RNA sequencing (scRNAseq) enables researchers to identify and characterize populations and subpopulations of different cell types in hearts recovering from myocardial infarction (MI) by characterizing the transcriptomes in thousands of individual cells. However, the effectiveness of the currently available tools for processing and interpreting these immense datasets is limited. We incorporated three Artificial Intelligence (AI) techniques into a toolkit for evaluating scRNAseq data: AI Autoencoding separates data from different cell types and subpopulations of cell types (cluster analysis); AI Sparse Modeling identifies genes and signaling mechanisms that are differentially activated between subpopulations (pathway/gene set enrichment analysis), and AI Semisupervised Learning tracks the transformation of cells from one subpopulation into another (trajectory analysis). Autoencoding was often used in data denoising; yet, in our pipeline, Autoencoding was exclusively used for cell embedding and clustering. The performance of our AI scRNAseq toolkit and other highly cited non-AI tools was evaluated with three scRNAseq datasets obtained from the Gene Expression Omnibus database. Autoencoder was the only tool to identify differences between the cardiomyocyte subpopulations found in mice that underwent MI or sham-MI surgery on postnatal day (P) 1. Statistically significant differences between cardiomyocytes from P1-MI mice and mice that underwent MI on P8 were identified for six cell-cycle phases and five signaling pathways when the data were analyzed via Sparse Modeling, compared to just one cell-cycle phase and one pathway when the data were analyzed with non-AI techniques. Only Semisupervised Learning detected trajectories between the predominant cardiomyocyte clusters in hearts collected on P28 from pigs that underwent apical resection (AR) on P1, and on P30 from pigs that underwent AR on P1 and MI on P28. In another dataset, the pig scRNAseq data were collected after the injection of CCND2-overexpression Human-induced Pluripotent Stem Cell-derived cardiomyocytes ((CCND2)hiPSC) into injured P28 pig heart; only the AI-based technique could demonstrate that the host cardiomyocytes increase proliferating by through the HIPPO/YAP and MAPK signaling pathways. For the cluster, pathway/gene set enrichment, and trajectory analysis of scRNAseq datasets generated from studies of myocardial regeneration in mice and pigs, our AI-based toolkit identified results that non-AI techniques did not discover. These different results were validated and were important in explaining myocardial regeneration. |
format | Online Article Text |
id | pubmed-10133286 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-101332862023-04-28 Analysis of cardiac single-cell RNA-sequencing data can be improved by the use of artificial-intelligence-based tools Nguyen, Thanh Wei, Yuhua Nakada, Yuji Chen, Jake Y. Zhou, Yang Walcott, Gregory Zhang, Jianyi Sci Rep Article Single-cell RNA sequencing (scRNAseq) enables researchers to identify and characterize populations and subpopulations of different cell types in hearts recovering from myocardial infarction (MI) by characterizing the transcriptomes in thousands of individual cells. However, the effectiveness of the currently available tools for processing and interpreting these immense datasets is limited. We incorporated three Artificial Intelligence (AI) techniques into a toolkit for evaluating scRNAseq data: AI Autoencoding separates data from different cell types and subpopulations of cell types (cluster analysis); AI Sparse Modeling identifies genes and signaling mechanisms that are differentially activated between subpopulations (pathway/gene set enrichment analysis), and AI Semisupervised Learning tracks the transformation of cells from one subpopulation into another (trajectory analysis). Autoencoding was often used in data denoising; yet, in our pipeline, Autoencoding was exclusively used for cell embedding and clustering. The performance of our AI scRNAseq toolkit and other highly cited non-AI tools was evaluated with three scRNAseq datasets obtained from the Gene Expression Omnibus database. Autoencoder was the only tool to identify differences between the cardiomyocyte subpopulations found in mice that underwent MI or sham-MI surgery on postnatal day (P) 1. Statistically significant differences between cardiomyocytes from P1-MI mice and mice that underwent MI on P8 were identified for six cell-cycle phases and five signaling pathways when the data were analyzed via Sparse Modeling, compared to just one cell-cycle phase and one pathway when the data were analyzed with non-AI techniques. Only Semisupervised Learning detected trajectories between the predominant cardiomyocyte clusters in hearts collected on P28 from pigs that underwent apical resection (AR) on P1, and on P30 from pigs that underwent AR on P1 and MI on P28. In another dataset, the pig scRNAseq data were collected after the injection of CCND2-overexpression Human-induced Pluripotent Stem Cell-derived cardiomyocytes ((CCND2)hiPSC) into injured P28 pig heart; only the AI-based technique could demonstrate that the host cardiomyocytes increase proliferating by through the HIPPO/YAP and MAPK signaling pathways. For the cluster, pathway/gene set enrichment, and trajectory analysis of scRNAseq datasets generated from studies of myocardial regeneration in mice and pigs, our AI-based toolkit identified results that non-AI techniques did not discover. These different results were validated and were important in explaining myocardial regeneration. Nature Publishing Group UK 2023-04-26 /pmc/articles/PMC10133286/ /pubmed/37100826 http://dx.doi.org/10.1038/s41598-023-32293-1 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Nguyen, Thanh Wei, Yuhua Nakada, Yuji Chen, Jake Y. Zhou, Yang Walcott, Gregory Zhang, Jianyi Analysis of cardiac single-cell RNA-sequencing data can be improved by the use of artificial-intelligence-based tools |
title | Analysis of cardiac single-cell RNA-sequencing data can be improved by the use of artificial-intelligence-based tools |
title_full | Analysis of cardiac single-cell RNA-sequencing data can be improved by the use of artificial-intelligence-based tools |
title_fullStr | Analysis of cardiac single-cell RNA-sequencing data can be improved by the use of artificial-intelligence-based tools |
title_full_unstemmed | Analysis of cardiac single-cell RNA-sequencing data can be improved by the use of artificial-intelligence-based tools |
title_short | Analysis of cardiac single-cell RNA-sequencing data can be improved by the use of artificial-intelligence-based tools |
title_sort | analysis of cardiac single-cell rna-sequencing data can be improved by the use of artificial-intelligence-based tools |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10133286/ https://www.ncbi.nlm.nih.gov/pubmed/37100826 http://dx.doi.org/10.1038/s41598-023-32293-1 |
work_keys_str_mv | AT nguyenthanh analysisofcardiacsinglecellrnasequencingdatacanbeimprovedbytheuseofartificialintelligencebasedtools AT weiyuhua analysisofcardiacsinglecellrnasequencingdatacanbeimprovedbytheuseofartificialintelligencebasedtools AT nakadayuji analysisofcardiacsinglecellrnasequencingdatacanbeimprovedbytheuseofartificialintelligencebasedtools AT chenjakey analysisofcardiacsinglecellrnasequencingdatacanbeimprovedbytheuseofartificialintelligencebasedtools AT zhouyang analysisofcardiacsinglecellrnasequencingdatacanbeimprovedbytheuseofartificialintelligencebasedtools AT walcottgregory analysisofcardiacsinglecellrnasequencingdatacanbeimprovedbytheuseofartificialintelligencebasedtools AT zhangjianyi analysisofcardiacsinglecellrnasequencingdatacanbeimprovedbytheuseofartificialintelligencebasedtools |