Cargando…

Large-Scale Integrative Analysis of Soybean Transcriptome Using an Unsupervised Autoencoder Model

Plant tissues are distinguished by their gene expression patterns, which can help identify tissue-specific highly expressed genes and their differential functional modules. For this purpose, large-scale soybean transcriptome samples were collected and processed starting from raw sequencing reads in...

Descripción completa

Detalles Bibliográficos
Autores principales: Su, Lingtao, Xu, Chunhui, Zeng, Shuai, Su, Li, Joshi, Trupti, Stacey, Gary, Xu, Dong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8927983/
https://www.ncbi.nlm.nih.gov/pubmed/35310659
http://dx.doi.org/10.3389/fpls.2022.831204
_version_ 1784670562463776768
author Su, Lingtao
Xu, Chunhui
Zeng, Shuai
Su, Li
Joshi, Trupti
Stacey, Gary
Xu, Dong
author_facet Su, Lingtao
Xu, Chunhui
Zeng, Shuai
Su, Li
Joshi, Trupti
Stacey, Gary
Xu, Dong
author_sort Su, Lingtao
collection PubMed
description Plant tissues are distinguished by their gene expression patterns, which can help identify tissue-specific highly expressed genes and their differential functional modules. For this purpose, large-scale soybean transcriptome samples were collected and processed starting from raw sequencing reads in a uniform analysis pipeline. To address the gene expression heterogeneity in different tissues, we utilized an adversarial deconfounding autoencoder (AD-AE) model to map gene expressions into a latent space and adapted a standard unsupervised autoencoder (AE) model to help effectively extract meaningful biological signals from the noisy data. As a result, four groups of 1,743, 914, 2,107, and 1,451 genes were found highly expressed specifically in leaf, root, seed and nodule tissues, respectively. To obtain key transcription factors (TFs), hub genes and their functional modules in each tissue, we constructed tissue-specific gene regulatory networks (GRNs), and differential correlation networks by using corrected and compressed gene expression data. We validated our results from the literature and gene enrichment analysis, which confirmed many identified tissue-specific genes. Our study represents the largest gene expression analysis in soybean tissues to date. It provides valuable targets for tissue-specific research and helps uncover broader biological patterns. Code is publicly available with open source at https://github.com/LingtaoSu/SoyMeta.
format Online
Article
Text
id pubmed-8927983
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-89279832022-03-18 Large-Scale Integrative Analysis of Soybean Transcriptome Using an Unsupervised Autoencoder Model Su, Lingtao Xu, Chunhui Zeng, Shuai Su, Li Joshi, Trupti Stacey, Gary Xu, Dong Front Plant Sci Plant Science Plant tissues are distinguished by their gene expression patterns, which can help identify tissue-specific highly expressed genes and their differential functional modules. For this purpose, large-scale soybean transcriptome samples were collected and processed starting from raw sequencing reads in a uniform analysis pipeline. To address the gene expression heterogeneity in different tissues, we utilized an adversarial deconfounding autoencoder (AD-AE) model to map gene expressions into a latent space and adapted a standard unsupervised autoencoder (AE) model to help effectively extract meaningful biological signals from the noisy data. As a result, four groups of 1,743, 914, 2,107, and 1,451 genes were found highly expressed specifically in leaf, root, seed and nodule tissues, respectively. To obtain key transcription factors (TFs), hub genes and their functional modules in each tissue, we constructed tissue-specific gene regulatory networks (GRNs), and differential correlation networks by using corrected and compressed gene expression data. We validated our results from the literature and gene enrichment analysis, which confirmed many identified tissue-specific genes. Our study represents the largest gene expression analysis in soybean tissues to date. It provides valuable targets for tissue-specific research and helps uncover broader biological patterns. Code is publicly available with open source at https://github.com/LingtaoSu/SoyMeta. Frontiers Media S.A. 2022-03-03 /pmc/articles/PMC8927983/ /pubmed/35310659 http://dx.doi.org/10.3389/fpls.2022.831204 Text en Copyright © 2022 Su, Xu, Zeng, Su, Joshi, Stacey and Xu. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Plant Science
Su, Lingtao
Xu, Chunhui
Zeng, Shuai
Su, Li
Joshi, Trupti
Stacey, Gary
Xu, Dong
Large-Scale Integrative Analysis of Soybean Transcriptome Using an Unsupervised Autoencoder Model
title Large-Scale Integrative Analysis of Soybean Transcriptome Using an Unsupervised Autoencoder Model
title_full Large-Scale Integrative Analysis of Soybean Transcriptome Using an Unsupervised Autoencoder Model
title_fullStr Large-Scale Integrative Analysis of Soybean Transcriptome Using an Unsupervised Autoencoder Model
title_full_unstemmed Large-Scale Integrative Analysis of Soybean Transcriptome Using an Unsupervised Autoencoder Model
title_short Large-Scale Integrative Analysis of Soybean Transcriptome Using an Unsupervised Autoencoder Model
title_sort large-scale integrative analysis of soybean transcriptome using an unsupervised autoencoder model
topic Plant Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8927983/
https://www.ncbi.nlm.nih.gov/pubmed/35310659
http://dx.doi.org/10.3389/fpls.2022.831204
work_keys_str_mv AT sulingtao largescaleintegrativeanalysisofsoybeantranscriptomeusinganunsupervisedautoencodermodel
AT xuchunhui largescaleintegrativeanalysisofsoybeantranscriptomeusinganunsupervisedautoencodermodel
AT zengshuai largescaleintegrativeanalysisofsoybeantranscriptomeusinganunsupervisedautoencodermodel
AT suli largescaleintegrativeanalysisofsoybeantranscriptomeusinganunsupervisedautoencodermodel
AT joshitrupti largescaleintegrativeanalysisofsoybeantranscriptomeusinganunsupervisedautoencodermodel
AT staceygary largescaleintegrativeanalysisofsoybeantranscriptomeusinganunsupervisedautoencodermodel
AT xudong largescaleintegrativeanalysisofsoybeantranscriptomeusinganunsupervisedautoencodermodel