Cargando…
Large-Scale Integrative Analysis of Soybean Transcriptome Using an Unsupervised Autoencoder Model
Plant tissues are distinguished by their gene expression patterns, which can help identify tissue-specific highly expressed genes and their differential functional modules. For this purpose, large-scale soybean transcriptome samples were collected and processed starting from raw sequencing reads in...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8927983/ https://www.ncbi.nlm.nih.gov/pubmed/35310659 http://dx.doi.org/10.3389/fpls.2022.831204 |
_version_ | 1784670562463776768 |
---|---|
author | Su, Lingtao Xu, Chunhui Zeng, Shuai Su, Li Joshi, Trupti Stacey, Gary Xu, Dong |
author_facet | Su, Lingtao Xu, Chunhui Zeng, Shuai Su, Li Joshi, Trupti Stacey, Gary Xu, Dong |
author_sort | Su, Lingtao |
collection | PubMed |
description | Plant tissues are distinguished by their gene expression patterns, which can help identify tissue-specific highly expressed genes and their differential functional modules. For this purpose, large-scale soybean transcriptome samples were collected and processed starting from raw sequencing reads in a uniform analysis pipeline. To address the gene expression heterogeneity in different tissues, we utilized an adversarial deconfounding autoencoder (AD-AE) model to map gene expressions into a latent space and adapted a standard unsupervised autoencoder (AE) model to help effectively extract meaningful biological signals from the noisy data. As a result, four groups of 1,743, 914, 2,107, and 1,451 genes were found highly expressed specifically in leaf, root, seed and nodule tissues, respectively. To obtain key transcription factors (TFs), hub genes and their functional modules in each tissue, we constructed tissue-specific gene regulatory networks (GRNs), and differential correlation networks by using corrected and compressed gene expression data. We validated our results from the literature and gene enrichment analysis, which confirmed many identified tissue-specific genes. Our study represents the largest gene expression analysis in soybean tissues to date. It provides valuable targets for tissue-specific research and helps uncover broader biological patterns. Code is publicly available with open source at https://github.com/LingtaoSu/SoyMeta. |
format | Online Article Text |
id | pubmed-8927983 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-89279832022-03-18 Large-Scale Integrative Analysis of Soybean Transcriptome Using an Unsupervised Autoencoder Model Su, Lingtao Xu, Chunhui Zeng, Shuai Su, Li Joshi, Trupti Stacey, Gary Xu, Dong Front Plant Sci Plant Science Plant tissues are distinguished by their gene expression patterns, which can help identify tissue-specific highly expressed genes and their differential functional modules. For this purpose, large-scale soybean transcriptome samples were collected and processed starting from raw sequencing reads in a uniform analysis pipeline. To address the gene expression heterogeneity in different tissues, we utilized an adversarial deconfounding autoencoder (AD-AE) model to map gene expressions into a latent space and adapted a standard unsupervised autoencoder (AE) model to help effectively extract meaningful biological signals from the noisy data. As a result, four groups of 1,743, 914, 2,107, and 1,451 genes were found highly expressed specifically in leaf, root, seed and nodule tissues, respectively. To obtain key transcription factors (TFs), hub genes and their functional modules in each tissue, we constructed tissue-specific gene regulatory networks (GRNs), and differential correlation networks by using corrected and compressed gene expression data. We validated our results from the literature and gene enrichment analysis, which confirmed many identified tissue-specific genes. Our study represents the largest gene expression analysis in soybean tissues to date. It provides valuable targets for tissue-specific research and helps uncover broader biological patterns. Code is publicly available with open source at https://github.com/LingtaoSu/SoyMeta. Frontiers Media S.A. 2022-03-03 /pmc/articles/PMC8927983/ /pubmed/35310659 http://dx.doi.org/10.3389/fpls.2022.831204 Text en Copyright © 2022 Su, Xu, Zeng, Su, Joshi, Stacey and Xu. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Plant Science Su, Lingtao Xu, Chunhui Zeng, Shuai Su, Li Joshi, Trupti Stacey, Gary Xu, Dong Large-Scale Integrative Analysis of Soybean Transcriptome Using an Unsupervised Autoencoder Model |
title | Large-Scale Integrative Analysis of Soybean Transcriptome Using an Unsupervised Autoencoder Model |
title_full | Large-Scale Integrative Analysis of Soybean Transcriptome Using an Unsupervised Autoencoder Model |
title_fullStr | Large-Scale Integrative Analysis of Soybean Transcriptome Using an Unsupervised Autoencoder Model |
title_full_unstemmed | Large-Scale Integrative Analysis of Soybean Transcriptome Using an Unsupervised Autoencoder Model |
title_short | Large-Scale Integrative Analysis of Soybean Transcriptome Using an Unsupervised Autoencoder Model |
title_sort | large-scale integrative analysis of soybean transcriptome using an unsupervised autoencoder model |
topic | Plant Science |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8927983/ https://www.ncbi.nlm.nih.gov/pubmed/35310659 http://dx.doi.org/10.3389/fpls.2022.831204 |
work_keys_str_mv | AT sulingtao largescaleintegrativeanalysisofsoybeantranscriptomeusinganunsupervisedautoencodermodel AT xuchunhui largescaleintegrativeanalysisofsoybeantranscriptomeusinganunsupervisedautoencodermodel AT zengshuai largescaleintegrativeanalysisofsoybeantranscriptomeusinganunsupervisedautoencodermodel AT suli largescaleintegrativeanalysisofsoybeantranscriptomeusinganunsupervisedautoencodermodel AT joshitrupti largescaleintegrativeanalysisofsoybeantranscriptomeusinganunsupervisedautoencodermodel AT staceygary largescaleintegrativeanalysisofsoybeantranscriptomeusinganunsupervisedautoencodermodel AT xudong largescaleintegrativeanalysisofsoybeantranscriptomeusinganunsupervisedautoencodermodel |