Cargando…

Uncovering the key dimensions of high-throughput biomolecular data using deep learning

Recent advances in high-throughput single-cell RNA-seq have enabled us to measure thousands of gene expression levels at single-cell resolution. However, the transcriptomic profiles are high-dimensional and sparse in nature. To address it, a deep learning framework based on auto-encoder, termed Deep...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Shixiong, Li, Xiangtao, Lin, Qiuzhen, Lin, Jiecong, Wong, Ka-Chun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7261195/
https://www.ncbi.nlm.nih.gov/pubmed/32232416
http://dx.doi.org/10.1093/nar/gkaa191
_version_ 1783540462434385920
author Zhang, Shixiong
Li, Xiangtao
Lin, Qiuzhen
Lin, Jiecong
Wong, Ka-Chun
author_facet Zhang, Shixiong
Li, Xiangtao
Lin, Qiuzhen
Lin, Jiecong
Wong, Ka-Chun
author_sort Zhang, Shixiong
collection PubMed
description Recent advances in high-throughput single-cell RNA-seq have enabled us to measure thousands of gene expression levels at single-cell resolution. However, the transcriptomic profiles are high-dimensional and sparse in nature. To address it, a deep learning framework based on auto-encoder, termed DeepAE, is proposed to elucidate high-dimensional transcriptomic profiling data in an encode–decode manner. Comparative experiments were conducted on nine transcriptomic profiling datasets to compare DeepAE with four benchmark methods. The results demonstrate that the proposed DeepAE outperforms the benchmark methods with robust performance on uncovering the key dimensions of single-cell RNA-seq data. In addition, we also investigate the performance of DeepAE in other contexts and platforms such as mass cytometry and metabolic profiling in a comprehensive manner. Gene ontology enrichment and pathology analysis are conducted to reveal the mechanisms behind the robust performance of DeepAE by uncovering its key dimensions.
format Online
Article
Text
id pubmed-7261195
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-72611952020-06-03 Uncovering the key dimensions of high-throughput biomolecular data using deep learning Zhang, Shixiong Li, Xiangtao Lin, Qiuzhen Lin, Jiecong Wong, Ka-Chun Nucleic Acids Res Methods Online Recent advances in high-throughput single-cell RNA-seq have enabled us to measure thousands of gene expression levels at single-cell resolution. However, the transcriptomic profiles are high-dimensional and sparse in nature. To address it, a deep learning framework based on auto-encoder, termed DeepAE, is proposed to elucidate high-dimensional transcriptomic profiling data in an encode–decode manner. Comparative experiments were conducted on nine transcriptomic profiling datasets to compare DeepAE with four benchmark methods. The results demonstrate that the proposed DeepAE outperforms the benchmark methods with robust performance on uncovering the key dimensions of single-cell RNA-seq data. In addition, we also investigate the performance of DeepAE in other contexts and platforms such as mass cytometry and metabolic profiling in a comprehensive manner. Gene ontology enrichment and pathology analysis are conducted to reveal the mechanisms behind the robust performance of DeepAE by uncovering its key dimensions. Oxford University Press 2020-06-04 2020-03-31 /pmc/articles/PMC7261195/ /pubmed/32232416 http://dx.doi.org/10.1093/nar/gkaa191 Text en © The Author(s) 2020. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Online
Zhang, Shixiong
Li, Xiangtao
Lin, Qiuzhen
Lin, Jiecong
Wong, Ka-Chun
Uncovering the key dimensions of high-throughput biomolecular data using deep learning
title Uncovering the key dimensions of high-throughput biomolecular data using deep learning
title_full Uncovering the key dimensions of high-throughput biomolecular data using deep learning
title_fullStr Uncovering the key dimensions of high-throughput biomolecular data using deep learning
title_full_unstemmed Uncovering the key dimensions of high-throughput biomolecular data using deep learning
title_short Uncovering the key dimensions of high-throughput biomolecular data using deep learning
title_sort uncovering the key dimensions of high-throughput biomolecular data using deep learning
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7261195/
https://www.ncbi.nlm.nih.gov/pubmed/32232416
http://dx.doi.org/10.1093/nar/gkaa191
work_keys_str_mv AT zhangshixiong uncoveringthekeydimensionsofhighthroughputbiomoleculardatausingdeeplearning
AT lixiangtao uncoveringthekeydimensionsofhighthroughputbiomoleculardatausingdeeplearning
AT linqiuzhen uncoveringthekeydimensionsofhighthroughputbiomoleculardatausingdeeplearning
AT linjiecong uncoveringthekeydimensionsofhighthroughputbiomoleculardatausingdeeplearning
AT wongkachun uncoveringthekeydimensionsofhighthroughputbiomoleculardatausingdeeplearning