Cargando…
A benchmark study of deep learning-based multi-omics data fusion methods for cancer
BACKGROUND: A fused method using a combination of multi-omics data enables a comprehensive study of complex biological processes and highlights the interrelationship of relevant biomolecules and their functions. Driven by high-throughput sequencing technologies, several promising deep learning metho...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9361561/ https://www.ncbi.nlm.nih.gov/pubmed/35945544 http://dx.doi.org/10.1186/s13059-022-02739-2 |
_version_ | 1784764553160032256 |
---|---|
author | Leng, Dongjin Zheng, Linyi Wen, Yuqi Zhang, Yunhao Wu, Lianlian Wang, Jing Wang, Meihong Zhang, Zhongnan He, Song Bo, Xiaochen |
author_facet | Leng, Dongjin Zheng, Linyi Wen, Yuqi Zhang, Yunhao Wu, Lianlian Wang, Jing Wang, Meihong Zhang, Zhongnan He, Song Bo, Xiaochen |
author_sort | Leng, Dongjin |
collection | PubMed |
description | BACKGROUND: A fused method using a combination of multi-omics data enables a comprehensive study of complex biological processes and highlights the interrelationship of relevant biomolecules and their functions. Driven by high-throughput sequencing technologies, several promising deep learning methods have been proposed for fusing multi-omics data generated from a large number of samples. RESULTS: In this study, 16 representative deep learning methods are comprehensively evaluated on simulated, single-cell, and cancer multi-omics datasets. For each of the datasets, two tasks are designed: classification and clustering. The classification performance is evaluated by using three benchmarking metrics including accuracy, F1 macro, and F1 weighted. Meanwhile, the clustering performance is evaluated by using four benchmarking metrics including the Jaccard index (JI), C-index, silhouette score, and Davies Bouldin score. For the cancer multi-omics datasets, the methods’ strength in capturing the association of multi-omics dimensionality reduction results with survival and clinical annotations is further evaluated. The benchmarking results indicate that moGAT achieves the best classification performance. Meanwhile, efmmdVAE, efVAE, and lfmmdVAE show the most promising performance across all complementary contexts in clustering tasks. CONCLUSIONS: Our benchmarking results not only provide a reference for biomedical researchers to choose appropriate deep learning-based multi-omics data fusion methods, but also suggest the future directions for the development of more effective multi-omics data fusion methods. The deep learning frameworks are available at https://github.com/zhenglinyi/DL-mo. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-022-02739-2. |
format | Online Article Text |
id | pubmed-9361561 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-93615612022-08-10 A benchmark study of deep learning-based multi-omics data fusion methods for cancer Leng, Dongjin Zheng, Linyi Wen, Yuqi Zhang, Yunhao Wu, Lianlian Wang, Jing Wang, Meihong Zhang, Zhongnan He, Song Bo, Xiaochen Genome Biol Research BACKGROUND: A fused method using a combination of multi-omics data enables a comprehensive study of complex biological processes and highlights the interrelationship of relevant biomolecules and their functions. Driven by high-throughput sequencing technologies, several promising deep learning methods have been proposed for fusing multi-omics data generated from a large number of samples. RESULTS: In this study, 16 representative deep learning methods are comprehensively evaluated on simulated, single-cell, and cancer multi-omics datasets. For each of the datasets, two tasks are designed: classification and clustering. The classification performance is evaluated by using three benchmarking metrics including accuracy, F1 macro, and F1 weighted. Meanwhile, the clustering performance is evaluated by using four benchmarking metrics including the Jaccard index (JI), C-index, silhouette score, and Davies Bouldin score. For the cancer multi-omics datasets, the methods’ strength in capturing the association of multi-omics dimensionality reduction results with survival and clinical annotations is further evaluated. The benchmarking results indicate that moGAT achieves the best classification performance. Meanwhile, efmmdVAE, efVAE, and lfmmdVAE show the most promising performance across all complementary contexts in clustering tasks. CONCLUSIONS: Our benchmarking results not only provide a reference for biomedical researchers to choose appropriate deep learning-based multi-omics data fusion methods, but also suggest the future directions for the development of more effective multi-omics data fusion methods. The deep learning frameworks are available at https://github.com/zhenglinyi/DL-mo. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-022-02739-2. BioMed Central 2022-08-09 /pmc/articles/PMC9361561/ /pubmed/35945544 http://dx.doi.org/10.1186/s13059-022-02739-2 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Leng, Dongjin Zheng, Linyi Wen, Yuqi Zhang, Yunhao Wu, Lianlian Wang, Jing Wang, Meihong Zhang, Zhongnan He, Song Bo, Xiaochen A benchmark study of deep learning-based multi-omics data fusion methods for cancer |
title | A benchmark study of deep learning-based multi-omics data fusion methods for cancer |
title_full | A benchmark study of deep learning-based multi-omics data fusion methods for cancer |
title_fullStr | A benchmark study of deep learning-based multi-omics data fusion methods for cancer |
title_full_unstemmed | A benchmark study of deep learning-based multi-omics data fusion methods for cancer |
title_short | A benchmark study of deep learning-based multi-omics data fusion methods for cancer |
title_sort | benchmark study of deep learning-based multi-omics data fusion methods for cancer |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9361561/ https://www.ncbi.nlm.nih.gov/pubmed/35945544 http://dx.doi.org/10.1186/s13059-022-02739-2 |
work_keys_str_mv | AT lengdongjin abenchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer AT zhenglinyi abenchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer AT wenyuqi abenchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer AT zhangyunhao abenchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer AT wulianlian abenchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer AT wangjing abenchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer AT wangmeihong abenchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer AT zhangzhongnan abenchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer AT hesong abenchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer AT boxiaochen abenchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer AT lengdongjin benchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer AT zhenglinyi benchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer AT wenyuqi benchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer AT zhangyunhao benchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer AT wulianlian benchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer AT wangjing benchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer AT wangmeihong benchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer AT zhangzhongnan benchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer AT hesong benchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer AT boxiaochen benchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer |