Cargando…

A benchmark study of deep learning-based multi-omics data fusion methods for cancer

BACKGROUND: A fused method using a combination of multi-omics data enables a comprehensive study of complex biological processes and highlights the interrelationship of relevant biomolecules and their functions. Driven by high-throughput sequencing technologies, several promising deep learning metho...

Descripción completa

Detalles Bibliográficos
Autores principales: Leng, Dongjin, Zheng, Linyi, Wen, Yuqi, Zhang, Yunhao, Wu, Lianlian, Wang, Jing, Wang, Meihong, Zhang, Zhongnan, He, Song, Bo, Xiaochen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9361561/
https://www.ncbi.nlm.nih.gov/pubmed/35945544
http://dx.doi.org/10.1186/s13059-022-02739-2
_version_ 1784764553160032256
author Leng, Dongjin
Zheng, Linyi
Wen, Yuqi
Zhang, Yunhao
Wu, Lianlian
Wang, Jing
Wang, Meihong
Zhang, Zhongnan
He, Song
Bo, Xiaochen
author_facet Leng, Dongjin
Zheng, Linyi
Wen, Yuqi
Zhang, Yunhao
Wu, Lianlian
Wang, Jing
Wang, Meihong
Zhang, Zhongnan
He, Song
Bo, Xiaochen
author_sort Leng, Dongjin
collection PubMed
description BACKGROUND: A fused method using a combination of multi-omics data enables a comprehensive study of complex biological processes and highlights the interrelationship of relevant biomolecules and their functions. Driven by high-throughput sequencing technologies, several promising deep learning methods have been proposed for fusing multi-omics data generated from a large number of samples. RESULTS: In this study, 16 representative deep learning methods are comprehensively evaluated on simulated, single-cell, and cancer multi-omics datasets. For each of the datasets, two tasks are designed: classification and clustering. The classification performance is evaluated by using three benchmarking metrics including accuracy, F1 macro, and F1 weighted. Meanwhile, the clustering performance is evaluated by using four benchmarking metrics including the Jaccard index (JI), C-index, silhouette score, and Davies Bouldin score. For the cancer multi-omics datasets, the methods’ strength in capturing the association of multi-omics dimensionality reduction results with survival and clinical annotations is further evaluated. The benchmarking results indicate that moGAT achieves the best classification performance. Meanwhile, efmmdVAE, efVAE, and lfmmdVAE show the most promising performance across all complementary contexts in clustering tasks. CONCLUSIONS: Our benchmarking results not only provide a reference for biomedical researchers to choose appropriate deep learning-based multi-omics data fusion methods, but also suggest the future directions for the development of more effective multi-omics data fusion methods. The deep learning frameworks are available at https://github.com/zhenglinyi/DL-mo. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-022-02739-2.
format Online
Article
Text
id pubmed-9361561
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-93615612022-08-10 A benchmark study of deep learning-based multi-omics data fusion methods for cancer Leng, Dongjin Zheng, Linyi Wen, Yuqi Zhang, Yunhao Wu, Lianlian Wang, Jing Wang, Meihong Zhang, Zhongnan He, Song Bo, Xiaochen Genome Biol Research BACKGROUND: A fused method using a combination of multi-omics data enables a comprehensive study of complex biological processes and highlights the interrelationship of relevant biomolecules and their functions. Driven by high-throughput sequencing technologies, several promising deep learning methods have been proposed for fusing multi-omics data generated from a large number of samples. RESULTS: In this study, 16 representative deep learning methods are comprehensively evaluated on simulated, single-cell, and cancer multi-omics datasets. For each of the datasets, two tasks are designed: classification and clustering. The classification performance is evaluated by using three benchmarking metrics including accuracy, F1 macro, and F1 weighted. Meanwhile, the clustering performance is evaluated by using four benchmarking metrics including the Jaccard index (JI), C-index, silhouette score, and Davies Bouldin score. For the cancer multi-omics datasets, the methods’ strength in capturing the association of multi-omics dimensionality reduction results with survival and clinical annotations is further evaluated. The benchmarking results indicate that moGAT achieves the best classification performance. Meanwhile, efmmdVAE, efVAE, and lfmmdVAE show the most promising performance across all complementary contexts in clustering tasks. CONCLUSIONS: Our benchmarking results not only provide a reference for biomedical researchers to choose appropriate deep learning-based multi-omics data fusion methods, but also suggest the future directions for the development of more effective multi-omics data fusion methods. The deep learning frameworks are available at https://github.com/zhenglinyi/DL-mo. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-022-02739-2. BioMed Central 2022-08-09 /pmc/articles/PMC9361561/ /pubmed/35945544 http://dx.doi.org/10.1186/s13059-022-02739-2 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Leng, Dongjin
Zheng, Linyi
Wen, Yuqi
Zhang, Yunhao
Wu, Lianlian
Wang, Jing
Wang, Meihong
Zhang, Zhongnan
He, Song
Bo, Xiaochen
A benchmark study of deep learning-based multi-omics data fusion methods for cancer
title A benchmark study of deep learning-based multi-omics data fusion methods for cancer
title_full A benchmark study of deep learning-based multi-omics data fusion methods for cancer
title_fullStr A benchmark study of deep learning-based multi-omics data fusion methods for cancer
title_full_unstemmed A benchmark study of deep learning-based multi-omics data fusion methods for cancer
title_short A benchmark study of deep learning-based multi-omics data fusion methods for cancer
title_sort benchmark study of deep learning-based multi-omics data fusion methods for cancer
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9361561/
https://www.ncbi.nlm.nih.gov/pubmed/35945544
http://dx.doi.org/10.1186/s13059-022-02739-2
work_keys_str_mv AT lengdongjin abenchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer
AT zhenglinyi abenchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer
AT wenyuqi abenchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer
AT zhangyunhao abenchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer
AT wulianlian abenchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer
AT wangjing abenchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer
AT wangmeihong abenchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer
AT zhangzhongnan abenchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer
AT hesong abenchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer
AT boxiaochen abenchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer
AT lengdongjin benchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer
AT zhenglinyi benchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer
AT wenyuqi benchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer
AT zhangyunhao benchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer
AT wulianlian benchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer
AT wangjing benchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer
AT wangmeihong benchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer
AT zhangzhongnan benchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer
AT hesong benchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer
AT boxiaochen benchmarkstudyofdeeplearningbasedmultiomicsdatafusionmethodsforcancer