Cargando…
MODILM: towards better complex diseases classification using a novel multi-omics data integration learning model
BACKGROUND: Accurately classifying complex diseases is crucial for diagnosis and personalized treatment. Integrating multi-omics data has been demonstrated to enhance the accuracy of analyzing and classifying complex diseases. This can be attributed to the highly correlated nature of the data with v...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10161645/ https://www.ncbi.nlm.nih.gov/pubmed/37147619 http://dx.doi.org/10.1186/s12911-023-02173-9 |
_version_ | 1785037536083574784 |
---|---|
author | Zhong, Yating Peng, Yuzhong Lin, Yanmei Chen, Dingjia Zhang, Hao Zheng, Wen Chen, Yuanyuan Wu, Changliang |
author_facet | Zhong, Yating Peng, Yuzhong Lin, Yanmei Chen, Dingjia Zhang, Hao Zheng, Wen Chen, Yuanyuan Wu, Changliang |
author_sort | Zhong, Yating |
collection | PubMed |
description | BACKGROUND: Accurately classifying complex diseases is crucial for diagnosis and personalized treatment. Integrating multi-omics data has been demonstrated to enhance the accuracy of analyzing and classifying complex diseases. This can be attributed to the highly correlated nature of the data with various diseases, as well as the comprehensive and complementary information it provides. However, integrating multi-omics data for complex diseases is challenged by data characteristics such as high imbalance, scale variation, heterogeneity, and noise interference. These challenges further emphasize the importance of developing effective methods for multi-omics data integration. RESULTS: We proposed a novel multi-omics data learning model called MODILM, which integrates multiple omics data to improve the classification accuracy of complex diseases by obtaining more significant and complementary information from different single-omics data. Our approach includes four key steps: 1) constructing a similarity network for each omics data using the cosine similarity measure, 2) leveraging Graph Attention Networks to learn sample-specific and intra-association features from similarity networks for single-omics data, 3) using Multilayer Perceptron networks to map learned features to a new feature space, thereby strengthening and extracting high-level omics-specific features, and 4) fusing these high-level features using a View Correlation Discovery Network to learn cross-omics features in the label space, which results in unique class-level distinctiveness for complex diseases. To demonstrate the effectiveness of MODILM, we conducted experiments on six benchmark datasets consisting of miRNA expression, mRNA, and DNA methylation data. Our results show that MODILM outperforms state-of-the-art methods, effectively improving the accuracy of complex disease classification. CONCLUSIONS: Our MODILM provides a more competitive way to extract and integrate important and complementary information from multiple omics data, providing a very promising tool for supporting decision-making for clinical diagnosis. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-023-02173-9. |
format | Online Article Text |
id | pubmed-10161645 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-101616452023-05-06 MODILM: towards better complex diseases classification using a novel multi-omics data integration learning model Zhong, Yating Peng, Yuzhong Lin, Yanmei Chen, Dingjia Zhang, Hao Zheng, Wen Chen, Yuanyuan Wu, Changliang BMC Med Inform Decis Mak Research BACKGROUND: Accurately classifying complex diseases is crucial for diagnosis and personalized treatment. Integrating multi-omics data has been demonstrated to enhance the accuracy of analyzing and classifying complex diseases. This can be attributed to the highly correlated nature of the data with various diseases, as well as the comprehensive and complementary information it provides. However, integrating multi-omics data for complex diseases is challenged by data characteristics such as high imbalance, scale variation, heterogeneity, and noise interference. These challenges further emphasize the importance of developing effective methods for multi-omics data integration. RESULTS: We proposed a novel multi-omics data learning model called MODILM, which integrates multiple omics data to improve the classification accuracy of complex diseases by obtaining more significant and complementary information from different single-omics data. Our approach includes four key steps: 1) constructing a similarity network for each omics data using the cosine similarity measure, 2) leveraging Graph Attention Networks to learn sample-specific and intra-association features from similarity networks for single-omics data, 3) using Multilayer Perceptron networks to map learned features to a new feature space, thereby strengthening and extracting high-level omics-specific features, and 4) fusing these high-level features using a View Correlation Discovery Network to learn cross-omics features in the label space, which results in unique class-level distinctiveness for complex diseases. To demonstrate the effectiveness of MODILM, we conducted experiments on six benchmark datasets consisting of miRNA expression, mRNA, and DNA methylation data. Our results show that MODILM outperforms state-of-the-art methods, effectively improving the accuracy of complex disease classification. CONCLUSIONS: Our MODILM provides a more competitive way to extract and integrate important and complementary information from multiple omics data, providing a very promising tool for supporting decision-making for clinical diagnosis. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-023-02173-9. BioMed Central 2023-05-05 /pmc/articles/PMC10161645/ /pubmed/37147619 http://dx.doi.org/10.1186/s12911-023-02173-9 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Zhong, Yating Peng, Yuzhong Lin, Yanmei Chen, Dingjia Zhang, Hao Zheng, Wen Chen, Yuanyuan Wu, Changliang MODILM: towards better complex diseases classification using a novel multi-omics data integration learning model |
title | MODILM: towards better complex diseases classification using a novel multi-omics data integration learning model |
title_full | MODILM: towards better complex diseases classification using a novel multi-omics data integration learning model |
title_fullStr | MODILM: towards better complex diseases classification using a novel multi-omics data integration learning model |
title_full_unstemmed | MODILM: towards better complex diseases classification using a novel multi-omics data integration learning model |
title_short | MODILM: towards better complex diseases classification using a novel multi-omics data integration learning model |
title_sort | modilm: towards better complex diseases classification using a novel multi-omics data integration learning model |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10161645/ https://www.ncbi.nlm.nih.gov/pubmed/37147619 http://dx.doi.org/10.1186/s12911-023-02173-9 |
work_keys_str_mv | AT zhongyating modilmtowardsbettercomplexdiseasesclassificationusinganovelmultiomicsdataintegrationlearningmodel AT pengyuzhong modilmtowardsbettercomplexdiseasesclassificationusinganovelmultiomicsdataintegrationlearningmodel AT linyanmei modilmtowardsbettercomplexdiseasesclassificationusinganovelmultiomicsdataintegrationlearningmodel AT chendingjia modilmtowardsbettercomplexdiseasesclassificationusinganovelmultiomicsdataintegrationlearningmodel AT zhanghao modilmtowardsbettercomplexdiseasesclassificationusinganovelmultiomicsdataintegrationlearningmodel AT zhengwen modilmtowardsbettercomplexdiseasesclassificationusinganovelmultiomicsdataintegrationlearningmodel AT chenyuanyuan modilmtowardsbettercomplexdiseasesclassificationusinganovelmultiomicsdataintegrationlearningmodel AT wuchangliang modilmtowardsbettercomplexdiseasesclassificationusinganovelmultiomicsdataintegrationlearningmodel |