Cargando…

Patient subgrouping with distinct survival rates via integration of multiomics data on a Grassmann manifold

BACKGROUND: Patient subgroups are important for easily understanding a disease and for providing precise yet personalized treatment through multiple omics dataset integration. Multiomics datasets are produced daily. Thus, the fusion of heterogeneous big data into intrinsic structures is an urgent pr...

Descripción completa

Detalles Bibliográficos
Autores principales: Alfatemi, Ali, Peng, Hong, Rong, Wentao, Zhang, Bin, Cai, Hongmin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9308936/
https://www.ncbi.nlm.nih.gov/pubmed/35870923
http://dx.doi.org/10.1186/s12911-022-01938-y
_version_ 1784753050044334080
author Alfatemi, Ali
Peng, Hong
Rong, Wentao
Zhang, Bin
Cai, Hongmin
author_facet Alfatemi, Ali
Peng, Hong
Rong, Wentao
Zhang, Bin
Cai, Hongmin
author_sort Alfatemi, Ali
collection PubMed
description BACKGROUND: Patient subgroups are important for easily understanding a disease and for providing precise yet personalized treatment through multiple omics dataset integration. Multiomics datasets are produced daily. Thus, the fusion of heterogeneous big data into intrinsic structures is an urgent problem. Novel mathematical methods are needed to process these data in a straightforward way. RESULTS: We developed a novel method for subgrouping patients with distinct survival rates via the integration of multiple omics datasets and by using principal component analysis to reduce the high data dimensionality. Then, we constructed similarity graphs for patients, merged the graphs in a subspace, and analyzed them on a Grassmann manifold. The proposed method could identify patient subgroups that had not been reported previously by selecting the most critical information during the merging at each level of the omics dataset. Our method was tested on empirical multiomics datasets from The Cancer Genome Atlas. CONCLUSION: Through the integration of microRNA, gene expression, and DNA methylation data, our method accurately identified patient subgroups and achieved superior performance compared with popular methods. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-022-01938-y.
format Online
Article
Text
id pubmed-9308936
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-93089362022-07-25 Patient subgrouping with distinct survival rates via integration of multiomics data on a Grassmann manifold Alfatemi, Ali Peng, Hong Rong, Wentao Zhang, Bin Cai, Hongmin BMC Med Inform Decis Mak Research BACKGROUND: Patient subgroups are important for easily understanding a disease and for providing precise yet personalized treatment through multiple omics dataset integration. Multiomics datasets are produced daily. Thus, the fusion of heterogeneous big data into intrinsic structures is an urgent problem. Novel mathematical methods are needed to process these data in a straightforward way. RESULTS: We developed a novel method for subgrouping patients with distinct survival rates via the integration of multiple omics datasets and by using principal component analysis to reduce the high data dimensionality. Then, we constructed similarity graphs for patients, merged the graphs in a subspace, and analyzed them on a Grassmann manifold. The proposed method could identify patient subgroups that had not been reported previously by selecting the most critical information during the merging at each level of the omics dataset. Our method was tested on empirical multiomics datasets from The Cancer Genome Atlas. CONCLUSION: Through the integration of microRNA, gene expression, and DNA methylation data, our method accurately identified patient subgroups and achieved superior performance compared with popular methods. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-022-01938-y. BioMed Central 2022-07-23 /pmc/articles/PMC9308936/ /pubmed/35870923 http://dx.doi.org/10.1186/s12911-022-01938-y Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Alfatemi, Ali
Peng, Hong
Rong, Wentao
Zhang, Bin
Cai, Hongmin
Patient subgrouping with distinct survival rates via integration of multiomics data on a Grassmann manifold
title Patient subgrouping with distinct survival rates via integration of multiomics data on a Grassmann manifold
title_full Patient subgrouping with distinct survival rates via integration of multiomics data on a Grassmann manifold
title_fullStr Patient subgrouping with distinct survival rates via integration of multiomics data on a Grassmann manifold
title_full_unstemmed Patient subgrouping with distinct survival rates via integration of multiomics data on a Grassmann manifold
title_short Patient subgrouping with distinct survival rates via integration of multiomics data on a Grassmann manifold
title_sort patient subgrouping with distinct survival rates via integration of multiomics data on a grassmann manifold
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9308936/
https://www.ncbi.nlm.nih.gov/pubmed/35870923
http://dx.doi.org/10.1186/s12911-022-01938-y
work_keys_str_mv AT alfatemiali patientsubgroupingwithdistinctsurvivalratesviaintegrationofmultiomicsdataonagrassmannmanifold
AT penghong patientsubgroupingwithdistinctsurvivalratesviaintegrationofmultiomicsdataonagrassmannmanifold
AT rongwentao patientsubgroupingwithdistinctsurvivalratesviaintegrationofmultiomicsdataonagrassmannmanifold
AT zhangbin patientsubgroupingwithdistinctsurvivalratesviaintegrationofmultiomicsdataonagrassmannmanifold
AT caihongmin patientsubgroupingwithdistinctsurvivalratesviaintegrationofmultiomicsdataonagrassmannmanifold