Cargando…
Kernel Fusion Method for Detecting Cancer Subtypes via Selecting Relevant Expression Data
Recently, cancer has been characterized as a heterogeneous disease composed of many different subtypes. Early diagnosis of cancer subtypes is an important study of cancer research, which can be of tremendous help to patients after treatment. In this paper, we first extract a novel dataset, which con...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7511763/ https://www.ncbi.nlm.nih.gov/pubmed/33133130 http://dx.doi.org/10.3389/fgene.2020.00979 |
_version_ | 1783586020037492736 |
---|---|
author | Li, Shuhao Jiang, Limin Tang, Jijun Gao, Nan Guo, Fei |
author_facet | Li, Shuhao Jiang, Limin Tang, Jijun Gao, Nan Guo, Fei |
author_sort | Li, Shuhao |
collection | PubMed |
description | Recently, cancer has been characterized as a heterogeneous disease composed of many different subtypes. Early diagnosis of cancer subtypes is an important study of cancer research, which can be of tremendous help to patients after treatment. In this paper, we first extract a novel dataset, which contains gene expression, miRNA expression, and isoform expression of five cancers from The Cancer Genome Atlas (TCGA). Next, to avoid the effect of noise existing in 60, 483 genes, we select a small number of genes by using LASSO that employs gene expression and survival time of patients. Then, we construct one similarity kernel for each expression data by using Chebyshev distance. And also, We used SKF to fused the three similarity matrix composed of gene, Iso, and miRNA, and finally clustered the fused similarity matrix with spectral clustering. In the experimental results, our method has better P-value in the Cox model than other methods on 10 cancer data from Jiang Dataset and Novel Dataset. We have drawn different survival curves for different cancers and found that some genes play a key role in cancer. For breast cancer, we find out that HSPA2A, RNASE1, CLIC6, and IFITM1 are highly expressed in some specific groups. For lung cancer, we ensure that C4BPA, SESN3, and IRS1 are highly expressed in some specific groups. The code and all supporting data files are available from https://github.com/guofei-tju/Uncovering-Cancer-Subtypes-via-LASSO. |
format | Online Article Text |
id | pubmed-7511763 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-75117632020-10-30 Kernel Fusion Method for Detecting Cancer Subtypes via Selecting Relevant Expression Data Li, Shuhao Jiang, Limin Tang, Jijun Gao, Nan Guo, Fei Front Genet Genetics Recently, cancer has been characterized as a heterogeneous disease composed of many different subtypes. Early diagnosis of cancer subtypes is an important study of cancer research, which can be of tremendous help to patients after treatment. In this paper, we first extract a novel dataset, which contains gene expression, miRNA expression, and isoform expression of five cancers from The Cancer Genome Atlas (TCGA). Next, to avoid the effect of noise existing in 60, 483 genes, we select a small number of genes by using LASSO that employs gene expression and survival time of patients. Then, we construct one similarity kernel for each expression data by using Chebyshev distance. And also, We used SKF to fused the three similarity matrix composed of gene, Iso, and miRNA, and finally clustered the fused similarity matrix with spectral clustering. In the experimental results, our method has better P-value in the Cox model than other methods on 10 cancer data from Jiang Dataset and Novel Dataset. We have drawn different survival curves for different cancers and found that some genes play a key role in cancer. For breast cancer, we find out that HSPA2A, RNASE1, CLIC6, and IFITM1 are highly expressed in some specific groups. For lung cancer, we ensure that C4BPA, SESN3, and IRS1 are highly expressed in some specific groups. The code and all supporting data files are available from https://github.com/guofei-tju/Uncovering-Cancer-Subtypes-via-LASSO. Frontiers Media S.A. 2020-09-10 /pmc/articles/PMC7511763/ /pubmed/33133130 http://dx.doi.org/10.3389/fgene.2020.00979 Text en Copyright © 2020 Li, Jiang, Tang, Gao and Guo. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Li, Shuhao Jiang, Limin Tang, Jijun Gao, Nan Guo, Fei Kernel Fusion Method for Detecting Cancer Subtypes via Selecting Relevant Expression Data |
title | Kernel Fusion Method for Detecting Cancer Subtypes via Selecting Relevant Expression Data |
title_full | Kernel Fusion Method for Detecting Cancer Subtypes via Selecting Relevant Expression Data |
title_fullStr | Kernel Fusion Method for Detecting Cancer Subtypes via Selecting Relevant Expression Data |
title_full_unstemmed | Kernel Fusion Method for Detecting Cancer Subtypes via Selecting Relevant Expression Data |
title_short | Kernel Fusion Method for Detecting Cancer Subtypes via Selecting Relevant Expression Data |
title_sort | kernel fusion method for detecting cancer subtypes via selecting relevant expression data |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7511763/ https://www.ncbi.nlm.nih.gov/pubmed/33133130 http://dx.doi.org/10.3389/fgene.2020.00979 |
work_keys_str_mv | AT lishuhao kernelfusionmethodfordetectingcancersubtypesviaselectingrelevantexpressiondata AT jianglimin kernelfusionmethodfordetectingcancersubtypesviaselectingrelevantexpressiondata AT tangjijun kernelfusionmethodfordetectingcancersubtypesviaselectingrelevantexpressiondata AT gaonan kernelfusionmethodfordetectingcancersubtypesviaselectingrelevantexpressiondata AT guofei kernelfusionmethodfordetectingcancersubtypesviaselectingrelevantexpressiondata |