Cargando…

Kernel Fusion Method for Detecting Cancer Subtypes via Selecting Relevant Expression Data

Recently, cancer has been characterized as a heterogeneous disease composed of many different subtypes. Early diagnosis of cancer subtypes is an important study of cancer research, which can be of tremendous help to patients after treatment. In this paper, we first extract a novel dataset, which con...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Shuhao, Jiang, Limin, Tang, Jijun, Gao, Nan, Guo, Fei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7511763/
https://www.ncbi.nlm.nih.gov/pubmed/33133130
http://dx.doi.org/10.3389/fgene.2020.00979
_version_ 1783586020037492736
author Li, Shuhao
Jiang, Limin
Tang, Jijun
Gao, Nan
Guo, Fei
author_facet Li, Shuhao
Jiang, Limin
Tang, Jijun
Gao, Nan
Guo, Fei
author_sort Li, Shuhao
collection PubMed
description Recently, cancer has been characterized as a heterogeneous disease composed of many different subtypes. Early diagnosis of cancer subtypes is an important study of cancer research, which can be of tremendous help to patients after treatment. In this paper, we first extract a novel dataset, which contains gene expression, miRNA expression, and isoform expression of five cancers from The Cancer Genome Atlas (TCGA). Next, to avoid the effect of noise existing in 60, 483 genes, we select a small number of genes by using LASSO that employs gene expression and survival time of patients. Then, we construct one similarity kernel for each expression data by using Chebyshev distance. And also, We used SKF to fused the three similarity matrix composed of gene, Iso, and miRNA, and finally clustered the fused similarity matrix with spectral clustering. In the experimental results, our method has better P-value in the Cox model than other methods on 10 cancer data from Jiang Dataset and Novel Dataset. We have drawn different survival curves for different cancers and found that some genes play a key role in cancer. For breast cancer, we find out that HSPA2A, RNASE1, CLIC6, and IFITM1 are highly expressed in some specific groups. For lung cancer, we ensure that C4BPA, SESN3, and IRS1 are highly expressed in some specific groups. The code and all supporting data files are available from https://github.com/guofei-tju/Uncovering-Cancer-Subtypes-via-LASSO.
format Online
Article
Text
id pubmed-7511763
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-75117632020-10-30 Kernel Fusion Method for Detecting Cancer Subtypes via Selecting Relevant Expression Data Li, Shuhao Jiang, Limin Tang, Jijun Gao, Nan Guo, Fei Front Genet Genetics Recently, cancer has been characterized as a heterogeneous disease composed of many different subtypes. Early diagnosis of cancer subtypes is an important study of cancer research, which can be of tremendous help to patients after treatment. In this paper, we first extract a novel dataset, which contains gene expression, miRNA expression, and isoform expression of five cancers from The Cancer Genome Atlas (TCGA). Next, to avoid the effect of noise existing in 60, 483 genes, we select a small number of genes by using LASSO that employs gene expression and survival time of patients. Then, we construct one similarity kernel for each expression data by using Chebyshev distance. And also, We used SKF to fused the three similarity matrix composed of gene, Iso, and miRNA, and finally clustered the fused similarity matrix with spectral clustering. In the experimental results, our method has better P-value in the Cox model than other methods on 10 cancer data from Jiang Dataset and Novel Dataset. We have drawn different survival curves for different cancers and found that some genes play a key role in cancer. For breast cancer, we find out that HSPA2A, RNASE1, CLIC6, and IFITM1 are highly expressed in some specific groups. For lung cancer, we ensure that C4BPA, SESN3, and IRS1 are highly expressed in some specific groups. The code and all supporting data files are available from https://github.com/guofei-tju/Uncovering-Cancer-Subtypes-via-LASSO. Frontiers Media S.A. 2020-09-10 /pmc/articles/PMC7511763/ /pubmed/33133130 http://dx.doi.org/10.3389/fgene.2020.00979 Text en Copyright © 2020 Li, Jiang, Tang, Gao and Guo. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Li, Shuhao
Jiang, Limin
Tang, Jijun
Gao, Nan
Guo, Fei
Kernel Fusion Method for Detecting Cancer Subtypes via Selecting Relevant Expression Data
title Kernel Fusion Method for Detecting Cancer Subtypes via Selecting Relevant Expression Data
title_full Kernel Fusion Method for Detecting Cancer Subtypes via Selecting Relevant Expression Data
title_fullStr Kernel Fusion Method for Detecting Cancer Subtypes via Selecting Relevant Expression Data
title_full_unstemmed Kernel Fusion Method for Detecting Cancer Subtypes via Selecting Relevant Expression Data
title_short Kernel Fusion Method for Detecting Cancer Subtypes via Selecting Relevant Expression Data
title_sort kernel fusion method for detecting cancer subtypes via selecting relevant expression data
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7511763/
https://www.ncbi.nlm.nih.gov/pubmed/33133130
http://dx.doi.org/10.3389/fgene.2020.00979
work_keys_str_mv AT lishuhao kernelfusionmethodfordetectingcancersubtypesviaselectingrelevantexpressiondata
AT jianglimin kernelfusionmethodfordetectingcancersubtypesviaselectingrelevantexpressiondata
AT tangjijun kernelfusionmethodfordetectingcancersubtypesviaselectingrelevantexpressiondata
AT gaonan kernelfusionmethodfordetectingcancersubtypesviaselectingrelevantexpressiondata
AT guofei kernelfusionmethodfordetectingcancersubtypesviaselectingrelevantexpressiondata