Cargando…

Identification of Pan-Cancer Biomarkers Based on the Gene Expression Profiles of Cancer Cell Lines

There are many types of cancers. Although they share some hallmarks, such as proliferation and metastasis, they are still very different from many perspectives. They grow on different organ or tissues. Does each cancer have a unique gene expression pattern that makes it different from other cancer t...

Descripción completa

Detalles Bibliográficos
Autores principales: Ding, ShiJian, Li, Hao, Zhang, Yu-Hang, Zhou, XianChao, Feng, KaiYan, Li, ZhanDong, Chen, Lei, Huang, Tao, Cai, Yu-Dong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8669964/
https://www.ncbi.nlm.nih.gov/pubmed/34917619
http://dx.doi.org/10.3389/fcell.2021.781285
_version_ 1784614881268334592
author Ding, ShiJian
Li, Hao
Zhang, Yu-Hang
Zhou, XianChao
Feng, KaiYan
Li, ZhanDong
Chen, Lei
Huang, Tao
Cai, Yu-Dong
author_facet Ding, ShiJian
Li, Hao
Zhang, Yu-Hang
Zhou, XianChao
Feng, KaiYan
Li, ZhanDong
Chen, Lei
Huang, Tao
Cai, Yu-Dong
author_sort Ding, ShiJian
collection PubMed
description There are many types of cancers. Although they share some hallmarks, such as proliferation and metastasis, they are still very different from many perspectives. They grow on different organ or tissues. Does each cancer have a unique gene expression pattern that makes it different from other cancer types? After the Cancer Genome Atlas (TCGA) project, there are more and more pan-cancer studies. Researchers want to get robust gene expression signature from pan-cancer patients. But there is large variance in cancer patients due to heterogeneity. To get robust results, the sample size will be too large to recruit. In this study, we tried another approach to get robust pan-cancer biomarkers by using the cell line data to reduce the variance. We applied several advanced computational methods to analyze the Cancer Cell Line Encyclopedia (CCLE) gene expression profiles which included 988 cell lines from 20 cancer types. Two feature selection methods, including Boruta, and max-relevance and min-redundancy methods, were applied to the cell line gene expression data one by one, generating a feature list. Such list was fed into incremental feature selection method, incorporating one classification algorithm, to extract biomarkers, construct optimal classifiers and decision rules. The optimal classifiers provided good performance, which can be useful tools to identify cell lines from different cancer types, whereas the biomarkers (e.g. NCKAP1, TNFRSF12A, LAMB2, FKBP9, PFN2, TOM1L1) and rules identified in this work may provide a meaningful and precise reference for differentiating multiple types of cancer and contribute to the personalized treatment of tumors.
format Online
Article
Text
id pubmed-8669964
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-86699642021-12-15 Identification of Pan-Cancer Biomarkers Based on the Gene Expression Profiles of Cancer Cell Lines Ding, ShiJian Li, Hao Zhang, Yu-Hang Zhou, XianChao Feng, KaiYan Li, ZhanDong Chen, Lei Huang, Tao Cai, Yu-Dong Front Cell Dev Biol Cell and Developmental Biology There are many types of cancers. Although they share some hallmarks, such as proliferation and metastasis, they are still very different from many perspectives. They grow on different organ or tissues. Does each cancer have a unique gene expression pattern that makes it different from other cancer types? After the Cancer Genome Atlas (TCGA) project, there are more and more pan-cancer studies. Researchers want to get robust gene expression signature from pan-cancer patients. But there is large variance in cancer patients due to heterogeneity. To get robust results, the sample size will be too large to recruit. In this study, we tried another approach to get robust pan-cancer biomarkers by using the cell line data to reduce the variance. We applied several advanced computational methods to analyze the Cancer Cell Line Encyclopedia (CCLE) gene expression profiles which included 988 cell lines from 20 cancer types. Two feature selection methods, including Boruta, and max-relevance and min-redundancy methods, were applied to the cell line gene expression data one by one, generating a feature list. Such list was fed into incremental feature selection method, incorporating one classification algorithm, to extract biomarkers, construct optimal classifiers and decision rules. The optimal classifiers provided good performance, which can be useful tools to identify cell lines from different cancer types, whereas the biomarkers (e.g. NCKAP1, TNFRSF12A, LAMB2, FKBP9, PFN2, TOM1L1) and rules identified in this work may provide a meaningful and precise reference for differentiating multiple types of cancer and contribute to the personalized treatment of tumors. Frontiers Media S.A. 2021-11-30 /pmc/articles/PMC8669964/ /pubmed/34917619 http://dx.doi.org/10.3389/fcell.2021.781285 Text en Copyright © 2021 Ding, Li, Zhang, Zhou, Feng, Li, Chen, Huang and Cai. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Cell and Developmental Biology
Ding, ShiJian
Li, Hao
Zhang, Yu-Hang
Zhou, XianChao
Feng, KaiYan
Li, ZhanDong
Chen, Lei
Huang, Tao
Cai, Yu-Dong
Identification of Pan-Cancer Biomarkers Based on the Gene Expression Profiles of Cancer Cell Lines
title Identification of Pan-Cancer Biomarkers Based on the Gene Expression Profiles of Cancer Cell Lines
title_full Identification of Pan-Cancer Biomarkers Based on the Gene Expression Profiles of Cancer Cell Lines
title_fullStr Identification of Pan-Cancer Biomarkers Based on the Gene Expression Profiles of Cancer Cell Lines
title_full_unstemmed Identification of Pan-Cancer Biomarkers Based on the Gene Expression Profiles of Cancer Cell Lines
title_short Identification of Pan-Cancer Biomarkers Based on the Gene Expression Profiles of Cancer Cell Lines
title_sort identification of pan-cancer biomarkers based on the gene expression profiles of cancer cell lines
topic Cell and Developmental Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8669964/
https://www.ncbi.nlm.nih.gov/pubmed/34917619
http://dx.doi.org/10.3389/fcell.2021.781285
work_keys_str_mv AT dingshijian identificationofpancancerbiomarkersbasedonthegeneexpressionprofilesofcancercelllines
AT lihao identificationofpancancerbiomarkersbasedonthegeneexpressionprofilesofcancercelllines
AT zhangyuhang identificationofpancancerbiomarkersbasedonthegeneexpressionprofilesofcancercelllines
AT zhouxianchao identificationofpancancerbiomarkersbasedonthegeneexpressionprofilesofcancercelllines
AT fengkaiyan identificationofpancancerbiomarkersbasedonthegeneexpressionprofilesofcancercelllines
AT lizhandong identificationofpancancerbiomarkersbasedonthegeneexpressionprofilesofcancercelllines
AT chenlei identificationofpancancerbiomarkersbasedonthegeneexpressionprofilesofcancercelllines
AT huangtao identificationofpancancerbiomarkersbasedonthegeneexpressionprofilesofcancercelllines
AT caiyudong identificationofpancancerbiomarkersbasedonthegeneexpressionprofilesofcancercelllines