Cargando…
Identification of Pan-Cancer Biomarkers Based on the Gene Expression Profiles of Cancer Cell Lines
There are many types of cancers. Although they share some hallmarks, such as proliferation and metastasis, they are still very different from many perspectives. They grow on different organ or tissues. Does each cancer have a unique gene expression pattern that makes it different from other cancer t...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8669964/ https://www.ncbi.nlm.nih.gov/pubmed/34917619 http://dx.doi.org/10.3389/fcell.2021.781285 |
_version_ | 1784614881268334592 |
---|---|
author | Ding, ShiJian Li, Hao Zhang, Yu-Hang Zhou, XianChao Feng, KaiYan Li, ZhanDong Chen, Lei Huang, Tao Cai, Yu-Dong |
author_facet | Ding, ShiJian Li, Hao Zhang, Yu-Hang Zhou, XianChao Feng, KaiYan Li, ZhanDong Chen, Lei Huang, Tao Cai, Yu-Dong |
author_sort | Ding, ShiJian |
collection | PubMed |
description | There are many types of cancers. Although they share some hallmarks, such as proliferation and metastasis, they are still very different from many perspectives. They grow on different organ or tissues. Does each cancer have a unique gene expression pattern that makes it different from other cancer types? After the Cancer Genome Atlas (TCGA) project, there are more and more pan-cancer studies. Researchers want to get robust gene expression signature from pan-cancer patients. But there is large variance in cancer patients due to heterogeneity. To get robust results, the sample size will be too large to recruit. In this study, we tried another approach to get robust pan-cancer biomarkers by using the cell line data to reduce the variance. We applied several advanced computational methods to analyze the Cancer Cell Line Encyclopedia (CCLE) gene expression profiles which included 988 cell lines from 20 cancer types. Two feature selection methods, including Boruta, and max-relevance and min-redundancy methods, were applied to the cell line gene expression data one by one, generating a feature list. Such list was fed into incremental feature selection method, incorporating one classification algorithm, to extract biomarkers, construct optimal classifiers and decision rules. The optimal classifiers provided good performance, which can be useful tools to identify cell lines from different cancer types, whereas the biomarkers (e.g. NCKAP1, TNFRSF12A, LAMB2, FKBP9, PFN2, TOM1L1) and rules identified in this work may provide a meaningful and precise reference for differentiating multiple types of cancer and contribute to the personalized treatment of tumors. |
format | Online Article Text |
id | pubmed-8669964 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-86699642021-12-15 Identification of Pan-Cancer Biomarkers Based on the Gene Expression Profiles of Cancer Cell Lines Ding, ShiJian Li, Hao Zhang, Yu-Hang Zhou, XianChao Feng, KaiYan Li, ZhanDong Chen, Lei Huang, Tao Cai, Yu-Dong Front Cell Dev Biol Cell and Developmental Biology There are many types of cancers. Although they share some hallmarks, such as proliferation and metastasis, they are still very different from many perspectives. They grow on different organ or tissues. Does each cancer have a unique gene expression pattern that makes it different from other cancer types? After the Cancer Genome Atlas (TCGA) project, there are more and more pan-cancer studies. Researchers want to get robust gene expression signature from pan-cancer patients. But there is large variance in cancer patients due to heterogeneity. To get robust results, the sample size will be too large to recruit. In this study, we tried another approach to get robust pan-cancer biomarkers by using the cell line data to reduce the variance. We applied several advanced computational methods to analyze the Cancer Cell Line Encyclopedia (CCLE) gene expression profiles which included 988 cell lines from 20 cancer types. Two feature selection methods, including Boruta, and max-relevance and min-redundancy methods, were applied to the cell line gene expression data one by one, generating a feature list. Such list was fed into incremental feature selection method, incorporating one classification algorithm, to extract biomarkers, construct optimal classifiers and decision rules. The optimal classifiers provided good performance, which can be useful tools to identify cell lines from different cancer types, whereas the biomarkers (e.g. NCKAP1, TNFRSF12A, LAMB2, FKBP9, PFN2, TOM1L1) and rules identified in this work may provide a meaningful and precise reference for differentiating multiple types of cancer and contribute to the personalized treatment of tumors. Frontiers Media S.A. 2021-11-30 /pmc/articles/PMC8669964/ /pubmed/34917619 http://dx.doi.org/10.3389/fcell.2021.781285 Text en Copyright © 2021 Ding, Li, Zhang, Zhou, Feng, Li, Chen, Huang and Cai. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Cell and Developmental Biology Ding, ShiJian Li, Hao Zhang, Yu-Hang Zhou, XianChao Feng, KaiYan Li, ZhanDong Chen, Lei Huang, Tao Cai, Yu-Dong Identification of Pan-Cancer Biomarkers Based on the Gene Expression Profiles of Cancer Cell Lines |
title | Identification of Pan-Cancer Biomarkers Based on the Gene Expression Profiles of Cancer Cell Lines |
title_full | Identification of Pan-Cancer Biomarkers Based on the Gene Expression Profiles of Cancer Cell Lines |
title_fullStr | Identification of Pan-Cancer Biomarkers Based on the Gene Expression Profiles of Cancer Cell Lines |
title_full_unstemmed | Identification of Pan-Cancer Biomarkers Based on the Gene Expression Profiles of Cancer Cell Lines |
title_short | Identification of Pan-Cancer Biomarkers Based on the Gene Expression Profiles of Cancer Cell Lines |
title_sort | identification of pan-cancer biomarkers based on the gene expression profiles of cancer cell lines |
topic | Cell and Developmental Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8669964/ https://www.ncbi.nlm.nih.gov/pubmed/34917619 http://dx.doi.org/10.3389/fcell.2021.781285 |
work_keys_str_mv | AT dingshijian identificationofpancancerbiomarkersbasedonthegeneexpressionprofilesofcancercelllines AT lihao identificationofpancancerbiomarkersbasedonthegeneexpressionprofilesofcancercelllines AT zhangyuhang identificationofpancancerbiomarkersbasedonthegeneexpressionprofilesofcancercelllines AT zhouxianchao identificationofpancancerbiomarkersbasedonthegeneexpressionprofilesofcancercelllines AT fengkaiyan identificationofpancancerbiomarkersbasedonthegeneexpressionprofilesofcancercelllines AT lizhandong identificationofpancancerbiomarkersbasedonthegeneexpressionprofilesofcancercelllines AT chenlei identificationofpancancerbiomarkersbasedonthegeneexpressionprofilesofcancercelllines AT huangtao identificationofpancancerbiomarkersbasedonthegeneexpressionprofilesofcancercelllines AT caiyudong identificationofpancancerbiomarkersbasedonthegeneexpressionprofilesofcancercelllines |