Cargando…
CRIA: An Interactive Gene Selection Algorithm for Cancers Prediction Based on Copy Number Variations
Genomic copy number variations (CNVs) are among the most important structural variations of genes found to be related to the risk of individual cancer and therefore they can be utilized to provide a clue to the research on the formation and progression of cancer. In this paper, an improved computati...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8978562/ https://www.ncbi.nlm.nih.gov/pubmed/35386679 http://dx.doi.org/10.3389/fpls.2022.839044 |
_version_ | 1784680990028857344 |
---|---|
author | Wu, Qiang Li, Dongxi |
author_facet | Wu, Qiang Li, Dongxi |
author_sort | Wu, Qiang |
collection | PubMed |
description | Genomic copy number variations (CNVs) are among the most important structural variations of genes found to be related to the risk of individual cancer and therefore they can be utilized to provide a clue to the research on the formation and progression of cancer. In this paper, an improved computational gene selection algorithm called CRIA (correlation-redundancy and interaction analysis based on gene selection algorithm) is introduced to screen genes that are closely related to cancer from the whole genome based on the value of gene CNVs. The CRIA algorithm mainly consists of two parts. Firstly, the main effect feature is selected out from the original feature set that has the largest correlation with the class label. Secondly, after the analysis involving correlation, redundancy and interaction for each feature in the candidate feature set, we choose the feature that maximizes the value of the custom selection criterion and add it into the selected feature set and then remove it from the candidate feature set in each selection round. Based on the real datasets, CRIA selects the top 200 genes to predict the type of cancer. The experiments' results of our research show that, compared with the state-of-the-art related methods, the CRIA algorithm can extract the key features of CNVs and a better classification performance can be achieved based on them. In addition, the interpretable genes highly related to cancer can be known, which may provide new clues at the genetic level for the treatment of the cancer. |
format | Online Article Text |
id | pubmed-8978562 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-89785622022-04-05 CRIA: An Interactive Gene Selection Algorithm for Cancers Prediction Based on Copy Number Variations Wu, Qiang Li, Dongxi Front Plant Sci Plant Science Genomic copy number variations (CNVs) are among the most important structural variations of genes found to be related to the risk of individual cancer and therefore they can be utilized to provide a clue to the research on the formation and progression of cancer. In this paper, an improved computational gene selection algorithm called CRIA (correlation-redundancy and interaction analysis based on gene selection algorithm) is introduced to screen genes that are closely related to cancer from the whole genome based on the value of gene CNVs. The CRIA algorithm mainly consists of two parts. Firstly, the main effect feature is selected out from the original feature set that has the largest correlation with the class label. Secondly, after the analysis involving correlation, redundancy and interaction for each feature in the candidate feature set, we choose the feature that maximizes the value of the custom selection criterion and add it into the selected feature set and then remove it from the candidate feature set in each selection round. Based on the real datasets, CRIA selects the top 200 genes to predict the type of cancer. The experiments' results of our research show that, compared with the state-of-the-art related methods, the CRIA algorithm can extract the key features of CNVs and a better classification performance can be achieved based on them. In addition, the interpretable genes highly related to cancer can be known, which may provide new clues at the genetic level for the treatment of the cancer. Frontiers Media S.A. 2022-03-21 /pmc/articles/PMC8978562/ /pubmed/35386679 http://dx.doi.org/10.3389/fpls.2022.839044 Text en Copyright © 2022 Wu and Li. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Plant Science Wu, Qiang Li, Dongxi CRIA: An Interactive Gene Selection Algorithm for Cancers Prediction Based on Copy Number Variations |
title | CRIA: An Interactive Gene Selection Algorithm for Cancers Prediction Based on Copy Number Variations |
title_full | CRIA: An Interactive Gene Selection Algorithm for Cancers Prediction Based on Copy Number Variations |
title_fullStr | CRIA: An Interactive Gene Selection Algorithm for Cancers Prediction Based on Copy Number Variations |
title_full_unstemmed | CRIA: An Interactive Gene Selection Algorithm for Cancers Prediction Based on Copy Number Variations |
title_short | CRIA: An Interactive Gene Selection Algorithm for Cancers Prediction Based on Copy Number Variations |
title_sort | cria: an interactive gene selection algorithm for cancers prediction based on copy number variations |
topic | Plant Science |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8978562/ https://www.ncbi.nlm.nih.gov/pubmed/35386679 http://dx.doi.org/10.3389/fpls.2022.839044 |
work_keys_str_mv | AT wuqiang criaaninteractivegeneselectionalgorithmforcancerspredictionbasedoncopynumbervariations AT lidongxi criaaninteractivegeneselectionalgorithmforcancerspredictionbasedoncopynumbervariations |