Cargando…

CRIA: An Interactive Gene Selection Algorithm for Cancers Prediction Based on Copy Number Variations

Genomic copy number variations (CNVs) are among the most important structural variations of genes found to be related to the risk of individual cancer and therefore they can be utilized to provide a clue to the research on the formation and progression of cancer. In this paper, an improved computati...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Qiang, Li, Dongxi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8978562/
https://www.ncbi.nlm.nih.gov/pubmed/35386679
http://dx.doi.org/10.3389/fpls.2022.839044
_version_ 1784680990028857344
author Wu, Qiang
Li, Dongxi
author_facet Wu, Qiang
Li, Dongxi
author_sort Wu, Qiang
collection PubMed
description Genomic copy number variations (CNVs) are among the most important structural variations of genes found to be related to the risk of individual cancer and therefore they can be utilized to provide a clue to the research on the formation and progression of cancer. In this paper, an improved computational gene selection algorithm called CRIA (correlation-redundancy and interaction analysis based on gene selection algorithm) is introduced to screen genes that are closely related to cancer from the whole genome based on the value of gene CNVs. The CRIA algorithm mainly consists of two parts. Firstly, the main effect feature is selected out from the original feature set that has the largest correlation with the class label. Secondly, after the analysis involving correlation, redundancy and interaction for each feature in the candidate feature set, we choose the feature that maximizes the value of the custom selection criterion and add it into the selected feature set and then remove it from the candidate feature set in each selection round. Based on the real datasets, CRIA selects the top 200 genes to predict the type of cancer. The experiments' results of our research show that, compared with the state-of-the-art related methods, the CRIA algorithm can extract the key features of CNVs and a better classification performance can be achieved based on them. In addition, the interpretable genes highly related to cancer can be known, which may provide new clues at the genetic level for the treatment of the cancer.
format Online
Article
Text
id pubmed-8978562
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-89785622022-04-05 CRIA: An Interactive Gene Selection Algorithm for Cancers Prediction Based on Copy Number Variations Wu, Qiang Li, Dongxi Front Plant Sci Plant Science Genomic copy number variations (CNVs) are among the most important structural variations of genes found to be related to the risk of individual cancer and therefore they can be utilized to provide a clue to the research on the formation and progression of cancer. In this paper, an improved computational gene selection algorithm called CRIA (correlation-redundancy and interaction analysis based on gene selection algorithm) is introduced to screen genes that are closely related to cancer from the whole genome based on the value of gene CNVs. The CRIA algorithm mainly consists of two parts. Firstly, the main effect feature is selected out from the original feature set that has the largest correlation with the class label. Secondly, after the analysis involving correlation, redundancy and interaction for each feature in the candidate feature set, we choose the feature that maximizes the value of the custom selection criterion and add it into the selected feature set and then remove it from the candidate feature set in each selection round. Based on the real datasets, CRIA selects the top 200 genes to predict the type of cancer. The experiments' results of our research show that, compared with the state-of-the-art related methods, the CRIA algorithm can extract the key features of CNVs and a better classification performance can be achieved based on them. In addition, the interpretable genes highly related to cancer can be known, which may provide new clues at the genetic level for the treatment of the cancer. Frontiers Media S.A. 2022-03-21 /pmc/articles/PMC8978562/ /pubmed/35386679 http://dx.doi.org/10.3389/fpls.2022.839044 Text en Copyright © 2022 Wu and Li. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Plant Science
Wu, Qiang
Li, Dongxi
CRIA: An Interactive Gene Selection Algorithm for Cancers Prediction Based on Copy Number Variations
title CRIA: An Interactive Gene Selection Algorithm for Cancers Prediction Based on Copy Number Variations
title_full CRIA: An Interactive Gene Selection Algorithm for Cancers Prediction Based on Copy Number Variations
title_fullStr CRIA: An Interactive Gene Selection Algorithm for Cancers Prediction Based on Copy Number Variations
title_full_unstemmed CRIA: An Interactive Gene Selection Algorithm for Cancers Prediction Based on Copy Number Variations
title_short CRIA: An Interactive Gene Selection Algorithm for Cancers Prediction Based on Copy Number Variations
title_sort cria: an interactive gene selection algorithm for cancers prediction based on copy number variations
topic Plant Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8978562/
https://www.ncbi.nlm.nih.gov/pubmed/35386679
http://dx.doi.org/10.3389/fpls.2022.839044
work_keys_str_mv AT wuqiang criaaninteractivegeneselectionalgorithmforcancerspredictionbasedoncopynumbervariations
AT lidongxi criaaninteractivegeneselectionalgorithmforcancerspredictionbasedoncopynumbervariations