Cargando…

Feature selection algorithm based on dual correlation filters for cancer-associated somatic variants

BACKGROUND: Since the development of sequencing technology, an enormous amount of genetic information has been generated, and human cancer analysis using this information is drawing attention. As the effects of variants on human cancer become known, it is important to find cancer-associated variants...

Descripción completa

Detalles Bibliográficos
Autores principales: Seo, Hyein, Cho, Dong-Ho
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7596964/
https://www.ncbi.nlm.nih.gov/pubmed/33121438
http://dx.doi.org/10.1186/s12859-020-03767-0
_version_ 1783602227399622656
author Seo, Hyein
Cho, Dong-Ho
author_facet Seo, Hyein
Cho, Dong-Ho
author_sort Seo, Hyein
collection PubMed
description BACKGROUND: Since the development of sequencing technology, an enormous amount of genetic information has been generated, and human cancer analysis using this information is drawing attention. As the effects of variants on human cancer become known, it is important to find cancer-associated variants among countless variants. RESULTS: We propose a new filter-based feature selection method applicable for extracting cancer-associated somatic variants considering correlations of data. Both variants associated with the activation and deactivation of cancer’s characteristics are analyzed using dual correlation filters. The multiobjective optimization is utilized to consider two types of variants simultaneously without redundancy. To overcome high computational complexity problem, we calculate the correlation-based weight to select significant variants instead of directly searching for the optimal subset of variants. The proposed algorithm is applied to the identification of melanoma metastasis or breast cancer stage, and the classification results of the proposed method are compared with those of conventional single correlation filter-based method. CONCLUSIONS: We verified that the proposed dual correlation filter-based method can extract cancer-associated variants related to the characteristics of human cancer.
format Online
Article
Text
id pubmed-7596964
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-75969642020-11-02 Feature selection algorithm based on dual correlation filters for cancer-associated somatic variants Seo, Hyein Cho, Dong-Ho BMC Bioinformatics Methodology Article BACKGROUND: Since the development of sequencing technology, an enormous amount of genetic information has been generated, and human cancer analysis using this information is drawing attention. As the effects of variants on human cancer become known, it is important to find cancer-associated variants among countless variants. RESULTS: We propose a new filter-based feature selection method applicable for extracting cancer-associated somatic variants considering correlations of data. Both variants associated with the activation and deactivation of cancer’s characteristics are analyzed using dual correlation filters. The multiobjective optimization is utilized to consider two types of variants simultaneously without redundancy. To overcome high computational complexity problem, we calculate the correlation-based weight to select significant variants instead of directly searching for the optimal subset of variants. The proposed algorithm is applied to the identification of melanoma metastasis or breast cancer stage, and the classification results of the proposed method are compared with those of conventional single correlation filter-based method. CONCLUSIONS: We verified that the proposed dual correlation filter-based method can extract cancer-associated variants related to the characteristics of human cancer. BioMed Central 2020-10-30 /pmc/articles/PMC7596964/ /pubmed/33121438 http://dx.doi.org/10.1186/s12859-020-03767-0 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology Article
Seo, Hyein
Cho, Dong-Ho
Feature selection algorithm based on dual correlation filters for cancer-associated somatic variants
title Feature selection algorithm based on dual correlation filters for cancer-associated somatic variants
title_full Feature selection algorithm based on dual correlation filters for cancer-associated somatic variants
title_fullStr Feature selection algorithm based on dual correlation filters for cancer-associated somatic variants
title_full_unstemmed Feature selection algorithm based on dual correlation filters for cancer-associated somatic variants
title_short Feature selection algorithm based on dual correlation filters for cancer-associated somatic variants
title_sort feature selection algorithm based on dual correlation filters for cancer-associated somatic variants
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7596964/
https://www.ncbi.nlm.nih.gov/pubmed/33121438
http://dx.doi.org/10.1186/s12859-020-03767-0
work_keys_str_mv AT seohyein featureselectionalgorithmbasedondualcorrelationfiltersforcancerassociatedsomaticvariants
AT chodongho featureselectionalgorithmbasedondualcorrelationfiltersforcancerassociatedsomaticvariants