Biomarker interaction selection and disease detection based on multivariate gain ratio
BACKGROUND: Disease detection is an important aspect of biotherapy. With the development of biotechnology and computer technology, there are many methods to detect disease based on single biomarker. However, biomarker does not influence disease alone in some cases. It’s the interaction between bioma...
Main authors: | Chu, Xiao; Jiang, Mao; Liu, Zhuo-Jun |
---|---|
Format: | Online Article Text |
Language: | English |
Published: |
BioMed Central
2022
|
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9103137/ https://www.ncbi.nlm.nih.gov/pubmed/35550010 http://dx.doi.org/10.1186/s12859-022-04699-7 |
_version_ | 1784707490244460544 |
---|---|
author | Chu, Xiao Jiang, Mao Liu, Zhuo-Jun |
author_facet | Chu, Xiao Jiang, Mao Liu, Zhuo-Jun |
author_sort | Chu, Xiao |
collection | PubMed |
description | BACKGROUND: Disease detection is an important aspect of biotherapy. With the development of biotechnology and computer technology, there are many methods to detect disease based on single biomarker. However, biomarker does not influence disease alone in some cases. It’s the interaction between biomarkers that determines disease status. The existing influence measure I-score is used to evaluate the importance of interaction in determining disease status, but there is a deviation about the number of variables in interaction when applying I-score. To solve the problem, we propose a new influence measure Multivariate Gain Ratio (MGR) based on Gain Ratio (GR) of single-variate, which provides us with multivariate combination called interaction. RESULTS: We propose a preprocessing verification algorithm based on partial predictor variables to select an appropriate preprocessing method. In this paper, an algorithm for selecting key interactions of biomarkers and applying key interactions to construct a disease detection model is provided. MGR is more credible than I-score in the case of interaction containing small number of variables. Our method behaves better with average accuracy [Formula: see text] than I-score of [Formula: see text] in Breast Cancer Wisconsin (Diagnostic) Dataset. Compared to the classification results [Formula: see text] based on all predictor variables, MGR identifies the true main biomarkers and realizes the dimension reduction. In Leukemia Dataset, the experiment results show the effectiveness of MGR with the accuracy of [Formula: see text] compared to I-score with accuracy [Formula: see text] . The results can be explained by the nature of MGR and I-score mentioned above because every key interaction contains a small number of variables in Leukemia Dataset. 
CONCLUSIONS: MGR is effective for selecting important biomarkers and biomarker interactions even in high-dimension feature space in which the interaction could contain more than two biomarkers. The prediction ability of interactions selected by MGR is better than I-score in the case of interaction containing small number of variables. MGR is generally applicable to various types of biomarker datasets including cell nuclei, gene, SNPs and protein datasets. |
format | Online Article Text |
id | pubmed-9103137 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-91031372022-05-14 Biomarker interaction selection and disease detection based on multivariate gain ratio Chu, Xiao Jiang, Mao Liu, Zhuo-Jun BMC Bioinformatics Research BACKGROUND: Disease detection is an important aspect of biotherapy. With the development of biotechnology and computer technology, there are many methods to detect disease based on single biomarker. However, biomarker does not influence disease alone in some cases. It’s the interaction between biomarkers that determines disease status. The existing influence measure I-score is used to evaluate the importance of interaction in determining disease status, but there is a deviation about the number of variables in interaction when applying I-score. To solve the problem, we propose a new influence measure Multivariate Gain Ratio (MGR) based on Gain Ratio (GR) of single-variate, which provides us with multivariate combination called interaction. RESULTS: We propose a preprocessing verification algorithm based on partial predictor variables to select an appropriate preprocessing method. In this paper, an algorithm for selecting key interactions of biomarkers and applying key interactions to construct a disease detection model is provided. MGR is more credible than I-score in the case of interaction containing small number of variables. Our method behaves better with average accuracy [Formula: see text] than I-score of [Formula: see text] in Breast Cancer Wisconsin (Diagnostic) Dataset. Compared to the classification results [Formula: see text] based on all predictor variables, MGR identifies the true main biomarkers and realizes the dimension reduction. In Leukemia Dataset, the experiment results show the effectiveness of MGR with the accuracy of [Formula: see text] compared to I-score with accuracy [Formula: see text] . The results can be explained by the nature of MGR and I-score mentioned above because every key interaction contains a small number of variables in Leukemia Dataset. 
CONCLUSIONS: MGR is effective for selecting important biomarkers and biomarker interactions even in high-dimension feature space in which the interaction could contain more than two biomarkers. The prediction ability of interactions selected by MGR is better than I-score in the case of interaction containing small number of variables. MGR is generally applicable to various types of biomarker datasets including cell nuclei, gene, SNPs and protein datasets. BioMed Central 2022-05-12 /pmc/articles/PMC9103137/ /pubmed/35550010 http://dx.doi.org/10.1186/s12859-022-04699-7 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/ Open Access: This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Chu, Xiao Jiang, Mao Liu, Zhuo-Jun Biomarker interaction selection and disease detection based on multivariate gain ratio |
title | Biomarker interaction selection and disease detection based on multivariate gain ratio |
title_full | Biomarker interaction selection and disease detection based on multivariate gain ratio |
title_fullStr | Biomarker interaction selection and disease detection based on multivariate gain ratio |
title_full_unstemmed | Biomarker interaction selection and disease detection based on multivariate gain ratio |
title_short | Biomarker interaction selection and disease detection based on multivariate gain ratio |
title_sort | biomarker interaction selection and disease detection based on multivariate gain ratio |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9103137/ https://www.ncbi.nlm.nih.gov/pubmed/35550010 http://dx.doi.org/10.1186/s12859-022-04699-7 |
work_keys_str_mv | AT chuxiao biomarkerinteractionselectionanddiseasedetectionbasedonmultivariategainratio AT jiangmao biomarkerinteractionselectionanddiseasedetectionbasedonmultivariategainratio AT liuzhuojun biomarkerinteractionselectionanddiseasedetectionbasedonmultivariategainratio |
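
The description field above refers to the Gain Ratio (GR) of a single variable and to a multivariate extension (MGR), but this record does not reproduce the formulas. As a rough illustration only, the sketch below computes the textbook gain ratio (information gain divided by split information) for one discrete variable. It also includes a naive joint version that treats a tuple of variables as a single variable. The joint version is an assumption made for illustration and is not claimed to match the MGR defined in the article; the function names `entropy`, `gain_ratio`, and `joint_gain_ratio` are hypothetical.

```python
import math
from collections import Counter


def entropy(values):
    """Shannon entropy (in bits) of a sequence of discrete values."""
    n = len(values)
    return -sum((c / n) * math.log2(c / n) for c in Counter(values).values())


def gain_ratio(feature, labels):
    """Textbook gain ratio: information gain divided by split information."""
    n = len(labels)
    # Group the class labels by the value the feature takes.
    groups = {}
    for x, y in zip(feature, labels):
        groups.setdefault(x, []).append(y)
    # Expected entropy of the labels after splitting on the feature.
    conditional = sum(len(g) / n * entropy(g) for g in groups.values())
    info_gain = entropy(labels) - conditional
    split_info = entropy(feature)
    # A constant feature carries no split information; return 0 by convention.
    return info_gain / split_info if split_info > 0 else 0.0


def joint_gain_ratio(features, labels):
    """Illustrative assumption: score a set of variables by treating each
    tuple of their values as one combined variable. Not the article's MGR."""
    joint = list(zip(*features))
    return gain_ratio(joint, labels)


# Toy XOR-style data: neither variable is informative alone,
# but the pair determines the label.
a = [0, 0, 1, 1]
b = [0, 1, 0, 1]
y = [0, 1, 1, 0]
single_a = gain_ratio(a, y)
pair_ab = joint_gain_ratio([a, b], y)
```

On this toy data each variable scores zero on its own while the pair scores above zero, which is the kind of interaction effect the abstract describes. The split-information denominator grows with the number of distinct value combinations, so any real multivariate measure has to handle that growth carefully.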