Cargando…

Quasi-linear score for capturing heterogeneous structure in biomarkers

BACKGROUND: Linear scores are widely used to predict dichotomous outcomes in biomedical studies because of their learnability and understandability. Such approaches, however, cannot be used to elucidate biodiversity when there is heterogeneous structure in target population. RESULTS: Our study was f...

Descripción completa

Detalles Bibliográficos
Autores principales: Omae, Katsuhiro, Komori, Osamu, Eguchi, Shinto
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5477283/
https://www.ncbi.nlm.nih.gov/pubmed/28629325
http://dx.doi.org/10.1186/s12859-017-1721-x
_version_ 1783244760052400128
author Omae, Katsuhiro
Komori, Osamu
Eguchi, Shinto
author_facet Omae, Katsuhiro
Komori, Osamu
Eguchi, Shinto
author_sort Omae, Katsuhiro
collection PubMed
description BACKGROUND: Linear scores are widely used to predict dichotomous outcomes in biomedical studies because of their learnability and understandability. Such approaches, however, cannot be used to elucidate biodiversity when there is heterogeneous structure in target population. RESULTS: Our study was focused on describing intrinsic heterogeneity in predictions. Because heterogeneity can be captured by a clustering method, integrating different information from different clusters should yield better predictions. Accordingly, we developed a quasi-linear score, which effectively combines the linear scores of clustered markers. We extended the linear score to the quasi-linear score by a generalized average form, the Kolmogorov-Nagumo average. We observed that two shrinkage methods worked well: ridge shrinkage for estimating the quasi-linear score, and lasso shrinkage for selecting markers within each cluster. Simulation studies and applications to real data show that the proposed method has good predictive performance compared with existing methods. CONCLUSIONS: Heterogeneous structure is captured by a clustering method. Quasi-linear scores combine such heterogeneity and have a better predictive ability compared with linear scores. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1721-x) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5477283
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-54772832017-06-23 Quasi-linear score for capturing heterogeneous structure in biomarkers Omae, Katsuhiro Komori, Osamu Eguchi, Shinto BMC Bioinformatics Methodology Article BACKGROUND: Linear scores are widely used to predict dichotomous outcomes in biomedical studies because of their learnability and understandability. Such approaches, however, cannot be used to elucidate biodiversity when there is heterogeneous structure in target population. RESULTS: Our study was focused on describing intrinsic heterogeneity in predictions. Because heterogeneity can be captured by a clustering method, integrating different information from different clusters should yield better predictions. Accordingly, we developed a quasi-linear score, which effectively combines the linear scores of clustered markers. We extended the linear score to the quasi-linear score by a generalized average form, the Kolmogorov-Nagumo average. We observed that two shrinkage methods worked well: ridge shrinkage for estimating the quasi-linear score, and lasso shrinkage for selecting markers within each cluster. Simulation studies and applications to real data show that the proposed method has good predictive performance compared with existing methods. CONCLUSIONS: Heterogeneous structure is captured by a clustering method. Quasi-linear scores combine such heterogeneity and have a better predictive ability compared with linear scores. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1721-x) contains supplementary material, which is available to authorized users. BioMed Central 2017-06-19 /pmc/articles/PMC5477283/ /pubmed/28629325 http://dx.doi.org/10.1186/s12859-017-1721-x Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Omae, Katsuhiro
Komori, Osamu
Eguchi, Shinto
Quasi-linear score for capturing heterogeneous structure in biomarkers
title Quasi-linear score for capturing heterogeneous structure in biomarkers
title_full Quasi-linear score for capturing heterogeneous structure in biomarkers
title_fullStr Quasi-linear score for capturing heterogeneous structure in biomarkers
title_full_unstemmed Quasi-linear score for capturing heterogeneous structure in biomarkers
title_short Quasi-linear score for capturing heterogeneous structure in biomarkers
title_sort quasi-linear score for capturing heterogeneous structure in biomarkers
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5477283/
https://www.ncbi.nlm.nih.gov/pubmed/28629325
http://dx.doi.org/10.1186/s12859-017-1721-x
work_keys_str_mv AT omaekatsuhiro quasilinearscoreforcapturingheterogeneousstructureinbiomarkers
AT komoriosamu quasilinearscoreforcapturingheterogeneousstructureinbiomarkers
AT eguchishinto quasilinearscoreforcapturingheterogeneousstructureinbiomarkers