Cargando…

Priority-Lasso: a simple hierarchical approach to the prediction of clinical outcome using multi-omics data

BACKGROUND: The inclusion of high-dimensional omics data in prediction models has become a well-studied topic in the last decades. Although most of these methods do not account for possibly different types of variables in the set of covariates available in the same dataset, there are many such scena...

Descripción completa

Detalles Bibliográficos
Autores principales: Klau, Simon, Jurinovic, Vindi, Hornung, Roman, Herold, Tobias, Boulesteix, Anne-Laure
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6134797/
https://www.ncbi.nlm.nih.gov/pubmed/30208855
http://dx.doi.org/10.1186/s12859-018-2344-6
_version_ 1783354731774607360
author Klau, Simon
Jurinovic, Vindi
Hornung, Roman
Herold, Tobias
Boulesteix, Anne-Laure
author_facet Klau, Simon
Jurinovic, Vindi
Hornung, Roman
Herold, Tobias
Boulesteix, Anne-Laure
author_sort Klau, Simon
collection PubMed
description BACKGROUND: The inclusion of high-dimensional omics data in prediction models has become a well-studied topic in the last decades. Although most of these methods do not account for possibly different types of variables in the set of covariates available in the same dataset, there are many such scenarios where the variables can be structured in blocks of different types, e.g., clinical, transcriptomic, and methylation data. To date, there exist a few computationally intensive approaches that make use of block structures of this kind. RESULTS: In this paper we present priority-Lasso, an intuitive and practical analysis strategy for building prediction models based on Lasso that takes such block structures into account. It requires the definition of a priority order of blocks of data. Lasso models are calculated successively for every block and the fitted values of every step are included as an offset in the fit of the next step. We apply priority-Lasso in different settings on an acute myeloid leukemia (AML) dataset consisting of clinical variables, cytogenetics, gene mutations and expression variables, and compare its performance on an independent validation dataset to the performance of standard Lasso models. CONCLUSION: The results show that priority-Lasso is able to keep pace with Lasso in terms of prediction accuracy. Variables of blocks with higher priorities are favored over variables of blocks with lower priority, which results in easily usable and transportable models for clinical practice. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2344-6) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6134797
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-61347972018-09-15 Priority-Lasso: a simple hierarchical approach to the prediction of clinical outcome using multi-omics data Klau, Simon Jurinovic, Vindi Hornung, Roman Herold, Tobias Boulesteix, Anne-Laure BMC Bioinformatics Methodology Article BACKGROUND: The inclusion of high-dimensional omics data in prediction models has become a well-studied topic in the last decades. Although most of these methods do not account for possibly different types of variables in the set of covariates available in the same dataset, there are many such scenarios where the variables can be structured in blocks of different types, e.g., clinical, transcriptomic, and methylation data. To date, there exist a few computationally intensive approaches that make use of block structures of this kind. RESULTS: In this paper we present priority-Lasso, an intuitive and practical analysis strategy for building prediction models based on Lasso that takes such block structures into account. It requires the definition of a priority order of blocks of data. Lasso models are calculated successively for every block and the fitted values of every step are included as an offset in the fit of the next step. We apply priority-Lasso in different settings on an acute myeloid leukemia (AML) dataset consisting of clinical variables, cytogenetics, gene mutations and expression variables, and compare its performance on an independent validation dataset to the performance of standard Lasso models. CONCLUSION: The results show that priority-Lasso is able to keep pace with Lasso in terms of prediction accuracy. Variables of blocks with higher priorities are favored over variables of blocks with lower priority, which results in easily usable and transportable models for clinical practice. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2344-6) contains supplementary material, which is available to authorized users. BioMed Central 2018-09-12 /pmc/articles/PMC6134797/ /pubmed/30208855 http://dx.doi.org/10.1186/s12859-018-2344-6 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Klau, Simon
Jurinovic, Vindi
Hornung, Roman
Herold, Tobias
Boulesteix, Anne-Laure
Priority-Lasso: a simple hierarchical approach to the prediction of clinical outcome using multi-omics data
title Priority-Lasso: a simple hierarchical approach to the prediction of clinical outcome using multi-omics data
title_full Priority-Lasso: a simple hierarchical approach to the prediction of clinical outcome using multi-omics data
title_fullStr Priority-Lasso: a simple hierarchical approach to the prediction of clinical outcome using multi-omics data
title_full_unstemmed Priority-Lasso: a simple hierarchical approach to the prediction of clinical outcome using multi-omics data
title_short Priority-Lasso: a simple hierarchical approach to the prediction of clinical outcome using multi-omics data
title_sort priority-lasso: a simple hierarchical approach to the prediction of clinical outcome using multi-omics data
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6134797/
https://www.ncbi.nlm.nih.gov/pubmed/30208855
http://dx.doi.org/10.1186/s12859-018-2344-6
work_keys_str_mv AT klausimon prioritylassoasimplehierarchicalapproachtothepredictionofclinicaloutcomeusingmultiomicsdata
AT jurinovicvindi prioritylassoasimplehierarchicalapproachtothepredictionofclinicaloutcomeusingmultiomicsdata
AT hornungroman prioritylassoasimplehierarchicalapproachtothepredictionofclinicaloutcomeusingmultiomicsdata
AT heroldtobias prioritylassoasimplehierarchicalapproachtothepredictionofclinicaloutcomeusingmultiomicsdata
AT boulesteixannelaure prioritylassoasimplehierarchicalapproachtothepredictionofclinicaloutcomeusingmultiomicsdata