Cargando…

DECO: decompose heterogeneous population cohorts for patient stratification and discovery of sample biomarkers using omic data profiling

MOTIVATION: Patient and sample diversity is one of the main challenges when dealing with clinical cohorts in biomedical genomics studies. During last decade, several methods have been developed to identify biomarkers assigned to specific individuals or subtypes of samples. However, current methods s...

Descripción completa

Detalles Bibliográficos
Autores principales: Campos-Laborie, F J, Risueño, A, Ortiz-Estévez, M, Rosón-Burgo, B, Droste, C, Fontanillo, C, Loos, R, Sánchez-Santos, J M, Trotter, M W, De Las Rivas, J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6761977/
https://www.ncbi.nlm.nih.gov/pubmed/30824909
http://dx.doi.org/10.1093/bioinformatics/btz148
_version_ 1783454133529870336
author Campos-Laborie, F J
Risueño, A
Ortiz-Estévez, M
Rosón-Burgo, B
Droste, C
Fontanillo, C
Loos, R
Sánchez-Santos, J M
Trotter, M W
De Las Rivas, J
author_facet Campos-Laborie, F J
Risueño, A
Ortiz-Estévez, M
Rosón-Burgo, B
Droste, C
Fontanillo, C
Loos, R
Sánchez-Santos, J M
Trotter, M W
De Las Rivas, J
author_sort Campos-Laborie, F J
collection PubMed
description MOTIVATION: Patient and sample diversity is one of the main challenges when dealing with clinical cohorts in biomedical genomics studies. During last decade, several methods have been developed to identify biomarkers assigned to specific individuals or subtypes of samples. However, current methods still fail to discover markers in complex scenarios where heterogeneity or hidden phenotypical factors are present. Here, we propose a method to analyze and understand heterogeneous data avoiding classical normalization approaches of reducing or removing variation. RESULTS: DEcomposing heterogeneous Cohorts using Omic data profiling (DECO) is a method to find significant association among biological features (biomarkers) and samples (individuals) analyzing large-scale omic data. The method identifies and categorizes biomarkers of specific phenotypic conditions based on a recurrent differential analysis integrated with a non-symmetrical correspondence analysis. DECO integrates both omic data dispersion and predictor–response relationship from non-symmetrical correspondence analysis in a unique statistic (called h-statistic), allowing the identification of closely related sample categories within complex cohorts. The performance is demonstrated using simulated data and five experimental transcriptomic datasets, and comparing to seven other methods. We show DECO greatly enhances the discovery and subtle identification of biomarkers, making it especially suited for deep and accurate patient stratification. AVAILABILITY AND IMPLEMENTATION: DECO is freely available as an R package (including a practical vignette) at Bioconductor repository (http://bioconductor.org/packages/deco/). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-6761977
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-67619772019-10-02 DECO: decompose heterogeneous population cohorts for patient stratification and discovery of sample biomarkers using omic data profiling Campos-Laborie, F J Risueño, A Ortiz-Estévez, M Rosón-Burgo, B Droste, C Fontanillo, C Loos, R Sánchez-Santos, J M Trotter, M W De Las Rivas, J Bioinformatics Original Papers MOTIVATION: Patient and sample diversity is one of the main challenges when dealing with clinical cohorts in biomedical genomics studies. During last decade, several methods have been developed to identify biomarkers assigned to specific individuals or subtypes of samples. However, current methods still fail to discover markers in complex scenarios where heterogeneity or hidden phenotypical factors are present. Here, we propose a method to analyze and understand heterogeneous data avoiding classical normalization approaches of reducing or removing variation. RESULTS: DEcomposing heterogeneous Cohorts using Omic data profiling (DECO) is a method to find significant association among biological features (biomarkers) and samples (individuals) analyzing large-scale omic data. The method identifies and categorizes biomarkers of specific phenotypic conditions based on a recurrent differential analysis integrated with a non-symmetrical correspondence analysis. DECO integrates both omic data dispersion and predictor–response relationship from non-symmetrical correspondence analysis in a unique statistic (called h-statistic), allowing the identification of closely related sample categories within complex cohorts. The performance is demonstrated using simulated data and five experimental transcriptomic datasets, and comparing to seven other methods. We show DECO greatly enhances the discovery and subtle identification of biomarkers, making it especially suited for deep and accurate patient stratification. AVAILABILITY AND IMPLEMENTATION: DECO is freely available as an R package (including a practical vignette) at Bioconductor repository (http://bioconductor.org/packages/deco/). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2019-10-01 2019-03-01 /pmc/articles/PMC6761977/ /pubmed/30824909 http://dx.doi.org/10.1093/bioinformatics/btz148 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
Campos-Laborie, F J
Risueño, A
Ortiz-Estévez, M
Rosón-Burgo, B
Droste, C
Fontanillo, C
Loos, R
Sánchez-Santos, J M
Trotter, M W
De Las Rivas, J
DECO: decompose heterogeneous population cohorts for patient stratification and discovery of sample biomarkers using omic data profiling
title DECO: decompose heterogeneous population cohorts for patient stratification and discovery of sample biomarkers using omic data profiling
title_full DECO: decompose heterogeneous population cohorts for patient stratification and discovery of sample biomarkers using omic data profiling
title_fullStr DECO: decompose heterogeneous population cohorts for patient stratification and discovery of sample biomarkers using omic data profiling
title_full_unstemmed DECO: decompose heterogeneous population cohorts for patient stratification and discovery of sample biomarkers using omic data profiling
title_short DECO: decompose heterogeneous population cohorts for patient stratification and discovery of sample biomarkers using omic data profiling
title_sort deco: decompose heterogeneous population cohorts for patient stratification and discovery of sample biomarkers using omic data profiling
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6761977/
https://www.ncbi.nlm.nih.gov/pubmed/30824909
http://dx.doi.org/10.1093/bioinformatics/btz148
work_keys_str_mv AT camposlaboriefj decodecomposeheterogeneouspopulationcohortsforpatientstratificationanddiscoveryofsamplebiomarkersusingomicdataprofiling
AT risuenoa decodecomposeheterogeneouspopulationcohortsforpatientstratificationanddiscoveryofsamplebiomarkersusingomicdataprofiling
AT ortizestevezm decodecomposeheterogeneouspopulationcohortsforpatientstratificationanddiscoveryofsamplebiomarkersusingomicdataprofiling
AT rosonburgob decodecomposeheterogeneouspopulationcohortsforpatientstratificationanddiscoveryofsamplebiomarkersusingomicdataprofiling
AT drostec decodecomposeheterogeneouspopulationcohortsforpatientstratificationanddiscoveryofsamplebiomarkersusingomicdataprofiling
AT fontanilloc decodecomposeheterogeneouspopulationcohortsforpatientstratificationanddiscoveryofsamplebiomarkersusingomicdataprofiling
AT loosr decodecomposeheterogeneouspopulationcohortsforpatientstratificationanddiscoveryofsamplebiomarkersusingomicdataprofiling
AT sanchezsantosjm decodecomposeheterogeneouspopulationcohortsforpatientstratificationanddiscoveryofsamplebiomarkersusingomicdataprofiling
AT trottermw decodecomposeheterogeneouspopulationcohortsforpatientstratificationanddiscoveryofsamplebiomarkersusingomicdataprofiling
AT delasrivasj decodecomposeheterogeneouspopulationcohortsforpatientstratificationanddiscoveryofsamplebiomarkersusingomicdataprofiling