Cargando…

Scalable Log-ratio Lasso Regression Enhances Microbiome Feature Selection for Predictive Models

Identifying predictive biomarkers of patient outcomes from high-throughput microbiome data is of high interest in contemporary cancer research. We present FLORAL, an open-source computational tool to perform scalable log-ratio lasso regression modeling and microbial feature selection for continuous,...

Descripción completa

Detalles Bibliográficos
Autores principales: Fei, Teng, Funnell, Tyler, Waters, Nicholas R., Raj, Sandeep S., Devlin, Sean M., Dai, Anqi, Miltiadous, Oriana, Shouval, Roni, Meng, Lv, Peled, Jonathan U., Ponce, Doris M., Perales, Miguel-Angel, Gönen, Mithat, van den Brink, Marcel R. M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10187229/
https://www.ncbi.nlm.nih.gov/pubmed/37205350
http://dx.doi.org/10.1101/2023.05.02.538599
Descripción
Sumario:Identifying predictive biomarkers of patient outcomes from high-throughput microbiome data is of high interest in contemporary cancer research. We present FLORAL, an open-source computational tool to perform scalable log-ratio lasso regression modeling and microbial feature selection for continuous, binary, time-to-event, and competing risk outcomes. The proposed method adapts the augmented Lagrangian algorithm for a zero-sum constraint optimization problem while enabling a two-stage screening process for extended false-positive control. In extensive simulation studies, FLORAL achieved consistently better false-positive control compared to other lasso-based approaches and better variable selection [Formula: see text] score over popular differential abundance approaches. We demonstrate the practical utility of the proposed tool with a real data application on an allogeneic hematopoietic-cell transplantation cohort. The R package is available at https://github.com/vdblab/FLORAL.