Cargando…

ordinalbayes: Fitting Ordinal Bayesian Regression Models to High-Dimensional Data Using R

The stage of cancer is a discrete ordinal response that indicates the aggressiveness of disease and is often used by physicians to determine the type and intensity of treatment to be administered. For example, the FIGO stage in cervical cancer is based on the size and depth of the tumor as well as t...

Descripción completa

Detalles Bibliográficos
Autores principales: Archer, Kellie J., Seffernick, Anna Eames, Sun, Shuai, Zhang, Yiran
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9097970/
https://www.ncbi.nlm.nih.gov/pubmed/35574500
http://dx.doi.org/10.3390/stats5020021
_version_ 1784706279547076608
author Archer, Kellie J.
Seffernick, Anna Eames
Sun, Shuai
Zhang, Yiran
author_facet Archer, Kellie J.
Seffernick, Anna Eames
Sun, Shuai
Zhang, Yiran
author_sort Archer, Kellie J.
collection PubMed
description The stage of cancer is a discrete ordinal response that indicates the aggressiveness of disease and is often used by physicians to determine the type and intensity of treatment to be administered. For example, the FIGO stage in cervical cancer is based on the size and depth of the tumor as well as the level of spread. It may be of clinical relevance to identify molecular features from high-throughput genomic assays that are associated with the stage of cervical cancer to elucidate pathways related to tumor aggressiveness, identify improved molecular features that may be useful for staging, and identify therapeutic targets. High-throughput RNA-Seq data and corresponding clinical data (including stage) for cervical cancer patients have been made available through The Cancer Genome Atlas Project (TCGA). We recently described penalized Bayesian ordinal response models that can be used for variable selection for over-parameterized datasets, such as the TCGA-CESC dataset. Herein, we describe our ordinalbayes R package, available from the Comprehensive R Archive Network (CRAN), which enhances the runjags R package by enabling users to easily fit cumulative logit models when the outcome is ordinal and the number of predictors exceeds the sample size, P > N, such as for TCGA and other high-throughput genomic data. We demonstrate the use of this package by applying it to the TCGA cervical cancer dataset. Our ordinalbayes package can be used to fit models to high-dimensional datasets, and it effectively performs variable selection.
format Online
Article
Text
id pubmed-9097970
institution National Center for Biotechnology Information
language English
publishDate 2022
record_format MEDLINE/PubMed
spelling pubmed-90979702022-06-01 ordinalbayes: Fitting Ordinal Bayesian Regression Models to High-Dimensional Data Using R Archer, Kellie J. Seffernick, Anna Eames Sun, Shuai Zhang, Yiran Stats (Basel) Article The stage of cancer is a discrete ordinal response that indicates the aggressiveness of disease and is often used by physicians to determine the type and intensity of treatment to be administered. For example, the FIGO stage in cervical cancer is based on the size and depth of the tumor as well as the level of spread. It may be of clinical relevance to identify molecular features from high-throughput genomic assays that are associated with the stage of cervical cancer to elucidate pathways related to tumor aggressiveness, identify improved molecular features that may be useful for staging, and identify therapeutic targets. High-throughput RNA-Seq data and corresponding clinical data (including stage) for cervical cancer patients have been made available through The Cancer Genome Atlas Project (TCGA). We recently described penalized Bayesian ordinal response models that can be used for variable selection for over-parameterized datasets, such as the TCGA-CESC dataset. Herein, we describe our ordinalbayes R package, available from the Comprehensive R Archive Network (CRAN), which enhances the runjags R package by enabling users to easily fit cumulative logit models when the outcome is ordinal and the number of predictors exceeds the sample size, P > N, such as for TCGA and other high-throughput genomic data. We demonstrate the use of this package by applying it to the TCGA cervical cancer dataset. Our ordinalbayes package can be used to fit models to high-dimensional datasets, and it effectively performs variable selection. 2022-06 2022-04-15 /pmc/articles/PMC9097970/ /pubmed/35574500 http://dx.doi.org/10.3390/stats5020021 Text en https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Archer, Kellie J.
Seffernick, Anna Eames
Sun, Shuai
Zhang, Yiran
ordinalbayes: Fitting Ordinal Bayesian Regression Models to High-Dimensional Data Using R
title ordinalbayes: Fitting Ordinal Bayesian Regression Models to High-Dimensional Data Using R
title_full ordinalbayes: Fitting Ordinal Bayesian Regression Models to High-Dimensional Data Using R
title_fullStr ordinalbayes: Fitting Ordinal Bayesian Regression Models to High-Dimensional Data Using R
title_full_unstemmed ordinalbayes: Fitting Ordinal Bayesian Regression Models to High-Dimensional Data Using R
title_short ordinalbayes: Fitting Ordinal Bayesian Regression Models to High-Dimensional Data Using R
title_sort ordinalbayes: fitting ordinal bayesian regression models to high-dimensional data using r
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9097970/
https://www.ncbi.nlm.nih.gov/pubmed/35574500
http://dx.doi.org/10.3390/stats5020021
work_keys_str_mv AT archerkelliej ordinalbayesfittingordinalbayesianregressionmodelstohighdimensionaldatausingr
AT seffernickannaeames ordinalbayesfittingordinalbayesianregressionmodelstohighdimensionaldatausingr
AT sunshuai ordinalbayesfittingordinalbayesianregressionmodelstohighdimensionaldatausingr
AT zhangyiran ordinalbayesfittingordinalbayesianregressionmodelstohighdimensionaldatausingr