Cargando…

Optimal microRNA Sequencing Depth to Predict Cancer Patient Survival with Random Forest and Cox Models

(1) Background: tumor profiling enables patient survival prediction. The two essential parameters to be calibrated when designing a study based on tumor profiles from a cohort are the sequencing depth of RNA-seq technology and the number of patients. This calibration is carried out under cost constr...

Descripción completa

Detalles Bibliográficos
Autores principales: Jardillier, Rémy, Koca, Dzenis, Chatelain, Florent, Guyon, Laurent
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9777708/
https://www.ncbi.nlm.nih.gov/pubmed/36553544
http://dx.doi.org/10.3390/genes13122275
_version_ 1784856172104253440
author Jardillier, Rémy
Koca, Dzenis
Chatelain, Florent
Guyon, Laurent
author_facet Jardillier, Rémy
Koca, Dzenis
Chatelain, Florent
Guyon, Laurent
author_sort Jardillier, Rémy
collection PubMed
description (1) Background: tumor profiling enables patient survival prediction. The two essential parameters to be calibrated when designing a study based on tumor profiles from a cohort are the sequencing depth of RNA-seq technology and the number of patients. This calibration is carried out under cost constraints, and a compromise has to be found. In the context of survival data, the goal of this work is to benchmark the impact of the number of patients and of the sequencing depth of miRNA-seq and mRNA-seq on the predictive capabilities for both the Cox model with elastic net penalty and random survival forest. (2) Results: we first show that the Cox model and random survival forest provide comparable prediction capabilities, with significant differences for some cancers. Second, we demonstrate that miRNA and/or mRNA data improve prediction over clinical data alone. mRNA-seq data leads to slightly better prediction than miRNA-seq, with the notable exception of lung adenocarcinoma for which the tumor miRNA profile shows higher predictive power. Third, we demonstrate that the sequencing depth of RNA-seq data can be reduced for most of the investigated cancers without degrading the prediction abilities, allowing the creation of independent validation sets at a lower cost. Finally, we show that the number of patients in the training dataset can be reduced for the Cox model and random survival forest, allowing the use of different models on different patient subgroups.
format Online
Article
Text
id pubmed-9777708
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-97777082022-12-23 Optimal microRNA Sequencing Depth to Predict Cancer Patient Survival with Random Forest and Cox Models Jardillier, Rémy Koca, Dzenis Chatelain, Florent Guyon, Laurent Genes (Basel) Article (1) Background: tumor profiling enables patient survival prediction. The two essential parameters to be calibrated when designing a study based on tumor profiles from a cohort are the sequencing depth of RNA-seq technology and the number of patients. This calibration is carried out under cost constraints, and a compromise has to be found. In the context of survival data, the goal of this work is to benchmark the impact of the number of patients and of the sequencing depth of miRNA-seq and mRNA-seq on the predictive capabilities for both the Cox model with elastic net penalty and random survival forest. (2) Results: we first show that the Cox model and random survival forest provide comparable prediction capabilities, with significant differences for some cancers. Second, we demonstrate that miRNA and/or mRNA data improve prediction over clinical data alone. mRNA-seq data leads to slightly better prediction than miRNA-seq, with the notable exception of lung adenocarcinoma for which the tumor miRNA profile shows higher predictive power. Third, we demonstrate that the sequencing depth of RNA-seq data can be reduced for most of the investigated cancers without degrading the prediction abilities, allowing the creation of independent validation sets at a lower cost. Finally, we show that the number of patients in the training dataset can be reduced for the Cox model and random survival forest, allowing the use of different models on different patient subgroups. MDPI 2022-12-02 /pmc/articles/PMC9777708/ /pubmed/36553544 http://dx.doi.org/10.3390/genes13122275 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Jardillier, Rémy
Koca, Dzenis
Chatelain, Florent
Guyon, Laurent
Optimal microRNA Sequencing Depth to Predict Cancer Patient Survival with Random Forest and Cox Models
title Optimal microRNA Sequencing Depth to Predict Cancer Patient Survival with Random Forest and Cox Models
title_full Optimal microRNA Sequencing Depth to Predict Cancer Patient Survival with Random Forest and Cox Models
title_fullStr Optimal microRNA Sequencing Depth to Predict Cancer Patient Survival with Random Forest and Cox Models
title_full_unstemmed Optimal microRNA Sequencing Depth to Predict Cancer Patient Survival with Random Forest and Cox Models
title_short Optimal microRNA Sequencing Depth to Predict Cancer Patient Survival with Random Forest and Cox Models
title_sort optimal microrna sequencing depth to predict cancer patient survival with random forest and cox models
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9777708/
https://www.ncbi.nlm.nih.gov/pubmed/36553544
http://dx.doi.org/10.3390/genes13122275
work_keys_str_mv AT jardillierremy optimalmicrornasequencingdepthtopredictcancerpatientsurvivalwithrandomforestandcoxmodels
AT kocadzenis optimalmicrornasequencingdepthtopredictcancerpatientsurvivalwithrandomforestandcoxmodels
AT chatelainflorent optimalmicrornasequencingdepthtopredictcancerpatientsurvivalwithrandomforestandcoxmodels
AT guyonlaurent optimalmicrornasequencingdepthtopredictcancerpatientsurvivalwithrandomforestandcoxmodels