Cargando…
Optimal microRNA Sequencing Depth to Predict Cancer Patient Survival with Random Forest and Cox Models
(1) Background: tumor profiling enables patient survival prediction. The two essential parameters to be calibrated when designing a study based on tumor profiles from a cohort are the sequencing depth of RNA-seq technology and the number of patients. This calibration is carried out under cost constr...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9777708/ https://www.ncbi.nlm.nih.gov/pubmed/36553544 http://dx.doi.org/10.3390/genes13122275 |
_version_ | 1784856172104253440 |
---|---|
author | Jardillier, Rémy Koca, Dzenis Chatelain, Florent Guyon, Laurent |
author_facet | Jardillier, Rémy Koca, Dzenis Chatelain, Florent Guyon, Laurent |
author_sort | Jardillier, Rémy |
collection | PubMed |
description | (1) Background: tumor profiling enables patient survival prediction. The two essential parameters to be calibrated when designing a study based on tumor profiles from a cohort are the sequencing depth of RNA-seq technology and the number of patients. This calibration is carried out under cost constraints, and a compromise has to be found. In the context of survival data, the goal of this work is to benchmark the impact of the number of patients and of the sequencing depth of miRNA-seq and mRNA-seq on the predictive capabilities for both the Cox model with elastic net penalty and random survival forest. (2) Results: we first show that the Cox model and random survival forest provide comparable prediction capabilities, with significant differences for some cancers. Second, we demonstrate that miRNA and/or mRNA data improve prediction over clinical data alone. mRNA-seq data leads to slightly better prediction than miRNA-seq, with the notable exception of lung adenocarcinoma for which the tumor miRNA profile shows higher predictive power. Third, we demonstrate that the sequencing depth of RNA-seq data can be reduced for most of the investigated cancers without degrading the prediction abilities, allowing the creation of independent validation sets at a lower cost. Finally, we show that the number of patients in the training dataset can be reduced for the Cox model and random survival forest, allowing the use of different models on different patient subgroups. |
format | Online Article Text |
id | pubmed-9777708 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-97777082022-12-23 Optimal microRNA Sequencing Depth to Predict Cancer Patient Survival with Random Forest and Cox Models Jardillier, Rémy Koca, Dzenis Chatelain, Florent Guyon, Laurent Genes (Basel) Article (1) Background: tumor profiling enables patient survival prediction. The two essential parameters to be calibrated when designing a study based on tumor profiles from a cohort are the sequencing depth of RNA-seq technology and the number of patients. This calibration is carried out under cost constraints, and a compromise has to be found. In the context of survival data, the goal of this work is to benchmark the impact of the number of patients and of the sequencing depth of miRNA-seq and mRNA-seq on the predictive capabilities for both the Cox model with elastic net penalty and random survival forest. (2) Results: we first show that the Cox model and random survival forest provide comparable prediction capabilities, with significant differences for some cancers. Second, we demonstrate that miRNA and/or mRNA data improve prediction over clinical data alone. mRNA-seq data leads to slightly better prediction than miRNA-seq, with the notable exception of lung adenocarcinoma for which the tumor miRNA profile shows higher predictive power. Third, we demonstrate that the sequencing depth of RNA-seq data can be reduced for most of the investigated cancers without degrading the prediction abilities, allowing the creation of independent validation sets at a lower cost. Finally, we show that the number of patients in the training dataset can be reduced for the Cox model and random survival forest, allowing the use of different models on different patient subgroups. MDPI 2022-12-02 /pmc/articles/PMC9777708/ /pubmed/36553544 http://dx.doi.org/10.3390/genes13122275 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Jardillier, Rémy Koca, Dzenis Chatelain, Florent Guyon, Laurent Optimal microRNA Sequencing Depth to Predict Cancer Patient Survival with Random Forest and Cox Models |
title | Optimal microRNA Sequencing Depth to Predict Cancer Patient Survival with Random Forest and Cox Models |
title_full | Optimal microRNA Sequencing Depth to Predict Cancer Patient Survival with Random Forest and Cox Models |
title_fullStr | Optimal microRNA Sequencing Depth to Predict Cancer Patient Survival with Random Forest and Cox Models |
title_full_unstemmed | Optimal microRNA Sequencing Depth to Predict Cancer Patient Survival with Random Forest and Cox Models |
title_short | Optimal microRNA Sequencing Depth to Predict Cancer Patient Survival with Random Forest and Cox Models |
title_sort | optimal microrna sequencing depth to predict cancer patient survival with random forest and cox models |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9777708/ https://www.ncbi.nlm.nih.gov/pubmed/36553544 http://dx.doi.org/10.3390/genes13122275 |
work_keys_str_mv | AT jardillierremy optimalmicrornasequencingdepthtopredictcancerpatientsurvivalwithrandomforestandcoxmodels AT kocadzenis optimalmicrornasequencingdepthtopredictcancerpatientsurvivalwithrandomforestandcoxmodels AT chatelainflorent optimalmicrornasequencingdepthtopredictcancerpatientsurvivalwithrandomforestandcoxmodels AT guyonlaurent optimalmicrornasequencingdepthtopredictcancerpatientsurvivalwithrandomforestandcoxmodels |