Cargando…

Reference-free transcriptome signatures for prostate cancer prognosis

BACKGROUND: RNA-seq data are increasingly used to derive prognostic signatures for cancer outcome prediction. A limitation of current predictors is their reliance on reference gene annotations, which amounts to ignoring large numbers of non-canonical RNAs produced in disease tissues. A recently intr...

Descripción completa

Detalles Bibliográficos
Autores principales: Nguyen, Ha T.N., Xue, Haoliang, Firlej, Virginie, Ponty, Yann, Gallopin, Melina, Gautheret, Daniel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8040209/
https://www.ncbi.nlm.nih.gov/pubmed/33845808
http://dx.doi.org/10.1186/s12885-021-08021-1
Descripción
Sumario:BACKGROUND: RNA-seq data are increasingly used to derive prognostic signatures for cancer outcome prediction. A limitation of current predictors is their reliance on reference gene annotations, which amounts to ignoring large numbers of non-canonical RNAs produced in disease tissues. A recently introduced kind of transcriptome classifier operates entirely in a reference-free manner, relying on k-mers extracted from patient RNA-seq data. METHODS: In this paper, we set out to compare conventional and reference-free signatures in risk and relapse prediction of prostate cancer. To compare the two approaches as fairly as possible, we set up a common procedure that takes as input either a k-mer count matrix or a gene expression matrix, extracts a signature and evaluates this signature in an independent dataset. RESULTS: We find that both gene-based and k-mer based classifiers had similarly high performances for risk prediction and a markedly lower performance for relapse prediction. Interestingly, the reference-free signatures included a set of sequences mapping to novel lncRNAs or variable regions of cancer driver genes that were not part of gene-based signatures. CONCLUSIONS: Reference-free classifiers are thus a promising strategy for the identification of novel prognostic RNA biomarkers. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s12885-021-08021-1).