Cargando…

Assessment of emerging pretraining strategies in interpretable multimodal deep learning for cancer prognostication

BACKGROUND: Deep learning models can infer cancer patient prognosis from molecular and anatomic pathology information. Recent studies that leveraged information from complementary multimodal data improved prognostication, further illustrating the potential utility of such methods. However, current a...

Descripción completa

Detalles Bibliográficos
Autores principales: Azher, Zarif L., Suvarna, Anish, Chen, Ji-Qing, Zhang, Ze, Christensen, Brock C., Salas, Lucas A., Vaickus, Louis J., Levy, Joshua J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10363299/
https://www.ncbi.nlm.nih.gov/pubmed/37481666
http://dx.doi.org/10.1186/s13040-023-00338-w
Descripción
Sumario:BACKGROUND: Deep learning models can infer cancer patient prognosis from molecular and anatomic pathology information. Recent studies that leveraged information from complementary multimodal data improved prognostication, further illustrating the potential utility of such methods. However, current approaches: 1) do not comprehensively leverage biological and histomorphological relationships and 2) make use of emerging strategies to “pretrain” models (i.e., train models on a slightly orthogonal dataset/modeling objective) which may aid prognostication by reducing the amount of information required for achieving optimal performance. In addition, model interpretation is crucial for facilitating the clinical adoption of deep learning methods by fostering practitioner understanding and trust in the technology. METHODS: Here, we develop an interpretable multimodal modeling framework that combines DNA methylation, gene expression, and histopathology (i.e., tissue slides) data, and we compare performance of crossmodal pretraining, contrastive learning, and transfer learning versus the standard procedure. RESULTS: Our models outperform the existing state-of-the-art method (average 11.54% C-index increase), and baseline clinically driven models (average 11.7% C-index increase). Model interpretations elucidate consideration of biologically meaningful factors in making prognosis predictions. DISCUSSION: Our results demonstrate that the selection of pretraining strategies is crucial for obtaining highly accurate prognostication models, even more so than devising an innovative model architecture, and further emphasize the all-important role of the tumor microenvironment on disease progression. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13040-023-00338-w.