Cargando…

Choice of High-Throughput Proteomics Method Affects Data Integration with Transcriptomics and the Potential Use in Biomarker Discovery

SIMPLE SUMMARY: Omics analyses provide possibilities for molecular classification of cancers to enable personalized medicine. To allow for multi-layered molecular analysis, we developed an automated protocol for the generation of proteomics data of breast cancer tumor tissue that is subjected to par...

Descripción completa

Detalles Bibliográficos
Autores principales: Mosquim Junior, Sergio, Siino, Valentina, Rydén, Lisa, Vallon-Christersson, Johan, Levander, Fredrik
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9736226/
https://www.ncbi.nlm.nih.gov/pubmed/36497242
http://dx.doi.org/10.3390/cancers14235761
Descripción
Sumario:SIMPLE SUMMARY: Omics analyses provide possibilities for molecular classification of cancers to enable personalized medicine. To allow for multi-layered molecular analysis, we developed an automated protocol for the generation of proteomics data of breast cancer tumor tissue that is subjected to parallel transcriptome analysis. We compare different data acquisition strategies for proteomics and settle on data-independent acquisition, achieving high correlation with RNA between samples. The proteomics data were further used for functional analyses and tumor classification, showing the potential of the methodology. ABSTRACT: In recent years, several advances have been achieved in breast cancer (BC) classification and treatment. However, overdiagnosis, overtreatment, and recurrent disease are still significant causes of complication and death. Here, we present the development of a protocol aimed at parallel transcriptome and proteome analysis of BC tissue samples using mass spectrometry, via Data Dependent and Independent Acquisitions (DDA and DIA). Protein digestion was semi-automated and performed on flowthroughs after RNA extraction. Data for 116 samples were acquired in DDA and DIA modes and processed using MaxQuant, EncyclopeDIA, or DIA-NN. DIA-NN showed an increased number of identified proteins, reproducibility, and correlation with matching RNA-seq data, therefore representing the best alternative for this setup. Gene Set Enrichment Analysis pointed towards complementary information being found between transcriptomic and proteomic data. A decision tree model, designed to predict the intrinsic subtypes based on differentially abundant proteins across different conditions, selected protein groups that recapitulate important clinical features, such as estrogen receptor status, HER2 status, proliferation, and aggressiveness. Taken together, our results indicate that the proposed protocol performed well for the application. Additionally, the relevance of the selected proteins points to the possibility of using such data as a biomarker discovery tool for personalized medicine.