
Choice of High-Throughput Proteomics Method Affects Data Integration with Transcriptomics and the Potential Use in Biomarker Discovery

SIMPLE SUMMARY: Omics analyses provide possibilities for molecular classification of cancers to enable personalized medicine. To allow for multi-layered molecular analysis, we developed an automated protocol for the generation of proteomics data of breast cancer tumor tissue that is subjected to par...


Bibliographic Details
Main Authors: Mosquim Junior, Sergio; Siino, Valentina; Rydén, Lisa; Vallon-Christersson, Johan; Levander, Fredrik
Format: Online Article Text
Language: English
Published: MDPI 2022
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9736226/
https://www.ncbi.nlm.nih.gov/pubmed/36497242
http://dx.doi.org/10.3390/cancers14235761
author Mosquim Junior, Sergio
Siino, Valentina
Rydén, Lisa
Vallon-Christersson, Johan
Levander, Fredrik
author_sort Mosquim Junior, Sergio
collection PubMed
description SIMPLE SUMMARY: Omics analyses provide possibilities for molecular classification of cancers to enable personalized medicine. To allow for multi-layered molecular analysis, we developed an automated protocol for the generation of proteomics data of breast cancer tumor tissue that is subjected to parallel transcriptome analysis. We compare different data acquisition strategies for proteomics and settle on data-independent acquisition, achieving high correlation with RNA between samples. The proteomics data were further used for functional analyses and tumor classification, showing the potential of the methodology.
ABSTRACT: In recent years, several advances have been achieved in breast cancer (BC) classification and treatment. However, overdiagnosis, overtreatment, and recurrent disease are still significant causes of complication and death. Here, we present the development of a protocol aimed at parallel transcriptome and proteome analysis of BC tissue samples using mass spectrometry, via Data Dependent and Independent Acquisitions (DDA and DIA). Protein digestion was semi-automated and performed on flowthroughs after RNA extraction. Data for 116 samples were acquired in DDA and DIA modes and processed using MaxQuant, EncyclopeDIA, or DIA-NN. DIA-NN showed an increased number of identified proteins, reproducibility, and correlation with matching RNA-seq data, therefore representing the best alternative for this setup. Gene Set Enrichment Analysis pointed towards complementary information being found between transcriptomic and proteomic data. A decision tree model, designed to predict the intrinsic subtypes based on differentially abundant proteins across different conditions, selected protein groups that recapitulate important clinical features, such as estrogen receptor status, HER2 status, proliferation, and aggressiveness. Taken together, our results indicate that the proposed protocol performed well for the application. Additionally, the relevance of the selected proteins points to the possibility of using such data as a biomarker discovery tool for personalized medicine.
format Online
Article
Text
id pubmed-9736226
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-9736226 2022-12-11 Choice of High-Throughput Proteomics Method Affects Data Integration with Transcriptomics and the Potential Use in Biomarker Discovery Mosquim Junior, Sergio Siino, Valentina Rydén, Lisa Vallon-Christersson, Johan Levander, Fredrik Cancers (Basel) Article MDPI 2022-11-23 /pmc/articles/PMC9736226/ /pubmed/36497242 http://dx.doi.org/10.3390/cancers14235761 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/ Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
title Choice of High-Throughput Proteomics Method Affects Data Integration with Transcriptomics and the Potential Use in Biomarker Discovery
topic Article