Cargando…

Identification of novel alternative splicing biomarkers for breast cancer with LC/MS/MS and RNA-Seq

BACKGROUND: Alternative splicing isoforms have been reported as a new and robust class of diagnostic biomarkers. Over 95% of human genes are estimated to be alternatively spliced as a powerful means of producing functionally diverse proteins from a single gene. The emergence of next-generation seque...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Fan, Deng, Chris K., Wang, Mu, Deng, Bin, Barber, Robert, Huang, Gang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7713335/
https://www.ncbi.nlm.nih.gov/pubmed/33272210
http://dx.doi.org/10.1186/s12859-020-03824-8
_version_ 1783618558661492736
author Zhang, Fan
Deng, Chris K.
Wang, Mu
Deng, Bin
Barber, Robert
Huang, Gang
author_facet Zhang, Fan
Deng, Chris K.
Wang, Mu
Deng, Bin
Barber, Robert
Huang, Gang
author_sort Zhang, Fan
collection PubMed
description BACKGROUND: Alternative splicing isoforms have been reported as a new and robust class of diagnostic biomarkers. Over 95% of human genes are estimated to be alternatively spliced as a powerful means of producing functionally diverse proteins from a single gene. The emergence of next-generation sequencing technologies, especially RNA-seq, provides novel insights into large-scale detection and analysis of alternative splicing at the transcriptional level. Advances in Proteomic Technologies such as liquid chromatography coupled tandem mass spectrometry (LC–MS/MS), have shown tremendous power for the parallel characterization of large amount of proteins in biological samples. Although poor correspondence has been generally found from previous qualitative comparative analysis between proteomics and microarray data, significantly higher degrees of correlation have been observed at the level of exon. Combining protein and RNA data by searching LC–MS/MS data against a customized protein database from RNA-Seq may produce a subset of alternatively spliced protein isoform candidates that have higher confidence. RESULTS: We developed a bioinformatics workflow to discover alternative splicing biomarkers from LC–MS/MS using RNA-Seq. First, we retrieved high confident, novel alternative splicing biomarkers from the breast cancer RNA-Seq database. Then, we translated these sequences into in silico Isoform Junction Peptides, and created a customized alternative splicing database for MS searching. Lastly, we ran the Open Mass spectrometry Search Algorithm against the customized alternative splicing database with breast cancer plasma proteome. Twenty six alternative splicing biomarker peptides with one single intron event and one exon skipping event were identified. Further interpretation of biological pathways with our Integrated Pathway Analysis Database showed that these 26 peptides are associated with Cancer, Signaling, Metabolism, Regulation, Immune System and Hemostasis pathways, which are consistent with the 256 alternative splicing biomarkers from the RNA-Seq. CONCLUSIONS: This paper presents a bioinformatics workflow for using RNA-seq data to discover novel alternative splicing biomarkers from the breast cancer proteome. As a complement to synthetic alternative splicing database technique for alternative splicing identification, this method combines the advantages of two platforms: mass spectrometry and next generation sequencing and can help identify potentially highly sample-specific alternative splicing isoform biomarkers at early-stage of cancer.
format Online
Article
Text
id pubmed-7713335
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-77133352020-12-03 Identification of novel alternative splicing biomarkers for breast cancer with LC/MS/MS and RNA-Seq Zhang, Fan Deng, Chris K. Wang, Mu Deng, Bin Barber, Robert Huang, Gang BMC Bioinformatics Research BACKGROUND: Alternative splicing isoforms have been reported as a new and robust class of diagnostic biomarkers. Over 95% of human genes are estimated to be alternatively spliced as a powerful means of producing functionally diverse proteins from a single gene. The emergence of next-generation sequencing technologies, especially RNA-seq, provides novel insights into large-scale detection and analysis of alternative splicing at the transcriptional level. Advances in Proteomic Technologies such as liquid chromatography coupled tandem mass spectrometry (LC–MS/MS), have shown tremendous power for the parallel characterization of large amount of proteins in biological samples. Although poor correspondence has been generally found from previous qualitative comparative analysis between proteomics and microarray data, significantly higher degrees of correlation have been observed at the level of exon. Combining protein and RNA data by searching LC–MS/MS data against a customized protein database from RNA-Seq may produce a subset of alternatively spliced protein isoform candidates that have higher confidence. RESULTS: We developed a bioinformatics workflow to discover alternative splicing biomarkers from LC–MS/MS using RNA-Seq. First, we retrieved high confident, novel alternative splicing biomarkers from the breast cancer RNA-Seq database. Then, we translated these sequences into in silico Isoform Junction Peptides, and created a customized alternative splicing database for MS searching. Lastly, we ran the Open Mass spectrometry Search Algorithm against the customized alternative splicing database with breast cancer plasma proteome. Twenty six alternative splicing biomarker peptides with one single intron event and one exon skipping event were identified. Further interpretation of biological pathways with our Integrated Pathway Analysis Database showed that these 26 peptides are associated with Cancer, Signaling, Metabolism, Regulation, Immune System and Hemostasis pathways, which are consistent with the 256 alternative splicing biomarkers from the RNA-Seq. CONCLUSIONS: This paper presents a bioinformatics workflow for using RNA-seq data to discover novel alternative splicing biomarkers from the breast cancer proteome. As a complement to synthetic alternative splicing database technique for alternative splicing identification, this method combines the advantages of two platforms: mass spectrometry and next generation sequencing and can help identify potentially highly sample-specific alternative splicing isoform biomarkers at early-stage of cancer. BioMed Central 2020-12-03 /pmc/articles/PMC7713335/ /pubmed/33272210 http://dx.doi.org/10.1186/s12859-020-03824-8 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Zhang, Fan
Deng, Chris K.
Wang, Mu
Deng, Bin
Barber, Robert
Huang, Gang
Identification of novel alternative splicing biomarkers for breast cancer with LC/MS/MS and RNA-Seq
title Identification of novel alternative splicing biomarkers for breast cancer with LC/MS/MS and RNA-Seq
title_full Identification of novel alternative splicing biomarkers for breast cancer with LC/MS/MS and RNA-Seq
title_fullStr Identification of novel alternative splicing biomarkers for breast cancer with LC/MS/MS and RNA-Seq
title_full_unstemmed Identification of novel alternative splicing biomarkers for breast cancer with LC/MS/MS and RNA-Seq
title_short Identification of novel alternative splicing biomarkers for breast cancer with LC/MS/MS and RNA-Seq
title_sort identification of novel alternative splicing biomarkers for breast cancer with lc/ms/ms and rna-seq
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7713335/
https://www.ncbi.nlm.nih.gov/pubmed/33272210
http://dx.doi.org/10.1186/s12859-020-03824-8
work_keys_str_mv AT zhangfan identificationofnovelalternativesplicingbiomarkersforbreastcancerwithlcmsmsandrnaseq
AT dengchrisk identificationofnovelalternativesplicingbiomarkersforbreastcancerwithlcmsmsandrnaseq
AT wangmu identificationofnovelalternativesplicingbiomarkersforbreastcancerwithlcmsmsandrnaseq
AT dengbin identificationofnovelalternativesplicingbiomarkersforbreastcancerwithlcmsmsandrnaseq
AT barberrobert identificationofnovelalternativesplicingbiomarkersforbreastcancerwithlcmsmsandrnaseq
AT huanggang identificationofnovelalternativesplicingbiomarkersforbreastcancerwithlcmsmsandrnaseq