Cargando…

Evaluation of top-down mass spectral identification with homologous protein sequences

BACKGROUND: Top-down mass spectrometry has unique advantages in identifying proteoforms with multiple post-translational modifications and/or unknown alterations. Most software tools in this area search top-down mass spectra against a protein sequence database for proteoform identification. When the...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Ziwei, He, Bo, Kou, Qiang, Wang, Zhe, Wu, Si, Liu, Yunlong, Feng, Weixing, Liu, Xiaowen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6309053/
https://www.ncbi.nlm.nih.gov/pubmed/30591035
http://dx.doi.org/10.1186/s12859-018-2462-1
_version_ 1783383329260699648
author Li, Ziwei
He, Bo
Kou, Qiang
Wang, Zhe
Wu, Si
Liu, Yunlong
Feng, Weixing
Liu, Xiaowen
author_facet Li, Ziwei
He, Bo
Kou, Qiang
Wang, Zhe
Wu, Si
Liu, Yunlong
Feng, Weixing
Liu, Xiaowen
author_sort Li, Ziwei
collection PubMed
description BACKGROUND: Top-down mass spectrometry has unique advantages in identifying proteoforms with multiple post-translational modifications and/or unknown alterations. Most software tools in this area search top-down mass spectra against a protein sequence database for proteoform identification. When the species studied in a mass spectrometry experiment lacks its proteome sequence database, a homologous protein sequence database can be used for proteoform identification. The accuracy of homologous protein sequences affects the sensitivity of proteoform identification and the accuracy of mass shift localization. RESULTS: We tested TopPIC, a commonly used software tool for top-down mass spectral identification, on a top-down mass spectrometry data set of Escherichia coli K12 MG1655, and evaluated its performance using an Escherichia coli K12 MG1655 proteome database and a homologous protein database. The number of identified spectra with the homologous database was about half of that with the Escherichia coli K12 MG1655 database. We also tested TopPIC on a top-down mass spectrometry data set of human MCF-7 cells and obtained similar results. CONCLUSIONS: Experimental results demonstrated that TopPIC is capable of identifying many proteoform spectrum matches and localizing unknown alterations using homologous protein sequences containing no more than 2 mutations. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2462-1) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6309053
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-63090532019-01-03 Evaluation of top-down mass spectral identification with homologous protein sequences Li, Ziwei He, Bo Kou, Qiang Wang, Zhe Wu, Si Liu, Yunlong Feng, Weixing Liu, Xiaowen BMC Bioinformatics Research BACKGROUND: Top-down mass spectrometry has unique advantages in identifying proteoforms with multiple post-translational modifications and/or unknown alterations. Most software tools in this area search top-down mass spectra against a protein sequence database for proteoform identification. When the species studied in a mass spectrometry experiment lacks its proteome sequence database, a homologous protein sequence database can be used for proteoform identification. The accuracy of homologous protein sequences affects the sensitivity of proteoform identification and the accuracy of mass shift localization. RESULTS: We tested TopPIC, a commonly used software tool for top-down mass spectral identification, on a top-down mass spectrometry data set of Escherichia coli K12 MG1655, and evaluated its performance using an Escherichia coli K12 MG1655 proteome database and a homologous protein database. The number of identified spectra with the homologous database was about half of that with the Escherichia coli K12 MG1655 database. We also tested TopPIC on a top-down mass spectrometry data set of human MCF-7 cells and obtained similar results. CONCLUSIONS: Experimental results demonstrated that TopPIC is capable of identifying many proteoform spectrum matches and localizing unknown alterations using homologous protein sequences containing no more than 2 mutations. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2462-1) contains supplementary material, which is available to authorized users. BioMed Central 2018-12-28 /pmc/articles/PMC6309053/ /pubmed/30591035 http://dx.doi.org/10.1186/s12859-018-2462-1 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Li, Ziwei
He, Bo
Kou, Qiang
Wang, Zhe
Wu, Si
Liu, Yunlong
Feng, Weixing
Liu, Xiaowen
Evaluation of top-down mass spectral identification with homologous protein sequences
title Evaluation of top-down mass spectral identification with homologous protein sequences
title_full Evaluation of top-down mass spectral identification with homologous protein sequences
title_fullStr Evaluation of top-down mass spectral identification with homologous protein sequences
title_full_unstemmed Evaluation of top-down mass spectral identification with homologous protein sequences
title_short Evaluation of top-down mass spectral identification with homologous protein sequences
title_sort evaluation of top-down mass spectral identification with homologous protein sequences
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6309053/
https://www.ncbi.nlm.nih.gov/pubmed/30591035
http://dx.doi.org/10.1186/s12859-018-2462-1
work_keys_str_mv AT liziwei evaluationoftopdownmassspectralidentificationwithhomologousproteinsequences
AT hebo evaluationoftopdownmassspectralidentificationwithhomologousproteinsequences
AT kouqiang evaluationoftopdownmassspectralidentificationwithhomologousproteinsequences
AT wangzhe evaluationoftopdownmassspectralidentificationwithhomologousproteinsequences
AT wusi evaluationoftopdownmassspectralidentificationwithhomologousproteinsequences
AT liuyunlong evaluationoftopdownmassspectralidentificationwithhomologousproteinsequences
AT fengweixing evaluationoftopdownmassspectralidentificationwithhomologousproteinsequences
AT liuxiaowen evaluationoftopdownmassspectralidentificationwithhomologousproteinsequences