Cargando…

Identification of alternative splice variants in Aspergillus flavus through comparison of multiple tandem MS search algorithms

BACKGROUND: Database searching is the most frequently used approach for automated peptide assignment and protein inference of tandem mass spectra. The results, however, depend on the sequences in target databases and on search algorithms. Recently by using an alternative splicing database, we identi...

Descripción completa

Detalles Bibliográficos
Autores principales: Chang, Kung-Yen, Muddiman, David C
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3146456/
https://www.ncbi.nlm.nih.gov/pubmed/21745387
http://dx.doi.org/10.1186/1471-2164-12-358
_version_ 1782209213461692416
author Chang, Kung-Yen
Muddiman, David C
author_facet Chang, Kung-Yen
Muddiman, David C
author_sort Chang, Kung-Yen
collection PubMed
description BACKGROUND: Database searching is the most frequently used approach for automated peptide assignment and protein inference of tandem mass spectra. The results, however, depend on the sequences in target databases and on search algorithms. Recently by using an alternative splicing database, we identified more proteins than with the annotated proteins in Aspergillus flavus. In this study, we aimed at finding a greater number of eligible splice variants based on newly available transcript sequences and the latest genome annotation. The improved database was then used to compare four search algorithms: Mascot, OMSSA, X! Tandem, and InsPecT. RESULTS: The updated alternative splicing database predicted 15833 putative protein variants, 61% more than the previous results. There was transcript evidence for 50% of the updated genes compared to the previous 35% coverage. Database searches were conducted using the same set of spectral data, search parameters, and protein database but with different algorithms. The false discovery rates of the peptide-spectrum matches were estimated < 2%. The numbers of the total identified proteins varied from 765 to 867 between algorithms. Whereas 42% (1651/3891) of peptide assignments were unanimous, the comparison showed that 51% (568/1114) of the RefSeq proteins and 15% (11/72) of the putative splice variants were inferred by all algorithms. 12 plausible isoforms were discovered by focusing on the consensus peptides which were detected by at least three different algorithms. The analysis found different conserved domains in two putative isoforms of UDP-galactose 4-epimerase. CONCLUSIONS: We were able to detect dozens of new peptides using the improved alternative splicing database with the recently updated annotation of the A. flavus genome. Unlike the identifications of the peptides and the RefSeq proteins, large variations existed between the putative splice variants identified by different algorithms. 12 candidates of putative isoforms were reported based on the consensus peptide-spectrum matches. This suggests that applications of multiple search engines effectively reduced the possible false positive results and validated the protein identifications from tandem mass spectra using an alternative splicing database.
format Online
Article
Text
id pubmed-3146456
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-31464562011-07-30 Identification of alternative splice variants in Aspergillus flavus through comparison of multiple tandem MS search algorithms Chang, Kung-Yen Muddiman, David C BMC Genomics Research Article BACKGROUND: Database searching is the most frequently used approach for automated peptide assignment and protein inference of tandem mass spectra. The results, however, depend on the sequences in target databases and on search algorithms. Recently by using an alternative splicing database, we identified more proteins than with the annotated proteins in Aspergillus flavus. In this study, we aimed at finding a greater number of eligible splice variants based on newly available transcript sequences and the latest genome annotation. The improved database was then used to compare four search algorithms: Mascot, OMSSA, X! Tandem, and InsPecT. RESULTS: The updated alternative splicing database predicted 15833 putative protein variants, 61% more than the previous results. There was transcript evidence for 50% of the updated genes compared to the previous 35% coverage. Database searches were conducted using the same set of spectral data, search parameters, and protein database but with different algorithms. The false discovery rates of the peptide-spectrum matches were estimated < 2%. The numbers of the total identified proteins varied from 765 to 867 between algorithms. Whereas 42% (1651/3891) of peptide assignments were unanimous, the comparison showed that 51% (568/1114) of the RefSeq proteins and 15% (11/72) of the putative splice variants were inferred by all algorithms. 12 plausible isoforms were discovered by focusing on the consensus peptides which were detected by at least three different algorithms. The analysis found different conserved domains in two putative isoforms of UDP-galactose 4-epimerase. CONCLUSIONS: We were able to detect dozens of new peptides using the improved alternative splicing database with the recently updated annotation of the A. flavus genome. Unlike the identifications of the peptides and the RefSeq proteins, large variations existed between the putative splice variants identified by different algorithms. 12 candidates of putative isoforms were reported based on the consensus peptide-spectrum matches. This suggests that applications of multiple search engines effectively reduced the possible false positive results and validated the protein identifications from tandem mass spectra using an alternative splicing database. BioMed Central 2011-07-11 /pmc/articles/PMC3146456/ /pubmed/21745387 http://dx.doi.org/10.1186/1471-2164-12-358 Text en Copyright ©2011 Chang and Muddiman; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Chang, Kung-Yen
Muddiman, David C
Identification of alternative splice variants in Aspergillus flavus through comparison of multiple tandem MS search algorithms
title Identification of alternative splice variants in Aspergillus flavus through comparison of multiple tandem MS search algorithms
title_full Identification of alternative splice variants in Aspergillus flavus through comparison of multiple tandem MS search algorithms
title_fullStr Identification of alternative splice variants in Aspergillus flavus through comparison of multiple tandem MS search algorithms
title_full_unstemmed Identification of alternative splice variants in Aspergillus flavus through comparison of multiple tandem MS search algorithms
title_short Identification of alternative splice variants in Aspergillus flavus through comparison of multiple tandem MS search algorithms
title_sort identification of alternative splice variants in aspergillus flavus through comparison of multiple tandem ms search algorithms
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3146456/
https://www.ncbi.nlm.nih.gov/pubmed/21745387
http://dx.doi.org/10.1186/1471-2164-12-358
work_keys_str_mv AT changkungyen identificationofalternativesplicevariantsinaspergillusflavusthroughcomparisonofmultipletandemmssearchalgorithms
AT muddimandavidc identificationofalternativesplicevariantsinaspergillusflavusthroughcomparisonofmultipletandemmssearchalgorithms