Cargando…

Detecting small plant peptides using SPADA (Small Peptide Alignment Discovery Application)

BACKGROUND: Small peptides encoded as one- or two-exon genes in plants have recently been shown to affect multiple aspects of plant development, reproduction and defense responses. However, popular similarity search tools and gene prediction techniques generally fail to identify most members belongi...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Peng, Silverstein, Kevin AT, Gao, Liangliang, Walton, Jonathan D, Nallu, Sumitha, Guhlin, Joseph, Young, Nevin D
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3924332/
https://www.ncbi.nlm.nih.gov/pubmed/24256031
http://dx.doi.org/10.1186/1471-2105-14-335
_version_ 1782303726340407296
author Zhou, Peng
Silverstein, Kevin AT
Gao, Liangliang
Walton, Jonathan D
Nallu, Sumitha
Guhlin, Joseph
Young, Nevin D
author_facet Zhou, Peng
Silverstein, Kevin AT
Gao, Liangliang
Walton, Jonathan D
Nallu, Sumitha
Guhlin, Joseph
Young, Nevin D
author_sort Zhou, Peng
collection PubMed
description BACKGROUND: Small peptides encoded as one- or two-exon genes in plants have recently been shown to affect multiple aspects of plant development, reproduction and defense responses. However, popular similarity search tools and gene prediction techniques generally fail to identify most members belonging to this class of genes. This is largely due to the high sequence divergence among family members and the limited availability of experimentally verified small peptides to use as training sets for homology search and ab initio prediction. Consequently, there is an urgent need for both experimental and computational studies in order to further advance the accurate prediction of small peptides. RESULTS: We present here a homology-based gene prediction program to accurately predict small peptides at the genome level. Given a high-quality profile alignment, SPADA identifies and annotates nearly all family members in tested genomes with better performance than all general-purpose gene prediction programs surveyed. We find numerous mis-annotations in the current Arabidopsis thaliana and Medicago truncatula genome databases using SPADA, most of which have RNA-Seq expression support. We also show that SPADA works well on other classes of small secreted peptides in plants (e.g., self-incompatibility protein homologues) as well as non-secreted peptides outside the plant kingdom (e.g., the alpha-amanitin toxin gene family in the mushroom, Amanita bisporigera). CONCLUSIONS: SPADA is a free software tool that accurately identifies and predicts the gene structure for short peptides with one or two exons. SPADA is able to incorporate information from profile alignments into the model prediction process and makes use of it to score different candidate models. SPADA achieves high sensitivity and specificity in predicting small plant peptides such as the cysteine-rich peptide families. A systematic application of SPADA to other classes of small peptides by research communities will greatly improve the genome annotation of different protein families in public genome databases.
format Online
Article
Text
id pubmed-3924332
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-39243322014-03-03 Detecting small plant peptides using SPADA (Small Peptide Alignment Discovery Application) Zhou, Peng Silverstein, Kevin AT Gao, Liangliang Walton, Jonathan D Nallu, Sumitha Guhlin, Joseph Young, Nevin D BMC Bioinformatics Methodology Article BACKGROUND: Small peptides encoded as one- or two-exon genes in plants have recently been shown to affect multiple aspects of plant development, reproduction and defense responses. However, popular similarity search tools and gene prediction techniques generally fail to identify most members belonging to this class of genes. This is largely due to the high sequence divergence among family members and the limited availability of experimentally verified small peptides to use as training sets for homology search and ab initio prediction. Consequently, there is an urgent need for both experimental and computational studies in order to further advance the accurate prediction of small peptides. RESULTS: We present here a homology-based gene prediction program to accurately predict small peptides at the genome level. Given a high-quality profile alignment, SPADA identifies and annotates nearly all family members in tested genomes with better performance than all general-purpose gene prediction programs surveyed. We find numerous mis-annotations in the current Arabidopsis thaliana and Medicago truncatula genome databases using SPADA, most of which have RNA-Seq expression support. We also show that SPADA works well on other classes of small secreted peptides in plants (e.g., self-incompatibility protein homologues) as well as non-secreted peptides outside the plant kingdom (e.g., the alpha-amanitin toxin gene family in the mushroom, Amanita bisporigera). CONCLUSIONS: SPADA is a free software tool that accurately identifies and predicts the gene structure for short peptides with one or two exons. SPADA is able to incorporate information from profile alignments into the model prediction process and makes use of it to score different candidate models. SPADA achieves high sensitivity and specificity in predicting small plant peptides such as the cysteine-rich peptide families. A systematic application of SPADA to other classes of small peptides by research communities will greatly improve the genome annotation of different protein families in public genome databases. BioMed Central 2013-11-20 /pmc/articles/PMC3924332/ /pubmed/24256031 http://dx.doi.org/10.1186/1471-2105-14-335 Text en Copyright © 2013 Zhou et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Zhou, Peng
Silverstein, Kevin AT
Gao, Liangliang
Walton, Jonathan D
Nallu, Sumitha
Guhlin, Joseph
Young, Nevin D
Detecting small plant peptides using SPADA (Small Peptide Alignment Discovery Application)
title Detecting small plant peptides using SPADA (Small Peptide Alignment Discovery Application)
title_full Detecting small plant peptides using SPADA (Small Peptide Alignment Discovery Application)
title_fullStr Detecting small plant peptides using SPADA (Small Peptide Alignment Discovery Application)
title_full_unstemmed Detecting small plant peptides using SPADA (Small Peptide Alignment Discovery Application)
title_short Detecting small plant peptides using SPADA (Small Peptide Alignment Discovery Application)
title_sort detecting small plant peptides using spada (small peptide alignment discovery application)
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3924332/
https://www.ncbi.nlm.nih.gov/pubmed/24256031
http://dx.doi.org/10.1186/1471-2105-14-335
work_keys_str_mv AT zhoupeng detectingsmallplantpeptidesusingspadasmallpeptidealignmentdiscoveryapplication
AT silversteinkevinat detectingsmallplantpeptidesusingspadasmallpeptidealignmentdiscoveryapplication
AT gaoliangliang detectingsmallplantpeptidesusingspadasmallpeptidealignmentdiscoveryapplication
AT waltonjonathand detectingsmallplantpeptidesusingspadasmallpeptidealignmentdiscoveryapplication
AT nallusumitha detectingsmallplantpeptidesusingspadasmallpeptidealignmentdiscoveryapplication
AT guhlinjoseph detectingsmallplantpeptidesusingspadasmallpeptidealignmentdiscoveryapplication
AT youngnevind detectingsmallplantpeptidesusingspadasmallpeptidealignmentdiscoveryapplication