Cargando…

NLR-parser: rapid annotation of plant NLR complements

Motivation: The repetitive nature of plant disease resistance genes encoding for nucleotide-binding leucine-rich repeat (NLR) proteins hampers their prediction with standard gene annotation software. Motif alignment and search tool (MAST) has previously been reported as a tool to support annotation...

Descripción completa

Detalles Bibliográficos
Autores principales: Steuernagel, Burkhard, Jupe, Florian, Witek, Kamil, Jones, Jonathan D.G., Wulff, Brande B.H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4426836/
https://www.ncbi.nlm.nih.gov/pubmed/25586514
http://dx.doi.org/10.1093/bioinformatics/btv005
_version_ 1782370643361136640
author Steuernagel, Burkhard
Jupe, Florian
Witek, Kamil
Jones, Jonathan D.G.
Wulff, Brande B.H.
author_facet Steuernagel, Burkhard
Jupe, Florian
Witek, Kamil
Jones, Jonathan D.G.
Wulff, Brande B.H.
author_sort Steuernagel, Burkhard
collection PubMed
description Motivation: The repetitive nature of plant disease resistance genes encoding for nucleotide-binding leucine-rich repeat (NLR) proteins hampers their prediction with standard gene annotation software. Motif alignment and search tool (MAST) has previously been reported as a tool to support annotation of NLR-encoding genes. However, the decision if a motif combination represents an NLR protein was entirely manual. Results: The NLR-parser pipeline is designed to use the MAST output from six-frame translated amino acid sequences and filters for predefined biologically curated motif compositions. Input reads can be derived from, for example, raw long-read sequencing data or contigs and scaffolds coming from plant genome projects. The output is a tab-separated file with information on start and frame of the first NLR specific motif, whether the identified sequence is a TNL or CNL, potentially full or fragmented. In addition, the output of the NB-ARC domain sequence can directly be used for phylogenetic analyses. In comparison to other prediction software, the highly complex NB-ARC domain is described in detail using several individual motifs. Availability and implementation: The NLR-parser tool can be downloaded from Git-Hub (github.com/steuernb/NLR-Parser). It requires a valid Java installation as well as MAST as part of the MEME Suite. The tool is run from the command line. Contact: burkhard.steuernagel@jic.ac.uk; fjupe@salk.edu Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-4426836
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-44268362015-05-15 NLR-parser: rapid annotation of plant NLR complements Steuernagel, Burkhard Jupe, Florian Witek, Kamil Jones, Jonathan D.G. Wulff, Brande B.H. Bioinformatics Applications Notes Motivation: The repetitive nature of plant disease resistance genes encoding for nucleotide-binding leucine-rich repeat (NLR) proteins hampers their prediction with standard gene annotation software. Motif alignment and search tool (MAST) has previously been reported as a tool to support annotation of NLR-encoding genes. However, the decision if a motif combination represents an NLR protein was entirely manual. Results: The NLR-parser pipeline is designed to use the MAST output from six-frame translated amino acid sequences and filters for predefined biologically curated motif compositions. Input reads can be derived from, for example, raw long-read sequencing data or contigs and scaffolds coming from plant genome projects. The output is a tab-separated file with information on start and frame of the first NLR specific motif, whether the identified sequence is a TNL or CNL, potentially full or fragmented. In addition, the output of the NB-ARC domain sequence can directly be used for phylogenetic analyses. In comparison to other prediction software, the highly complex NB-ARC domain is described in detail using several individual motifs. Availability and implementation: The NLR-parser tool can be downloaded from Git-Hub (github.com/steuernb/NLR-Parser). It requires a valid Java installation as well as MAST as part of the MEME Suite. The tool is run from the command line. Contact: burkhard.steuernagel@jic.ac.uk; fjupe@salk.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2015-05-15 2015-01-12 /pmc/articles/PMC4426836/ /pubmed/25586514 http://dx.doi.org/10.1093/bioinformatics/btv005 Text en © The Author 2015. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Notes
Steuernagel, Burkhard
Jupe, Florian
Witek, Kamil
Jones, Jonathan D.G.
Wulff, Brande B.H.
NLR-parser: rapid annotation of plant NLR complements
title NLR-parser: rapid annotation of plant NLR complements
title_full NLR-parser: rapid annotation of plant NLR complements
title_fullStr NLR-parser: rapid annotation of plant NLR complements
title_full_unstemmed NLR-parser: rapid annotation of plant NLR complements
title_short NLR-parser: rapid annotation of plant NLR complements
title_sort nlr-parser: rapid annotation of plant nlr complements
topic Applications Notes
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4426836/
https://www.ncbi.nlm.nih.gov/pubmed/25586514
http://dx.doi.org/10.1093/bioinformatics/btv005
work_keys_str_mv AT steuernagelburkhard nlrparserrapidannotationofplantnlrcomplements
AT jupeflorian nlrparserrapidannotationofplantnlrcomplements
AT witekkamil nlrparserrapidannotationofplantnlrcomplements
AT jonesjonathandg nlrparserrapidannotationofplantnlrcomplements
AT wulffbrandebh nlrparserrapidannotationofplantnlrcomplements