Cargando…

YeATSAM analysis of the walnut and chickpea transcriptome reveals key genes undetected by current annotation tools

Background: The transcriptome, a treasure trove of gene space information, remains severely under-used by current genome annotation methods.  Methods: Here, we present an annotation method in the YeATS suite (YeATSAM), based on information encoded by the transcriptome, that demonstrates artifacts of...

Descripción completa

Detalles Bibliográficos
Autores principales: Chakraborty, Sandeep, Martínez-García, Pedro J., Dandekar, Abhaya M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: F1000Research 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5200947/
https://www.ncbi.nlm.nih.gov/pubmed/28105312
http://dx.doi.org/10.12688/f1000research.10040.1
_version_ 1782489272014602240
author Chakraborty, Sandeep
Martínez-García, Pedro J.
Dandekar, Abhaya M.
author_facet Chakraborty, Sandeep
Martínez-García, Pedro J.
Dandekar, Abhaya M.
author_sort Chakraborty, Sandeep
collection PubMed
description Background: The transcriptome, a treasure trove of gene space information, remains severely under-used by current genome annotation methods.  Methods: Here, we present an annotation method in the YeATS suite (YeATSAM), based on information encoded by the transcriptome, that demonstrates artifacts of the assembler, which must be addressed to achieve proper annotation.  Results and Discussion: YeATSAM was applied to the transcriptome obtained from twenty walnut tissues and compared to MAKER-P annotation of the recently published walnut genome sequence (WGS). MAKER-P and YeATSAM both failed to annotate several hundred proteins found by the other. Although many of these unannotated proteins have repetitive sequences (possibly transposable elements), other crucial proteins were excluded by each method. An egg cell-secreted protein and a homer protein were undetected by YeATSAM, although these did not produce any transcripts. Importantly, MAKER-P failed to classify key photosynthesis-related proteins, which we show emanated from Trinity assembly artifacts potentially not handled by MAKER-P. Also, no proteins from the large berberine bridge enzyme (BBE) family were annotated by MAKER-P. BBE is implicated in biosynthesis of several alkaloids metabolites, like anti-microbial berberine. As further validation, YeATSAM identified ~1000 genes that are not annotated in the NCBI database by Gnomon. YeATSAM used a RNA-seq derived chickpea ( Cicer arietinum L.) transcriptome assembled using Newbler v2.3.  Conclusions: Since the current version of YeATSAM does not have an ab initio module, we suggest a combined annotation scheme using both MAKER-P and YeATSAM to comprehensively and accurately annotate the WGS.
format Online
Article
Text
id pubmed-5200947
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher F1000Research
record_format MEDLINE/PubMed
spelling pubmed-52009472017-01-18 YeATSAM analysis of the walnut and chickpea transcriptome reveals key genes undetected by current annotation tools Chakraborty, Sandeep Martínez-García, Pedro J. Dandekar, Abhaya M. F1000Res Research Article Background: The transcriptome, a treasure trove of gene space information, remains severely under-used by current genome annotation methods.  Methods: Here, we present an annotation method in the YeATS suite (YeATSAM), based on information encoded by the transcriptome, that demonstrates artifacts of the assembler, which must be addressed to achieve proper annotation.  Results and Discussion: YeATSAM was applied to the transcriptome obtained from twenty walnut tissues and compared to MAKER-P annotation of the recently published walnut genome sequence (WGS). MAKER-P and YeATSAM both failed to annotate several hundred proteins found by the other. Although many of these unannotated proteins have repetitive sequences (possibly transposable elements), other crucial proteins were excluded by each method. An egg cell-secreted protein and a homer protein were undetected by YeATSAM, although these did not produce any transcripts. Importantly, MAKER-P failed to classify key photosynthesis-related proteins, which we show emanated from Trinity assembly artifacts potentially not handled by MAKER-P. Also, no proteins from the large berberine bridge enzyme (BBE) family were annotated by MAKER-P. BBE is implicated in biosynthesis of several alkaloids metabolites, like anti-microbial berberine. As further validation, YeATSAM identified ~1000 genes that are not annotated in the NCBI database by Gnomon. YeATSAM used a RNA-seq derived chickpea ( Cicer arietinum L.) transcriptome assembled using Newbler v2.3.  Conclusions: Since the current version of YeATSAM does not have an ab initio module, we suggest a combined annotation scheme using both MAKER-P and YeATSAM to comprehensively and accurately annotate the WGS. F1000Research 2016-11-17 /pmc/articles/PMC5200947/ /pubmed/28105312 http://dx.doi.org/10.12688/f1000research.10040.1 Text en Copyright: © 2016 Chakraborty S et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Chakraborty, Sandeep
Martínez-García, Pedro J.
Dandekar, Abhaya M.
YeATSAM analysis of the walnut and chickpea transcriptome reveals key genes undetected by current annotation tools
title YeATSAM analysis of the walnut and chickpea transcriptome reveals key genes undetected by current annotation tools
title_full YeATSAM analysis of the walnut and chickpea transcriptome reveals key genes undetected by current annotation tools
title_fullStr YeATSAM analysis of the walnut and chickpea transcriptome reveals key genes undetected by current annotation tools
title_full_unstemmed YeATSAM analysis of the walnut and chickpea transcriptome reveals key genes undetected by current annotation tools
title_short YeATSAM analysis of the walnut and chickpea transcriptome reveals key genes undetected by current annotation tools
title_sort yeatsam analysis of the walnut and chickpea transcriptome reveals key genes undetected by current annotation tools
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5200947/
https://www.ncbi.nlm.nih.gov/pubmed/28105312
http://dx.doi.org/10.12688/f1000research.10040.1
work_keys_str_mv AT chakrabortysandeep yeatsamanalysisofthewalnutandchickpeatranscriptomerevealskeygenesundetectedbycurrentannotationtools
AT martinezgarciapedroj yeatsamanalysisofthewalnutandchickpeatranscriptomerevealskeygenesundetectedbycurrentannotationtools
AT dandekarabhayam yeatsamanalysisofthewalnutandchickpeatranscriptomerevealskeygenesundetectedbycurrentannotationtools