Cargando…

Using Deep RNA Sequencing for the Structural Annotation of the Laccaria Bicolor Mycorrhizal Transcriptome

BACKGROUND: Accurate structural annotation is important for prediction of function and required for in vitro approaches to characterize or validate the gene expression products. Despite significant efforts in the field, determination of the gene structure from genomic data alone is a challenging and...

Descripción completa

Detalles Bibliográficos
Autores principales: Larsen, Peter E., Trivedi, Geetika, Sreedasyam, Avinash, Lu, Vincent, Podila, Gopi K., Collart, Frank R.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2897884/
https://www.ncbi.nlm.nih.gov/pubmed/20625404
http://dx.doi.org/10.1371/journal.pone.0009780
_version_ 1782183451471904768
author Larsen, Peter E.
Trivedi, Geetika
Sreedasyam, Avinash
Lu, Vincent
Podila, Gopi K.
Collart, Frank R.
author_facet Larsen, Peter E.
Trivedi, Geetika
Sreedasyam, Avinash
Lu, Vincent
Podila, Gopi K.
Collart, Frank R.
author_sort Larsen, Peter E.
collection PubMed
description BACKGROUND: Accurate structural annotation is important for prediction of function and required for in vitro approaches to characterize or validate the gene expression products. Despite significant efforts in the field, determination of the gene structure from genomic data alone is a challenging and inaccurate process. The ease of acquisition of transcriptomic sequence provides a direct route to identify expressed sequences and determine the correct gene structure. METHODOLOGY: We developed methods to utilize RNA-seq data to correct errors in the structural annotation and extend the boundaries of current gene models using assembly approaches. The methods were validated with a transcriptomic data set derived from the fungus Laccaria bicolor, which develops a mycorrhizal symbiotic association with the roots of many tree species. Our analysis focused on the subset of 1501 gene models that are differentially expressed in the free living vs. mycorrhizal transcriptome and are expected to be important elements related to carbon metabolism, membrane permeability and transport, and intracellular signaling. Of the set of 1501 gene models, 1439 (96%) successfully generated modified gene models in which all error flags were successfully resolved and the sequences aligned to the genomic sequence. The remaining 4% (62 gene models) either had deviations from transcriptomic data that could not be spanned or generated sequence that did not align to genomic sequence. The outcome of this process is a set of high confidence gene models that can be reliably used for experimental characterization of protein function. CONCLUSIONS: 69% of expressed mycorrhizal JGI “best” gene models deviated from the transcript sequence derived by this method. The transcriptomic sequence enabled correction of a majority of the structural inconsistencies and resulted in a set of validated models for 96% of the mycorrhizal genes. The method described here can be applied to improve gene structural annotation in other species, provided that there is a sequenced genome and a set of gene models.
format Text
id pubmed-2897884
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-28978842010-07-12 Using Deep RNA Sequencing for the Structural Annotation of the Laccaria Bicolor Mycorrhizal Transcriptome Larsen, Peter E. Trivedi, Geetika Sreedasyam, Avinash Lu, Vincent Podila, Gopi K. Collart, Frank R. PLoS One Research Article BACKGROUND: Accurate structural annotation is important for prediction of function and required for in vitro approaches to characterize or validate the gene expression products. Despite significant efforts in the field, determination of the gene structure from genomic data alone is a challenging and inaccurate process. The ease of acquisition of transcriptomic sequence provides a direct route to identify expressed sequences and determine the correct gene structure. METHODOLOGY: We developed methods to utilize RNA-seq data to correct errors in the structural annotation and extend the boundaries of current gene models using assembly approaches. The methods were validated with a transcriptomic data set derived from the fungus Laccaria bicolor, which develops a mycorrhizal symbiotic association with the roots of many tree species. Our analysis focused on the subset of 1501 gene models that are differentially expressed in the free living vs. mycorrhizal transcriptome and are expected to be important elements related to carbon metabolism, membrane permeability and transport, and intracellular signaling. Of the set of 1501 gene models, 1439 (96%) successfully generated modified gene models in which all error flags were successfully resolved and the sequences aligned to the genomic sequence. The remaining 4% (62 gene models) either had deviations from transcriptomic data that could not be spanned or generated sequence that did not align to genomic sequence. The outcome of this process is a set of high confidence gene models that can be reliably used for experimental characterization of protein function. CONCLUSIONS: 69% of expressed mycorrhizal JGI “best” gene models deviated from the transcript sequence derived by this method. The transcriptomic sequence enabled correction of a majority of the structural inconsistencies and resulted in a set of validated models for 96% of the mycorrhizal genes. The method described here can be applied to improve gene structural annotation in other species, provided that there is a sequenced genome and a set of gene models. Public Library of Science 2010-07-06 /pmc/articles/PMC2897884/ /pubmed/20625404 http://dx.doi.org/10.1371/journal.pone.0009780 Text en This is an open-access article distributed under the terms of the Creative Commons Public Domain declaration which stipulates that, once placed in the public domain, this work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. https://creativecommons.org/publicdomain/zero/1.0/ This is an open-access article distributed under the terms of the Creative Commons Public Domain declaration, which stipulates that, once placed in the public domain, this work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose.
spellingShingle Research Article
Larsen, Peter E.
Trivedi, Geetika
Sreedasyam, Avinash
Lu, Vincent
Podila, Gopi K.
Collart, Frank R.
Using Deep RNA Sequencing for the Structural Annotation of the Laccaria Bicolor Mycorrhizal Transcriptome
title Using Deep RNA Sequencing for the Structural Annotation of the Laccaria Bicolor Mycorrhizal Transcriptome
title_full Using Deep RNA Sequencing for the Structural Annotation of the Laccaria Bicolor Mycorrhizal Transcriptome
title_fullStr Using Deep RNA Sequencing for the Structural Annotation of the Laccaria Bicolor Mycorrhizal Transcriptome
title_full_unstemmed Using Deep RNA Sequencing for the Structural Annotation of the Laccaria Bicolor Mycorrhizal Transcriptome
title_short Using Deep RNA Sequencing for the Structural Annotation of the Laccaria Bicolor Mycorrhizal Transcriptome
title_sort using deep rna sequencing for the structural annotation of the laccaria bicolor mycorrhizal transcriptome
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2897884/
https://www.ncbi.nlm.nih.gov/pubmed/20625404
http://dx.doi.org/10.1371/journal.pone.0009780
work_keys_str_mv AT larsenpetere usingdeeprnasequencingforthestructuralannotationofthelaccariabicolormycorrhizaltranscriptome
AT trivedigeetika usingdeeprnasequencingforthestructuralannotationofthelaccariabicolormycorrhizaltranscriptome
AT sreedasyamavinash usingdeeprnasequencingforthestructuralannotationofthelaccariabicolormycorrhizaltranscriptome
AT luvincent usingdeeprnasequencingforthestructuralannotationofthelaccariabicolormycorrhizaltranscriptome
AT podilagopik usingdeeprnasequencingforthestructuralannotationofthelaccariabicolormycorrhizaltranscriptome
AT collartfrankr usingdeeprnasequencingforthestructuralannotationofthelaccariabicolormycorrhizaltranscriptome