Cargando…

Expanding Alternative Splicing Identification by Integrating Multiple Sources of Transcription Data in Tomato

Tomato (Solanum lycopersicum) is an important vegetable and fruit crop. Its genome was completely sequenced and there are also a large amount of available expressed sequence tags (ESTs) and short reads generated by RNA sequencing (RNA-seq) technologies. Mapping transcripts including mRNA sequences,...

Descripción completa

Detalles Bibliográficos
Autores principales: Clark, Sarah, Yu, Feng, Gu, Lianfeng, Min, Xiang Jia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6546887/
https://www.ncbi.nlm.nih.gov/pubmed/31191588
http://dx.doi.org/10.3389/fpls.2019.00689
_version_ 1783423596446613504
author Clark, Sarah
Yu, Feng
Gu, Lianfeng
Min, Xiang Jia
author_facet Clark, Sarah
Yu, Feng
Gu, Lianfeng
Min, Xiang Jia
author_sort Clark, Sarah
collection PubMed
description Tomato (Solanum lycopersicum) is an important vegetable and fruit crop. Its genome was completely sequenced and there are also a large amount of available expressed sequence tags (ESTs) and short reads generated by RNA sequencing (RNA-seq) technologies. Mapping transcripts including mRNA sequences, ESTs, and RNA-seq reads to the genome allows identifying pre-mRNA alternative splicing (AS), a post-transcriptional process generating two or more RNA isoforms from one pre-mRNA transcript. We comprehensively analyzed the AS landscape in tomato by integrating genome mapping information of all available mRNA and ESTs with mapping information of RNA-seq reads which were collected from 27 published projects. A total of 369,911 AS events were identified from 34,419 genomic loci involving 161,913 transcripts. Within the basic AS events, intron retention is the prevalent type (18.9%), followed by alternative acceptor site (12.9%) and alternative donor site (7.3%), with exon skipping as the least type (6.0%). Complex AS types having two or more basic event accounted for 54.9% of total AS events. Within 35,768 annotated protein-coding gene models, 23,233 gene models were found having pre-mRNAs generating AS isoform transcripts. Thus the estimated AS rate was 65.0% in tomato. The list of identified AS genes with their corresponding transcript isoforms serves as a catalog for further detailed examination of gene functions in tomato biology. The post-transcriptional information is also expected to be useful in improving the predicted gene models in tomato. The sequence and annotation information can be accessed at plant alternative splicing database (http://proteomics.ysu.edu/altsplice).
format Online
Article
Text
id pubmed-6546887
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-65468872019-06-12 Expanding Alternative Splicing Identification by Integrating Multiple Sources of Transcription Data in Tomato Clark, Sarah Yu, Feng Gu, Lianfeng Min, Xiang Jia Front Plant Sci Plant Science Tomato (Solanum lycopersicum) is an important vegetable and fruit crop. Its genome was completely sequenced and there are also a large amount of available expressed sequence tags (ESTs) and short reads generated by RNA sequencing (RNA-seq) technologies. Mapping transcripts including mRNA sequences, ESTs, and RNA-seq reads to the genome allows identifying pre-mRNA alternative splicing (AS), a post-transcriptional process generating two or more RNA isoforms from one pre-mRNA transcript. We comprehensively analyzed the AS landscape in tomato by integrating genome mapping information of all available mRNA and ESTs with mapping information of RNA-seq reads which were collected from 27 published projects. A total of 369,911 AS events were identified from 34,419 genomic loci involving 161,913 transcripts. Within the basic AS events, intron retention is the prevalent type (18.9%), followed by alternative acceptor site (12.9%) and alternative donor site (7.3%), with exon skipping as the least type (6.0%). Complex AS types having two or more basic event accounted for 54.9% of total AS events. Within 35,768 annotated protein-coding gene models, 23,233 gene models were found having pre-mRNAs generating AS isoform transcripts. Thus the estimated AS rate was 65.0% in tomato. The list of identified AS genes with their corresponding transcript isoforms serves as a catalog for further detailed examination of gene functions in tomato biology. The post-transcriptional information is also expected to be useful in improving the predicted gene models in tomato. The sequence and annotation information can be accessed at plant alternative splicing database (http://proteomics.ysu.edu/altsplice). Frontiers Media S.A. 2019-05-28 /pmc/articles/PMC6546887/ /pubmed/31191588 http://dx.doi.org/10.3389/fpls.2019.00689 Text en Copyright © 2019 Clark, Yu, Gu and Min. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Plant Science
Clark, Sarah
Yu, Feng
Gu, Lianfeng
Min, Xiang Jia
Expanding Alternative Splicing Identification by Integrating Multiple Sources of Transcription Data in Tomato
title Expanding Alternative Splicing Identification by Integrating Multiple Sources of Transcription Data in Tomato
title_full Expanding Alternative Splicing Identification by Integrating Multiple Sources of Transcription Data in Tomato
title_fullStr Expanding Alternative Splicing Identification by Integrating Multiple Sources of Transcription Data in Tomato
title_full_unstemmed Expanding Alternative Splicing Identification by Integrating Multiple Sources of Transcription Data in Tomato
title_short Expanding Alternative Splicing Identification by Integrating Multiple Sources of Transcription Data in Tomato
title_sort expanding alternative splicing identification by integrating multiple sources of transcription data in tomato
topic Plant Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6546887/
https://www.ncbi.nlm.nih.gov/pubmed/31191588
http://dx.doi.org/10.3389/fpls.2019.00689
work_keys_str_mv AT clarksarah expandingalternativesplicingidentificationbyintegratingmultiplesourcesoftranscriptiondataintomato
AT yufeng expandingalternativesplicingidentificationbyintegratingmultiplesourcesoftranscriptiondataintomato
AT gulianfeng expandingalternativesplicingidentificationbyintegratingmultiplesourcesoftranscriptiondataintomato
AT minxiangjia expandingalternativesplicingidentificationbyintegratingmultiplesourcesoftranscriptiondataintomato