Cargando…

Animal, Fungi, and Plant Genome Sequences Harbor Different Non-Canonical Splice Sites

Most protein-encoding genes in eukaryotes contain introns, which are interwoven with exons. Introns need to be removed from initial transcripts in order to generate the final messenger RNA (mRNA), which can be translated into an amino acid sequence. Precise excision of introns by the spliceosome req...

Descripción completa

Detalles Bibliográficos
Autores principales: Frey, Katharina, Pucker, Boas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7072748/
https://www.ncbi.nlm.nih.gov/pubmed/32085510
http://dx.doi.org/10.3390/cells9020458
_version_ 1783506478865317888
author Frey, Katharina
Pucker, Boas
author_facet Frey, Katharina
Pucker, Boas
author_sort Frey, Katharina
collection PubMed
description Most protein-encoding genes in eukaryotes contain introns, which are interwoven with exons. Introns need to be removed from initial transcripts in order to generate the final messenger RNA (mRNA), which can be translated into an amino acid sequence. Precise excision of introns by the spliceosome requires conserved dinucleotides, which mark the splice sites. However, there are variations of the highly conserved combination of GT at the 5′ end and AG at the 3′ end of an intron in the genome. GC-AG and AT-AC are two major non-canonical splice site combinations, which have been known for years. Recently, various minor non-canonical splice site combinations were detected with numerous dinucleotide permutations. Here, we expand systematic investigations of non-canonical splice site combinations in plants across eukaryotes by analyzing fungal and animal genome sequences. Comparisons of splice site combinations between these three kingdoms revealed several differences, such as an apparently increased CT-AC frequency in fungal genome sequences. Canonical GT-AG splice site combinations in antisense transcripts are a likely explanation for this observation, thus indicating annotation errors. In addition, high numbers of GA-AG splice site combinations were observed in Eurytemora affinis and Oikopleura dioica. A variant in one U1 small nuclear RNA (snRNA) isoform might allow the recognition of GA as a 5′ splice site. In depth investigation of splice site usage based on RNA-Seq read mappings indicates a generally higher flexibility of the 3′ splice site compared to the 5′ splice site across animals, fungi, and plants.
format Online
Article
Text
id pubmed-7072748
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-70727482020-03-19 Animal, Fungi, and Plant Genome Sequences Harbor Different Non-Canonical Splice Sites Frey, Katharina Pucker, Boas Cells Article Most protein-encoding genes in eukaryotes contain introns, which are interwoven with exons. Introns need to be removed from initial transcripts in order to generate the final messenger RNA (mRNA), which can be translated into an amino acid sequence. Precise excision of introns by the spliceosome requires conserved dinucleotides, which mark the splice sites. However, there are variations of the highly conserved combination of GT at the 5′ end and AG at the 3′ end of an intron in the genome. GC-AG and AT-AC are two major non-canonical splice site combinations, which have been known for years. Recently, various minor non-canonical splice site combinations were detected with numerous dinucleotide permutations. Here, we expand systematic investigations of non-canonical splice site combinations in plants across eukaryotes by analyzing fungal and animal genome sequences. Comparisons of splice site combinations between these three kingdoms revealed several differences, such as an apparently increased CT-AC frequency in fungal genome sequences. Canonical GT-AG splice site combinations in antisense transcripts are a likely explanation for this observation, thus indicating annotation errors. In addition, high numbers of GA-AG splice site combinations were observed in Eurytemora affinis and Oikopleura dioica. A variant in one U1 small nuclear RNA (snRNA) isoform might allow the recognition of GA as a 5′ splice site. In depth investigation of splice site usage based on RNA-Seq read mappings indicates a generally higher flexibility of the 3′ splice site compared to the 5′ splice site across animals, fungi, and plants. MDPI 2020-02-18 /pmc/articles/PMC7072748/ /pubmed/32085510 http://dx.doi.org/10.3390/cells9020458 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Frey, Katharina
Pucker, Boas
Animal, Fungi, and Plant Genome Sequences Harbor Different Non-Canonical Splice Sites
title Animal, Fungi, and Plant Genome Sequences Harbor Different Non-Canonical Splice Sites
title_full Animal, Fungi, and Plant Genome Sequences Harbor Different Non-Canonical Splice Sites
title_fullStr Animal, Fungi, and Plant Genome Sequences Harbor Different Non-Canonical Splice Sites
title_full_unstemmed Animal, Fungi, and Plant Genome Sequences Harbor Different Non-Canonical Splice Sites
title_short Animal, Fungi, and Plant Genome Sequences Harbor Different Non-Canonical Splice Sites
title_sort animal, fungi, and plant genome sequences harbor different non-canonical splice sites
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7072748/
https://www.ncbi.nlm.nih.gov/pubmed/32085510
http://dx.doi.org/10.3390/cells9020458
work_keys_str_mv AT freykatharina animalfungiandplantgenomesequencesharbordifferentnoncanonicalsplicesites
AT puckerboas animalfungiandplantgenomesequencesharbordifferentnoncanonicalsplicesites