Cargando…

Dinoflagellate Gene Structure and Intron Splice Sites in a Genomic Tandem Array

Dinoflagellates are one of the last major lineages of eukaryotes for which little is known about genome structure and organization. We report here the sequence and gene structure of a clone isolated from a cosmid library which, to our knowledge, represents the largest contiguously sequenced, dinofla...

Descripción completa

Detalles Bibliográficos
Autores principales: Mendez, Gregory S., Delwiche, Charles F., Apt, Kirk E., Lippmeier, J. Casey
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5032977/
https://www.ncbi.nlm.nih.gov/pubmed/25963315
http://dx.doi.org/10.1111/jeu.12230
_version_ 1782455088972824576
author Mendez, Gregory S.
Delwiche, Charles F.
Apt, Kirk E.
Lippmeier, J. Casey
author_facet Mendez, Gregory S.
Delwiche, Charles F.
Apt, Kirk E.
Lippmeier, J. Casey
author_sort Mendez, Gregory S.
collection PubMed
description Dinoflagellates are one of the last major lineages of eukaryotes for which little is known about genome structure and organization. We report here the sequence and gene structure of a clone isolated from a cosmid library which, to our knowledge, represents the largest contiguously sequenced, dinoflagellate genomic, tandem gene array. These data, combined with information from a large transcriptomic library, allowed a high level of confidence of every base pair call. This degree of confidence is not possible with PCR‐based contigs. The sequence contains an intron‐rich set of five highly expressed gene repeats arranged in tandem. One of the tandem repeat gene members contains an intron 26,372 bp long. This study characterizes a splice site consensus sequence for dinoflagellate introns. Two to nine base pairs around the 3′ splice site are repeated by an identical two to nine base pairs around the 5′ splice site. The 5′ and 3′ splice sites are in the same locations within each repeat so that the repeat is found only once in the mature mRNA. This identically repeated intron boundary sequence might be useful in gene modeling and annotation of genomes.
format Online
Article
Text
id pubmed-5032977
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-50329772016-10-03 Dinoflagellate Gene Structure and Intron Splice Sites in a Genomic Tandem Array Mendez, Gregory S. Delwiche, Charles F. Apt, Kirk E. Lippmeier, J. Casey J Eukaryot Microbiol Original Articles Dinoflagellates are one of the last major lineages of eukaryotes for which little is known about genome structure and organization. We report here the sequence and gene structure of a clone isolated from a cosmid library which, to our knowledge, represents the largest contiguously sequenced, dinoflagellate genomic, tandem gene array. These data, combined with information from a large transcriptomic library, allowed a high level of confidence of every base pair call. This degree of confidence is not possible with PCR‐based contigs. The sequence contains an intron‐rich set of five highly expressed gene repeats arranged in tandem. One of the tandem repeat gene members contains an intron 26,372 bp long. This study characterizes a splice site consensus sequence for dinoflagellate introns. Two to nine base pairs around the 3′ splice site are repeated by an identical two to nine base pairs around the 5′ splice site. The 5′ and 3′ splice sites are in the same locations within each repeat so that the repeat is found only once in the mature mRNA. This identically repeated intron boundary sequence might be useful in gene modeling and annotation of genomes. John Wiley and Sons Inc. 2015-06-08 2015 /pmc/articles/PMC5032977/ /pubmed/25963315 http://dx.doi.org/10.1111/jeu.12230 Text en © 2015 The Authors. The Journal of Eukaryotic Microbiology published by Wiley Periodicals, Inc. on behalf of International Society of Protistologists This is an open access article under the terms of the Creative Commons Attribution‐NonCommercial (http://creativecommons.org/licenses/by-nc/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.
spellingShingle Original Articles
Mendez, Gregory S.
Delwiche, Charles F.
Apt, Kirk E.
Lippmeier, J. Casey
Dinoflagellate Gene Structure and Intron Splice Sites in a Genomic Tandem Array
title Dinoflagellate Gene Structure and Intron Splice Sites in a Genomic Tandem Array
title_full Dinoflagellate Gene Structure and Intron Splice Sites in a Genomic Tandem Array
title_fullStr Dinoflagellate Gene Structure and Intron Splice Sites in a Genomic Tandem Array
title_full_unstemmed Dinoflagellate Gene Structure and Intron Splice Sites in a Genomic Tandem Array
title_short Dinoflagellate Gene Structure and Intron Splice Sites in a Genomic Tandem Array
title_sort dinoflagellate gene structure and intron splice sites in a genomic tandem array
topic Original Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5032977/
https://www.ncbi.nlm.nih.gov/pubmed/25963315
http://dx.doi.org/10.1111/jeu.12230
work_keys_str_mv AT mendezgregorys dinoflagellategenestructureandintronsplicesitesinagenomictandemarray
AT delwichecharlesf dinoflagellategenestructureandintronsplicesitesinagenomictandemarray
AT aptkirke dinoflagellategenestructureandintronsplicesitesinagenomictandemarray
AT lippmeierjcasey dinoflagellategenestructureandintronsplicesitesinagenomictandemarray