Cargando…

Sequence-Based Analysis of Structural Organization and Composition of the Cultivated Sunflower (Helianthus annuus L.) Genome

Sunflower is an important oilseed crop, as well as a model system for evolutionary studies, but its 3.6 gigabase genome has proven difficult to assemble, in part because of the high repeat content of its genome. Here we report on the sequencing, assembly, and analyses of 96 randomly chosen BACs from...

Descripción completa

Detalles Bibliográficos
Autores principales: Gill, Navdeep, Buti, Matteo, Kane, Nolan, Bellec, Arnaud, Helmstetter, Nicolas, Berges, Hélène, Rieseberg, Loren H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4085609/
https://www.ncbi.nlm.nih.gov/pubmed/24833511
http://dx.doi.org/10.3390/biology3020295
_version_ 1782324688298442752
author Gill, Navdeep
Buti, Matteo
Kane, Nolan
Bellec, Arnaud
Helmstetter, Nicolas
Berges, Hélène
Rieseberg, Loren H.
author_facet Gill, Navdeep
Buti, Matteo
Kane, Nolan
Bellec, Arnaud
Helmstetter, Nicolas
Berges, Hélène
Rieseberg, Loren H.
author_sort Gill, Navdeep
collection PubMed
description Sunflower is an important oilseed crop, as well as a model system for evolutionary studies, but its 3.6 gigabase genome has proven difficult to assemble, in part because of the high repeat content of its genome. Here we report on the sequencing, assembly, and analyses of 96 randomly chosen BACs from sunflower to provide additional information on the repeat content of the sunflower genome, assess how repetitive elements in the sunflower genome are organized relative to genes, and compare the genomic distribution of these repeats to that found in other food crops and model species. We also examine the expression of transposable element-related transcripts in EST databases for sunflower to determine the representation of repeats in the transcriptome and to measure their transcriptional activity. Our data confirm previous reports in suggesting that the sunflower genome is >78% repetitive. Sunflower repeats share very little similarity to other plant repeats such as those of Arabidopsis, rice, maize and wheat; overall 28% of repeats are “novel” to sunflower. The repetitive sequences appear to be randomly distributed within the sequenced BACs. Assuming the 96 BACs are representative of the genome as a whole, then approximately 5.2% of the sunflower genome comprises non TE-related genic sequence, with an average gene density of 18kbp/gene. Expression levels of these transposable elements indicate tissue specificity and differential expression in vegetative and reproductive tissues, suggesting that expressed TEs might contribute to sunflower development. The assembled BACs will also be useful for assessing the quality of several different draft assemblies of the sunflower genome and for annotating the reference sequence.
format Online
Article
Text
id pubmed-4085609
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-40856092014-07-08 Sequence-Based Analysis of Structural Organization and Composition of the Cultivated Sunflower (Helianthus annuus L.) Genome Gill, Navdeep Buti, Matteo Kane, Nolan Bellec, Arnaud Helmstetter, Nicolas Berges, Hélène Rieseberg, Loren H. Biology (Basel) Article Sunflower is an important oilseed crop, as well as a model system for evolutionary studies, but its 3.6 gigabase genome has proven difficult to assemble, in part because of the high repeat content of its genome. Here we report on the sequencing, assembly, and analyses of 96 randomly chosen BACs from sunflower to provide additional information on the repeat content of the sunflower genome, assess how repetitive elements in the sunflower genome are organized relative to genes, and compare the genomic distribution of these repeats to that found in other food crops and model species. We also examine the expression of transposable element-related transcripts in EST databases for sunflower to determine the representation of repeats in the transcriptome and to measure their transcriptional activity. Our data confirm previous reports in suggesting that the sunflower genome is >78% repetitive. Sunflower repeats share very little similarity to other plant repeats such as those of Arabidopsis, rice, maize and wheat; overall 28% of repeats are “novel” to sunflower. The repetitive sequences appear to be randomly distributed within the sequenced BACs. Assuming the 96 BACs are representative of the genome as a whole, then approximately 5.2% of the sunflower genome comprises non TE-related genic sequence, with an average gene density of 18kbp/gene. Expression levels of these transposable elements indicate tissue specificity and differential expression in vegetative and reproductive tissues, suggesting that expressed TEs might contribute to sunflower development. The assembled BACs will also be useful for assessing the quality of several different draft assemblies of the sunflower genome and for annotating the reference sequence. MDPI 2014-04-16 /pmc/articles/PMC4085609/ /pubmed/24833511 http://dx.doi.org/10.3390/biology3020295 Text en © 2014 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).
spellingShingle Article
Gill, Navdeep
Buti, Matteo
Kane, Nolan
Bellec, Arnaud
Helmstetter, Nicolas
Berges, Hélène
Rieseberg, Loren H.
Sequence-Based Analysis of Structural Organization and Composition of the Cultivated Sunflower (Helianthus annuus L.) Genome
title Sequence-Based Analysis of Structural Organization and Composition of the Cultivated Sunflower (Helianthus annuus L.) Genome
title_full Sequence-Based Analysis of Structural Organization and Composition of the Cultivated Sunflower (Helianthus annuus L.) Genome
title_fullStr Sequence-Based Analysis of Structural Organization and Composition of the Cultivated Sunflower (Helianthus annuus L.) Genome
title_full_unstemmed Sequence-Based Analysis of Structural Organization and Composition of the Cultivated Sunflower (Helianthus annuus L.) Genome
title_short Sequence-Based Analysis of Structural Organization and Composition of the Cultivated Sunflower (Helianthus annuus L.) Genome
title_sort sequence-based analysis of structural organization and composition of the cultivated sunflower (helianthus annuus l.) genome
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4085609/
https://www.ncbi.nlm.nih.gov/pubmed/24833511
http://dx.doi.org/10.3390/biology3020295
work_keys_str_mv AT gillnavdeep sequencebasedanalysisofstructuralorganizationandcompositionofthecultivatedsunflowerhelianthusannuuslgenome
AT butimatteo sequencebasedanalysisofstructuralorganizationandcompositionofthecultivatedsunflowerhelianthusannuuslgenome
AT kanenolan sequencebasedanalysisofstructuralorganizationandcompositionofthecultivatedsunflowerhelianthusannuuslgenome
AT bellecarnaud sequencebasedanalysisofstructuralorganizationandcompositionofthecultivatedsunflowerhelianthusannuuslgenome
AT helmstetternicolas sequencebasedanalysisofstructuralorganizationandcompositionofthecultivatedsunflowerhelianthusannuuslgenome
AT bergeshelene sequencebasedanalysisofstructuralorganizationandcompositionofthecultivatedsunflowerhelianthusannuuslgenome
AT rieseberglorenh sequencebasedanalysisofstructuralorganizationandcompositionofthecultivatedsunflowerhelianthusannuuslgenome