Cargando…

The repetitive component of the sunflower genome as shown by different procedures for assembling next generation sequencing reads

BACKGROUND: Next generation sequencing provides a powerful tool to study genome structure in species whose genomes are far from being completely sequenced. In this work we describe and compare different computational approaches to evaluate the repetitive component of the genome of sunflower, by usin...

Descripción completa

Detalles Bibliográficos
Autores principales: Natali, Lucia, Cossu, Rosa Maria, Barghini, Elena, Giordani, Tommaso, Buti, Matteo, Mascagni, Flavia, Morgante, Michele, Gill, Navdeep, Kane, Nolan C, Rieseberg, Loren, Cavallini, Andrea
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3852528/
https://www.ncbi.nlm.nih.gov/pubmed/24093210
http://dx.doi.org/10.1186/1471-2164-14-686
_version_ 1782478680041193472
author Natali, Lucia
Cossu, Rosa Maria
Barghini, Elena
Giordani, Tommaso
Buti, Matteo
Mascagni, Flavia
Morgante, Michele
Gill, Navdeep
Kane, Nolan C
Rieseberg, Loren
Cavallini, Andrea
author_facet Natali, Lucia
Cossu, Rosa Maria
Barghini, Elena
Giordani, Tommaso
Buti, Matteo
Mascagni, Flavia
Morgante, Michele
Gill, Navdeep
Kane, Nolan C
Rieseberg, Loren
Cavallini, Andrea
author_sort Natali, Lucia
collection PubMed
description BACKGROUND: Next generation sequencing provides a powerful tool to study genome structure in species whose genomes are far from being completely sequenced. In this work we describe and compare different computational approaches to evaluate the repetitive component of the genome of sunflower, by using medium/low coverage Illumina or 454 libraries. RESULTS: By varying sequencing technology (Illumina or 454), coverage (0.55 x-1.25 x), assemblers and assembly procedures, six different genomic databases were produced. The annotation of these databases showed that they were composed of different proportions of repetitive DNA families. The final assembly of the sequences belonging to the six databases produced a whole genome set of 283,800 contigs. The redundancy of each contig was estimated by mapping the whole genome set with a large Illumina read set and measuring the number of matched Illumina reads. The repetitive component amounted to 81% of the sunflower genome, that is composed mainly of numerous families of Gypsy and Copia retrotransposons. Also many families of non autonomous retrotransposons and DNA transposons (especially of the Helitron superfamily) were identified. CONCLUSIONS: The results substantially matched those previously obtained by using a Sanger-sequenced shotgun library and a standard 454 whole-genome-shotgun approach, indicating the reliability of the proposed procedures also for other species. The repetitive sequences were collected to produce a database, SUNREP, that will be useful for the annotation of the sunflower genome sequence and for studying the genome evolution in dicotyledons.
format Online
Article
Text
id pubmed-3852528
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-38525282013-12-06 The repetitive component of the sunflower genome as shown by different procedures for assembling next generation sequencing reads Natali, Lucia Cossu, Rosa Maria Barghini, Elena Giordani, Tommaso Buti, Matteo Mascagni, Flavia Morgante, Michele Gill, Navdeep Kane, Nolan C Rieseberg, Loren Cavallini, Andrea BMC Genomics Research Article BACKGROUND: Next generation sequencing provides a powerful tool to study genome structure in species whose genomes are far from being completely sequenced. In this work we describe and compare different computational approaches to evaluate the repetitive component of the genome of sunflower, by using medium/low coverage Illumina or 454 libraries. RESULTS: By varying sequencing technology (Illumina or 454), coverage (0.55 x-1.25 x), assemblers and assembly procedures, six different genomic databases were produced. The annotation of these databases showed that they were composed of different proportions of repetitive DNA families. The final assembly of the sequences belonging to the six databases produced a whole genome set of 283,800 contigs. The redundancy of each contig was estimated by mapping the whole genome set with a large Illumina read set and measuring the number of matched Illumina reads. The repetitive component amounted to 81% of the sunflower genome, that is composed mainly of numerous families of Gypsy and Copia retrotransposons. Also many families of non autonomous retrotransposons and DNA transposons (especially of the Helitron superfamily) were identified. CONCLUSIONS: The results substantially matched those previously obtained by using a Sanger-sequenced shotgun library and a standard 454 whole-genome-shotgun approach, indicating the reliability of the proposed procedures also for other species. The repetitive sequences were collected to produce a database, SUNREP, that will be useful for the annotation of the sunflower genome sequence and for studying the genome evolution in dicotyledons. BioMed Central 2013-10-06 /pmc/articles/PMC3852528/ /pubmed/24093210 http://dx.doi.org/10.1186/1471-2164-14-686 Text en Copyright © 2013 Natali et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Natali, Lucia
Cossu, Rosa Maria
Barghini, Elena
Giordani, Tommaso
Buti, Matteo
Mascagni, Flavia
Morgante, Michele
Gill, Navdeep
Kane, Nolan C
Rieseberg, Loren
Cavallini, Andrea
The repetitive component of the sunflower genome as shown by different procedures for assembling next generation sequencing reads
title The repetitive component of the sunflower genome as shown by different procedures for assembling next generation sequencing reads
title_full The repetitive component of the sunflower genome as shown by different procedures for assembling next generation sequencing reads
title_fullStr The repetitive component of the sunflower genome as shown by different procedures for assembling next generation sequencing reads
title_full_unstemmed The repetitive component of the sunflower genome as shown by different procedures for assembling next generation sequencing reads
title_short The repetitive component of the sunflower genome as shown by different procedures for assembling next generation sequencing reads
title_sort repetitive component of the sunflower genome as shown by different procedures for assembling next generation sequencing reads
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3852528/
https://www.ncbi.nlm.nih.gov/pubmed/24093210
http://dx.doi.org/10.1186/1471-2164-14-686
work_keys_str_mv AT natalilucia therepetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads
AT cossurosamaria therepetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads
AT barghinielena therepetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads
AT giordanitommaso therepetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads
AT butimatteo therepetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads
AT mascagniflavia therepetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads
AT morgantemichele therepetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads
AT gillnavdeep therepetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads
AT kanenolanc therepetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads
AT riesebergloren therepetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads
AT cavalliniandrea therepetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads
AT natalilucia repetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads
AT cossurosamaria repetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads
AT barghinielena repetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads
AT giordanitommaso repetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads
AT butimatteo repetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads
AT mascagniflavia repetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads
AT morgantemichele repetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads
AT gillnavdeep repetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads
AT kanenolanc repetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads
AT riesebergloren repetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads
AT cavalliniandrea repetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads