Cargando…
The repetitive component of the sunflower genome as shown by different procedures for assembling next generation sequencing reads
BACKGROUND: Next generation sequencing provides a powerful tool to study genome structure in species whose genomes are far from being completely sequenced. In this work we describe and compare different computational approaches to evaluate the repetitive component of the genome of sunflower, by usin...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3852528/ https://www.ncbi.nlm.nih.gov/pubmed/24093210 http://dx.doi.org/10.1186/1471-2164-14-686 |
_version_ | 1782478680041193472 |
---|---|
author | Natali, Lucia Cossu, Rosa Maria Barghini, Elena Giordani, Tommaso Buti, Matteo Mascagni, Flavia Morgante, Michele Gill, Navdeep Kane, Nolan C Rieseberg, Loren Cavallini, Andrea |
author_facet | Natali, Lucia Cossu, Rosa Maria Barghini, Elena Giordani, Tommaso Buti, Matteo Mascagni, Flavia Morgante, Michele Gill, Navdeep Kane, Nolan C Rieseberg, Loren Cavallini, Andrea |
author_sort | Natali, Lucia |
collection | PubMed |
description | BACKGROUND: Next generation sequencing provides a powerful tool to study genome structure in species whose genomes are far from being completely sequenced. In this work we describe and compare different computational approaches to evaluate the repetitive component of the genome of sunflower, by using medium/low coverage Illumina or 454 libraries. RESULTS: By varying sequencing technology (Illumina or 454), coverage (0.55 x-1.25 x), assemblers and assembly procedures, six different genomic databases were produced. The annotation of these databases showed that they were composed of different proportions of repetitive DNA families. The final assembly of the sequences belonging to the six databases produced a whole genome set of 283,800 contigs. The redundancy of each contig was estimated by mapping the whole genome set with a large Illumina read set and measuring the number of matched Illumina reads. The repetitive component amounted to 81% of the sunflower genome, that is composed mainly of numerous families of Gypsy and Copia retrotransposons. Also many families of non autonomous retrotransposons and DNA transposons (especially of the Helitron superfamily) were identified. CONCLUSIONS: The results substantially matched those previously obtained by using a Sanger-sequenced shotgun library and a standard 454 whole-genome-shotgun approach, indicating the reliability of the proposed procedures also for other species. The repetitive sequences were collected to produce a database, SUNREP, that will be useful for the annotation of the sunflower genome sequence and for studying the genome evolution in dicotyledons. |
format | Online Article Text |
id | pubmed-3852528 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-38525282013-12-06 The repetitive component of the sunflower genome as shown by different procedures for assembling next generation sequencing reads Natali, Lucia Cossu, Rosa Maria Barghini, Elena Giordani, Tommaso Buti, Matteo Mascagni, Flavia Morgante, Michele Gill, Navdeep Kane, Nolan C Rieseberg, Loren Cavallini, Andrea BMC Genomics Research Article BACKGROUND: Next generation sequencing provides a powerful tool to study genome structure in species whose genomes are far from being completely sequenced. In this work we describe and compare different computational approaches to evaluate the repetitive component of the genome of sunflower, by using medium/low coverage Illumina or 454 libraries. RESULTS: By varying sequencing technology (Illumina or 454), coverage (0.55 x-1.25 x), assemblers and assembly procedures, six different genomic databases were produced. The annotation of these databases showed that they were composed of different proportions of repetitive DNA families. The final assembly of the sequences belonging to the six databases produced a whole genome set of 283,800 contigs. The redundancy of each contig was estimated by mapping the whole genome set with a large Illumina read set and measuring the number of matched Illumina reads. The repetitive component amounted to 81% of the sunflower genome, that is composed mainly of numerous families of Gypsy and Copia retrotransposons. Also many families of non autonomous retrotransposons and DNA transposons (especially of the Helitron superfamily) were identified. CONCLUSIONS: The results substantially matched those previously obtained by using a Sanger-sequenced shotgun library and a standard 454 whole-genome-shotgun approach, indicating the reliability of the proposed procedures also for other species. The repetitive sequences were collected to produce a database, SUNREP, that will be useful for the annotation of the sunflower genome sequence and for studying the genome evolution in dicotyledons. BioMed Central 2013-10-06 /pmc/articles/PMC3852528/ /pubmed/24093210 http://dx.doi.org/10.1186/1471-2164-14-686 Text en Copyright © 2013 Natali et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Natali, Lucia Cossu, Rosa Maria Barghini, Elena Giordani, Tommaso Buti, Matteo Mascagni, Flavia Morgante, Michele Gill, Navdeep Kane, Nolan C Rieseberg, Loren Cavallini, Andrea The repetitive component of the sunflower genome as shown by different procedures for assembling next generation sequencing reads |
title | The repetitive component of the sunflower genome as shown by different procedures for assembling next generation sequencing reads |
title_full | The repetitive component of the sunflower genome as shown by different procedures for assembling next generation sequencing reads |
title_fullStr | The repetitive component of the sunflower genome as shown by different procedures for assembling next generation sequencing reads |
title_full_unstemmed | The repetitive component of the sunflower genome as shown by different procedures for assembling next generation sequencing reads |
title_short | The repetitive component of the sunflower genome as shown by different procedures for assembling next generation sequencing reads |
title_sort | repetitive component of the sunflower genome as shown by different procedures for assembling next generation sequencing reads |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3852528/ https://www.ncbi.nlm.nih.gov/pubmed/24093210 http://dx.doi.org/10.1186/1471-2164-14-686 |
work_keys_str_mv | AT natalilucia therepetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads AT cossurosamaria therepetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads AT barghinielena therepetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads AT giordanitommaso therepetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads AT butimatteo therepetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads AT mascagniflavia therepetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads AT morgantemichele therepetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads AT gillnavdeep therepetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads AT kanenolanc therepetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads AT riesebergloren therepetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads AT cavalliniandrea therepetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads AT natalilucia repetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads AT cossurosamaria repetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads AT barghinielena repetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads AT giordanitommaso repetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads AT butimatteo repetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads AT mascagniflavia repetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads AT morgantemichele repetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads AT gillnavdeep repetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads AT kanenolanc repetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads AT riesebergloren repetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads AT cavalliniandrea repetitivecomponentofthesunflowergenomeasshownbydifferentproceduresforassemblingnextgenerationsequencingreads |