Cargando…

A comparative analysis of methods for de novo assembly of hymenopteran genomes using either haploid or diploid samples

Diverse invertebrate taxa including all 200,000 species of Hymenoptera (ants, bees, wasps and sawflies) have a haplodiploid sex determination system, where females are diploid and males are haploid. Thus, hymenopteran genome projects can make use of DNA from a single haploid male sample, which is as...

Descripción completa

Detalles Bibliográficos
Autores principales: Yahav, Tal, Privman, Eyal
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6482151/
https://www.ncbi.nlm.nih.gov/pubmed/31019201
http://dx.doi.org/10.1038/s41598-019-42795-6
_version_ 1783413832559886336
author Yahav, Tal
Privman, Eyal
author_facet Yahav, Tal
Privman, Eyal
author_sort Yahav, Tal
collection PubMed
description Diverse invertebrate taxa including all 200,000 species of Hymenoptera (ants, bees, wasps and sawflies) have a haplodiploid sex determination system, where females are diploid and males are haploid. Thus, hymenopteran genome projects can make use of DNA from a single haploid male sample, which is assumed advantageous for genome assembly. For the purpose of gene annotation, transcriptome sequencing is usually conducted using RNA from a pool of individuals. We conducted a comparative analysis of genome and transcriptome assembly and annotation methods, using genetic sources of different ploidy: (1) DNA from a haploid male or a diploid female (2) RNA from the same haploid male or a pool of individuals. We predicted that the use of a haploid male as opposed to a diploid female will simplify the genome assembly and gene annotation thanks to the lack of heterozygosity. Using DNA and RNA from the same haploid individual is expected to provide better confidence in transcript-to-genome alignment, and improve the annotation of gene structure in terms of the exon/intron boundaries. The haploid genome assemblies proved to be more contiguous, with both contig and scaffold N50 size at least threefold greater than their diploid counterparts. Completeness evaluation showed mixed results. The SOAPdenovo2 diploid assembly was missing more genes than the haploid assembly. The SPAdes diploid assembly had more complete genes, but a higher level of duplicates, and a greatly overestimated genome size. When aligning the two transcriptomes against the male genome, the male transcriptome gave 2–3% more complete transcripts than the pool transcriptome for genes with comparable expression levels in both transcriptomes. However, this advantage disappears in the final results of the gene annotation pipeline that incorporates evidence from homologous proteins. The RNA pool is still required to obtain the full transcriptome with genes that are expressed in other life stages and castes. In conclusion, the use of a haploid source material for a de novo genome project provides a substantial advantage to the quality of the genome draft and the use of RNA from the same haploid individual for transcriptome to genome alignment provides a minor advantage for genes that are expressed in the adult male.
format Online
Article
Text
id pubmed-6482151
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-64821512019-05-03 A comparative analysis of methods for de novo assembly of hymenopteran genomes using either haploid or diploid samples Yahav, Tal Privman, Eyal Sci Rep Article Diverse invertebrate taxa including all 200,000 species of Hymenoptera (ants, bees, wasps and sawflies) have a haplodiploid sex determination system, where females are diploid and males are haploid. Thus, hymenopteran genome projects can make use of DNA from a single haploid male sample, which is assumed advantageous for genome assembly. For the purpose of gene annotation, transcriptome sequencing is usually conducted using RNA from a pool of individuals. We conducted a comparative analysis of genome and transcriptome assembly and annotation methods, using genetic sources of different ploidy: (1) DNA from a haploid male or a diploid female (2) RNA from the same haploid male or a pool of individuals. We predicted that the use of a haploid male as opposed to a diploid female will simplify the genome assembly and gene annotation thanks to the lack of heterozygosity. Using DNA and RNA from the same haploid individual is expected to provide better confidence in transcript-to-genome alignment, and improve the annotation of gene structure in terms of the exon/intron boundaries. The haploid genome assemblies proved to be more contiguous, with both contig and scaffold N50 size at least threefold greater than their diploid counterparts. Completeness evaluation showed mixed results. The SOAPdenovo2 diploid assembly was missing more genes than the haploid assembly. The SPAdes diploid assembly had more complete genes, but a higher level of duplicates, and a greatly overestimated genome size. When aligning the two transcriptomes against the male genome, the male transcriptome gave 2–3% more complete transcripts than the pool transcriptome for genes with comparable expression levels in both transcriptomes. However, this advantage disappears in the final results of the gene annotation pipeline that incorporates evidence from homologous proteins. The RNA pool is still required to obtain the full transcriptome with genes that are expressed in other life stages and castes. In conclusion, the use of a haploid source material for a de novo genome project provides a substantial advantage to the quality of the genome draft and the use of RNA from the same haploid individual for transcriptome to genome alignment provides a minor advantage for genes that are expressed in the adult male. Nature Publishing Group UK 2019-04-24 /pmc/articles/PMC6482151/ /pubmed/31019201 http://dx.doi.org/10.1038/s41598-019-42795-6 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Yahav, Tal
Privman, Eyal
A comparative analysis of methods for de novo assembly of hymenopteran genomes using either haploid or diploid samples
title A comparative analysis of methods for de novo assembly of hymenopteran genomes using either haploid or diploid samples
title_full A comparative analysis of methods for de novo assembly of hymenopteran genomes using either haploid or diploid samples
title_fullStr A comparative analysis of methods for de novo assembly of hymenopteran genomes using either haploid or diploid samples
title_full_unstemmed A comparative analysis of methods for de novo assembly of hymenopteran genomes using either haploid or diploid samples
title_short A comparative analysis of methods for de novo assembly of hymenopteran genomes using either haploid or diploid samples
title_sort comparative analysis of methods for de novo assembly of hymenopteran genomes using either haploid or diploid samples
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6482151/
https://www.ncbi.nlm.nih.gov/pubmed/31019201
http://dx.doi.org/10.1038/s41598-019-42795-6
work_keys_str_mv AT yahavtal acomparativeanalysisofmethodsfordenovoassemblyofhymenopterangenomesusingeitherhaploidordiploidsamples
AT privmaneyal acomparativeanalysisofmethodsfordenovoassemblyofhymenopterangenomesusingeitherhaploidordiploidsamples
AT yahavtal comparativeanalysisofmethodsfordenovoassemblyofhymenopterangenomesusingeitherhaploidordiploidsamples
AT privmaneyal comparativeanalysisofmethodsfordenovoassemblyofhymenopterangenomesusingeitherhaploidordiploidsamples