Cargando…
A comparative analysis of methods for de novo assembly of hymenopteran genomes using either haploid or diploid samples
Diverse invertebrate taxa including all 200,000 species of Hymenoptera (ants, bees, wasps and sawflies) have a haplodiploid sex determination system, where females are diploid and males are haploid. Thus, hymenopteran genome projects can make use of DNA from a single haploid male sample, which is as...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6482151/ https://www.ncbi.nlm.nih.gov/pubmed/31019201 http://dx.doi.org/10.1038/s41598-019-42795-6 |
_version_ | 1783413832559886336 |
---|---|
author | Yahav, Tal Privman, Eyal |
author_facet | Yahav, Tal Privman, Eyal |
author_sort | Yahav, Tal |
collection | PubMed |
description | Diverse invertebrate taxa including all 200,000 species of Hymenoptera (ants, bees, wasps and sawflies) have a haplodiploid sex determination system, where females are diploid and males are haploid. Thus, hymenopteran genome projects can make use of DNA from a single haploid male sample, which is assumed advantageous for genome assembly. For the purpose of gene annotation, transcriptome sequencing is usually conducted using RNA from a pool of individuals. We conducted a comparative analysis of genome and transcriptome assembly and annotation methods, using genetic sources of different ploidy: (1) DNA from a haploid male or a diploid female (2) RNA from the same haploid male or a pool of individuals. We predicted that the use of a haploid male as opposed to a diploid female will simplify the genome assembly and gene annotation thanks to the lack of heterozygosity. Using DNA and RNA from the same haploid individual is expected to provide better confidence in transcript-to-genome alignment, and improve the annotation of gene structure in terms of the exon/intron boundaries. The haploid genome assemblies proved to be more contiguous, with both contig and scaffold N50 size at least threefold greater than their diploid counterparts. Completeness evaluation showed mixed results. The SOAPdenovo2 diploid assembly was missing more genes than the haploid assembly. The SPAdes diploid assembly had more complete genes, but a higher level of duplicates, and a greatly overestimated genome size. When aligning the two transcriptomes against the male genome, the male transcriptome gave 2–3% more complete transcripts than the pool transcriptome for genes with comparable expression levels in both transcriptomes. However, this advantage disappears in the final results of the gene annotation pipeline that incorporates evidence from homologous proteins. The RNA pool is still required to obtain the full transcriptome with genes that are expressed in other life stages and castes. In conclusion, the use of a haploid source material for a de novo genome project provides a substantial advantage to the quality of the genome draft and the use of RNA from the same haploid individual for transcriptome to genome alignment provides a minor advantage for genes that are expressed in the adult male. |
format | Online Article Text |
id | pubmed-6482151 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-64821512019-05-03 A comparative analysis of methods for de novo assembly of hymenopteran genomes using either haploid or diploid samples Yahav, Tal Privman, Eyal Sci Rep Article Diverse invertebrate taxa including all 200,000 species of Hymenoptera (ants, bees, wasps and sawflies) have a haplodiploid sex determination system, where females are diploid and males are haploid. Thus, hymenopteran genome projects can make use of DNA from a single haploid male sample, which is assumed advantageous for genome assembly. For the purpose of gene annotation, transcriptome sequencing is usually conducted using RNA from a pool of individuals. We conducted a comparative analysis of genome and transcriptome assembly and annotation methods, using genetic sources of different ploidy: (1) DNA from a haploid male or a diploid female (2) RNA from the same haploid male or a pool of individuals. We predicted that the use of a haploid male as opposed to a diploid female will simplify the genome assembly and gene annotation thanks to the lack of heterozygosity. Using DNA and RNA from the same haploid individual is expected to provide better confidence in transcript-to-genome alignment, and improve the annotation of gene structure in terms of the exon/intron boundaries. The haploid genome assemblies proved to be more contiguous, with both contig and scaffold N50 size at least threefold greater than their diploid counterparts. Completeness evaluation showed mixed results. The SOAPdenovo2 diploid assembly was missing more genes than the haploid assembly. The SPAdes diploid assembly had more complete genes, but a higher level of duplicates, and a greatly overestimated genome size. When aligning the two transcriptomes against the male genome, the male transcriptome gave 2–3% more complete transcripts than the pool transcriptome for genes with comparable expression levels in both transcriptomes. However, this advantage disappears in the final results of the gene annotation pipeline that incorporates evidence from homologous proteins. The RNA pool is still required to obtain the full transcriptome with genes that are expressed in other life stages and castes. In conclusion, the use of a haploid source material for a de novo genome project provides a substantial advantage to the quality of the genome draft and the use of RNA from the same haploid individual for transcriptome to genome alignment provides a minor advantage for genes that are expressed in the adult male. Nature Publishing Group UK 2019-04-24 /pmc/articles/PMC6482151/ /pubmed/31019201 http://dx.doi.org/10.1038/s41598-019-42795-6 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Yahav, Tal Privman, Eyal A comparative analysis of methods for de novo assembly of hymenopteran genomes using either haploid or diploid samples |
title | A comparative analysis of methods for de novo assembly of hymenopteran genomes using either haploid or diploid samples |
title_full | A comparative analysis of methods for de novo assembly of hymenopteran genomes using either haploid or diploid samples |
title_fullStr | A comparative analysis of methods for de novo assembly of hymenopteran genomes using either haploid or diploid samples |
title_full_unstemmed | A comparative analysis of methods for de novo assembly of hymenopteran genomes using either haploid or diploid samples |
title_short | A comparative analysis of methods for de novo assembly of hymenopteran genomes using either haploid or diploid samples |
title_sort | comparative analysis of methods for de novo assembly of hymenopteran genomes using either haploid or diploid samples |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6482151/ https://www.ncbi.nlm.nih.gov/pubmed/31019201 http://dx.doi.org/10.1038/s41598-019-42795-6 |
work_keys_str_mv | AT yahavtal acomparativeanalysisofmethodsfordenovoassemblyofhymenopterangenomesusingeitherhaploidordiploidsamples AT privmaneyal acomparativeanalysisofmethodsfordenovoassemblyofhymenopterangenomesusingeitherhaploidordiploidsamples AT yahavtal comparativeanalysisofmethodsfordenovoassemblyofhymenopterangenomesusingeitherhaploidordiploidsamples AT privmaneyal comparativeanalysisofmethodsfordenovoassemblyofhymenopterangenomesusingeitherhaploidordiploidsamples |