Evolutionary insights from de novo transcriptome assembly and SNP discovery in California white oaks

BACKGROUND: Reference transcriptomes provide valuable resources for understanding evolution within and among species. We de novo assembled and annotated a reference transcriptome for Quercus lobata and Q. garryana and identified single-nucleotide polymorphisms (SNPs) to provide resources for forest...

Bibliographic Details
Main authors: Cokus, Shawn J., Gugger, Paul F., Sork, Victoria L.
Format: Online Article Text
Language: English
Published: BioMed Central 2015
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4517385/
https://www.ncbi.nlm.nih.gov/pubmed/26215102
http://dx.doi.org/10.1186/s12864-015-1761-4
author Cokus, Shawn J.
Gugger, Paul F.
Sork, Victoria L.
collection PubMed
description BACKGROUND: Reference transcriptomes provide valuable resources for understanding evolution within and among species. We de novo assembled and annotated a reference transcriptome for Quercus lobata and Q. garryana and identified single-nucleotide polymorphisms (SNPs) to provide resources for forest genomicists studying this ecologically and economically important genus. We further performed preliminary analyses of genes important in interspecific divergent (positive) selection that might explain ecological differences among species, estimating rates of nonsynonymous to synonymous substitutions (d(N)/d(S)) and Fay and Wu’s H. Functional classes of genes were tested for unusually high d(N)/d(S) or low H consistent with divergent positive selection. RESULTS: Our draft transcriptome is among the most complete for oaks, including 83,644 contigs (23,329 ≥ 1 kbp), 14,898 complete and 13,778 partial gene models, and functional annotations for 9,431 Arabidopsis orthologs and 19,365 contigs with Pfam hits. We identified 1.7 million possible sequence variants including 1.1 million high-quality diallelic SNPs, among the largest sets identified in any tree. 11 of 18 functional categories with significantly elevated d(N)/d(S) are involved in disease response, including 50+ genes with d(N)/d(S) > 1. Other high-d(N)/d(S) genes are involved in biotic response, flowering and growth, or regulatory processes. In contrast, median d(N)/d(S) was low (0.22), suggesting that purifying selection influences most genes. No functional categories have unusually low H. CONCLUSIONS: These results offer preliminary support for the hypothesis that divergent selection on pathogen resistance is an important factor in species divergence in these hybridizing California oaks. Our transcriptome provides a solid foundation for future studies of gene expression, natural selection, and speciation in Quercus.
ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-1761-4) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4517385
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-4517385 2015-07-29 Evolutionary insights from de novo transcriptome assembly and SNP discovery in California white oaks Cokus, Shawn J. Gugger, Paul F. Sork, Victoria L. BMC Genomics Research Article BioMed Central 2015-07-28 /pmc/articles/PMC4517385/ /pubmed/26215102 http://dx.doi.org/10.1186/s12864-015-1761-4 Text en © Cokus et al. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4517385/
https://www.ncbi.nlm.nih.gov/pubmed/26215102
http://dx.doi.org/10.1186/s12864-015-1761-4