Cargando…
Evolutionary insights from de novo transcriptome assembly and SNP discovery in California white oaks
BACKGROUND: Reference transcriptomes provide valuable resources for understanding evolution within and among species. We de novo assembled and annotated a reference transcriptome for Quercus lobata and Q. garryana and identified single-nucleotide polymorphisms (SNPs) to provide resources for forest...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4517385/ https://www.ncbi.nlm.nih.gov/pubmed/26215102 http://dx.doi.org/10.1186/s12864-015-1761-4 |
_version_ | 1782383180105383936 |
---|---|
author | Cokus, Shawn J. Gugger, Paul F. Sork, Victoria L. |
author_facet | Cokus, Shawn J. Gugger, Paul F. Sork, Victoria L. |
author_sort | Cokus, Shawn J. |
collection | PubMed |
description | BACKGROUND: Reference transcriptomes provide valuable resources for understanding evolution within and among species. We de novo assembled and annotated a reference transcriptome for Quercus lobata and Q. garryana and identified single-nucleotide polymorphisms (SNPs) to provide resources for forest genomicists studying this ecologically and economically important genus. We further performed preliminary analyses of genes important in interspecific divergent (positive) selection that might explain ecological differences among species, estimating rates of nonsynonymous to synonymous substitutions (d(N)/d(S)) and Fay and Wu’s H. Functional classes of genes were tested for unusually high d(N)/d(S) or low H consistent with divergent positive selection. RESULTS: Our draft transcriptome is among the most complete for oaks, including 83,644 contigs (23,329 ≥ 1 kbp), 14,898 complete and 13,778 partial gene models, and functional annotations for 9,431 Arabidopsis orthologs and 19,365 contigs with Pfam hits. We identified 1.7 million possible sequence variants including 1.1 million high-quality diallelic SNPs — among the largest sets identified in any tree. 11 of 18 functional categories with significantly elevated d(N)/d(S) are involved in disease response, including 50+ genes with d(N)/d(S) > 1. Other high-d(N)/d(S) genes are involved in biotic response, flowering and growth, or regulatory processes. In contrast, median d(N)/d(S) was low (0.22), suggesting that purifying selection influences most genes. No functional categories have unusually low H. CONCLUSIONS: These results offer preliminary support for the hypothesis that divergent selection at pathogen resistance are important factors in species divergence in these hybridizing California oaks. Our transcriptome provides a solid foundation for future studies of gene expression, natural selection, and speciation in Quercus. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-1761-4) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4517385 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-45173852015-07-29 Evolutionary insights from de novo transcriptome assembly and SNP discovery in California white oaks Cokus, Shawn J. Gugger, Paul F. Sork, Victoria L. BMC Genomics Research Article BACKGROUND: Reference transcriptomes provide valuable resources for understanding evolution within and among species. We de novo assembled and annotated a reference transcriptome for Quercus lobata and Q. garryana and identified single-nucleotide polymorphisms (SNPs) to provide resources for forest genomicists studying this ecologically and economically important genus. We further performed preliminary analyses of genes important in interspecific divergent (positive) selection that might explain ecological differences among species, estimating rates of nonsynonymous to synonymous substitutions (d(N)/d(S)) and Fay and Wu’s H. Functional classes of genes were tested for unusually high d(N)/d(S) or low H consistent with divergent positive selection. RESULTS: Our draft transcriptome is among the most complete for oaks, including 83,644 contigs (23,329 ≥ 1 kbp), 14,898 complete and 13,778 partial gene models, and functional annotations for 9,431 Arabidopsis orthologs and 19,365 contigs with Pfam hits. We identified 1.7 million possible sequence variants including 1.1 million high-quality diallelic SNPs — among the largest sets identified in any tree. 11 of 18 functional categories with significantly elevated d(N)/d(S) are involved in disease response, including 50+ genes with d(N)/d(S) > 1. Other high-d(N)/d(S) genes are involved in biotic response, flowering and growth, or regulatory processes. In contrast, median d(N)/d(S) was low (0.22), suggesting that purifying selection influences most genes. No functional categories have unusually low H. CONCLUSIONS: These results offer preliminary support for the hypothesis that divergent selection at pathogen resistance are important factors in species divergence in these hybridizing California oaks. Our transcriptome provides a solid foundation for future studies of gene expression, natural selection, and speciation in Quercus. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-1761-4) contains supplementary material, which is available to authorized users. BioMed Central 2015-07-28 /pmc/articles/PMC4517385/ /pubmed/26215102 http://dx.doi.org/10.1186/s12864-015-1761-4 Text en © Cokus et al. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Cokus, Shawn J. Gugger, Paul F. Sork, Victoria L. Evolutionary insights from de novo transcriptome assembly and SNP discovery in California white oaks |
title | Evolutionary insights from de novo transcriptome assembly and SNP discovery in California white oaks |
title_full | Evolutionary insights from de novo transcriptome assembly and SNP discovery in California white oaks |
title_fullStr | Evolutionary insights from de novo transcriptome assembly and SNP discovery in California white oaks |
title_full_unstemmed | Evolutionary insights from de novo transcriptome assembly and SNP discovery in California white oaks |
title_short | Evolutionary insights from de novo transcriptome assembly and SNP discovery in California white oaks |
title_sort | evolutionary insights from de novo transcriptome assembly and snp discovery in california white oaks |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4517385/ https://www.ncbi.nlm.nih.gov/pubmed/26215102 http://dx.doi.org/10.1186/s12864-015-1761-4 |
work_keys_str_mv | AT cokusshawnj evolutionaryinsightsfromdenovotranscriptomeassemblyandsnpdiscoveryincaliforniawhiteoaks AT guggerpaulf evolutionaryinsightsfromdenovotranscriptomeassemblyandsnpdiscoveryincaliforniawhiteoaks AT sorkvictorial evolutionaryinsightsfromdenovotranscriptomeassemblyandsnpdiscoveryincaliforniawhiteoaks |