Cargando…
A hybrid next generation transcript sequencing-based approach to identify allelic and homeolog-specific single nucleotide polymorphisms in allotetraploid white clover
BACKGROUND: White clover (Trifolium repens L.) is an allotetraploid species possessing two highly collinear ancestral sub-genomes. The apparent existence of highly similar homeolog copies for the majority of genes in white clover is problematic for the development of genome-based resources in the sp...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3727989/ https://www.ncbi.nlm.nih.gov/pubmed/23402685 http://dx.doi.org/10.1186/1471-2164-14-100 |
_version_ | 1782278787663134720 |
---|---|
author | Nagy, Istvan Barth, Susanne Mehenni-Ciz, Jeanne Abberton, Michael T Milbourne, Dan |
author_facet | Nagy, Istvan Barth, Susanne Mehenni-Ciz, Jeanne Abberton, Michael T Milbourne, Dan |
author_sort | Nagy, Istvan |
collection | PubMed |
description | BACKGROUND: White clover (Trifolium repens L.) is an allotetraploid species possessing two highly collinear ancestral sub-genomes. The apparent existence of highly similar homeolog copies for the majority of genes in white clover is problematic for the development of genome-based resources in the species. This is especially true for the development of genetic markers based on single nucleotide polymorphisms (SNPs), since it is difficult to distinguish between homeolog-specific and allelic variants. Robust methods for categorising single nucleotide variants as allelic or homeolog-specific in large transcript datasets are required. We illustrate one potential approach in this study. RESULTS: We used 454-pyrosequencing sequencing to generate ~760,000 transcript sequences from an 8th generation white clover inbred line. These were assembled and partially annotated to yield a reference transcript set comprising 71,545 sequences. We subsequently performed Illumina sequencing on three further white clover samples, generating 14 million transcript reads from a mixed sample comprising 24 divergent white clover genotypes, and 50 million reads on two further eighth generation white clover inbred lines. Mapping these reads to the reference transcript set allowed us to develop a significant SNP resource for white clover, and to partition the SNPs from the inbred lines into categories reflecting allelic or homeolog-specific variation. The potential for using haplotype reconstruction and progenitor genome comparison to assign haplotypes to specific ancestral sub-genomes of white clover is demonstrated for sequences corresponding to genes encoding dehydration responsive element binding protein and acyl-coA oxidase. CONCLUSIONS: In total, 208,854 independent SNPs in 31,715 reference sequences were discovered, approximately three quarters of which were categorised as representing allelic or homeolog-specific variation using two inbred lines. This represents a significant resource for white clover genomics and genetics studies. We discuss the potential to extend the analysis to identify a “core set” of ancestrally derived homeolog specific variants in white clover. |
format | Online Article Text |
id | pubmed-3727989 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-37279892013-07-31 A hybrid next generation transcript sequencing-based approach to identify allelic and homeolog-specific single nucleotide polymorphisms in allotetraploid white clover Nagy, Istvan Barth, Susanne Mehenni-Ciz, Jeanne Abberton, Michael T Milbourne, Dan BMC Genomics Research Article BACKGROUND: White clover (Trifolium repens L.) is an allotetraploid species possessing two highly collinear ancestral sub-genomes. The apparent existence of highly similar homeolog copies for the majority of genes in white clover is problematic for the development of genome-based resources in the species. This is especially true for the development of genetic markers based on single nucleotide polymorphisms (SNPs), since it is difficult to distinguish between homeolog-specific and allelic variants. Robust methods for categorising single nucleotide variants as allelic or homeolog-specific in large transcript datasets are required. We illustrate one potential approach in this study. RESULTS: We used 454-pyrosequencing sequencing to generate ~760,000 transcript sequences from an 8th generation white clover inbred line. These were assembled and partially annotated to yield a reference transcript set comprising 71,545 sequences. We subsequently performed Illumina sequencing on three further white clover samples, generating 14 million transcript reads from a mixed sample comprising 24 divergent white clover genotypes, and 50 million reads on two further eighth generation white clover inbred lines. Mapping these reads to the reference transcript set allowed us to develop a significant SNP resource for white clover, and to partition the SNPs from the inbred lines into categories reflecting allelic or homeolog-specific variation. The potential for using haplotype reconstruction and progenitor genome comparison to assign haplotypes to specific ancestral sub-genomes of white clover is demonstrated for sequences corresponding to genes encoding dehydration responsive element binding protein and acyl-coA oxidase. CONCLUSIONS: In total, 208,854 independent SNPs in 31,715 reference sequences were discovered, approximately three quarters of which were categorised as representing allelic or homeolog-specific variation using two inbred lines. This represents a significant resource for white clover genomics and genetics studies. We discuss the potential to extend the analysis to identify a “core set” of ancestrally derived homeolog specific variants in white clover. BioMed Central 2013-02-13 /pmc/articles/PMC3727989/ /pubmed/23402685 http://dx.doi.org/10.1186/1471-2164-14-100 Text en Copyright © 2013 Nagy et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Nagy, Istvan Barth, Susanne Mehenni-Ciz, Jeanne Abberton, Michael T Milbourne, Dan A hybrid next generation transcript sequencing-based approach to identify allelic and homeolog-specific single nucleotide polymorphisms in allotetraploid white clover |
title | A hybrid next generation transcript sequencing-based approach to identify allelic and homeolog-specific single nucleotide polymorphisms in allotetraploid white clover |
title_full | A hybrid next generation transcript sequencing-based approach to identify allelic and homeolog-specific single nucleotide polymorphisms in allotetraploid white clover |
title_fullStr | A hybrid next generation transcript sequencing-based approach to identify allelic and homeolog-specific single nucleotide polymorphisms in allotetraploid white clover |
title_full_unstemmed | A hybrid next generation transcript sequencing-based approach to identify allelic and homeolog-specific single nucleotide polymorphisms in allotetraploid white clover |
title_short | A hybrid next generation transcript sequencing-based approach to identify allelic and homeolog-specific single nucleotide polymorphisms in allotetraploid white clover |
title_sort | hybrid next generation transcript sequencing-based approach to identify allelic and homeolog-specific single nucleotide polymorphisms in allotetraploid white clover |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3727989/ https://www.ncbi.nlm.nih.gov/pubmed/23402685 http://dx.doi.org/10.1186/1471-2164-14-100 |
work_keys_str_mv | AT nagyistvan ahybridnextgenerationtranscriptsequencingbasedapproachtoidentifyallelicandhomeologspecificsinglenucleotidepolymorphismsinallotetraploidwhiteclover AT barthsusanne ahybridnextgenerationtranscriptsequencingbasedapproachtoidentifyallelicandhomeologspecificsinglenucleotidepolymorphismsinallotetraploidwhiteclover AT mehennicizjeanne ahybridnextgenerationtranscriptsequencingbasedapproachtoidentifyallelicandhomeologspecificsinglenucleotidepolymorphismsinallotetraploidwhiteclover AT abbertonmichaelt ahybridnextgenerationtranscriptsequencingbasedapproachtoidentifyallelicandhomeologspecificsinglenucleotidepolymorphismsinallotetraploidwhiteclover AT milbournedan ahybridnextgenerationtranscriptsequencingbasedapproachtoidentifyallelicandhomeologspecificsinglenucleotidepolymorphismsinallotetraploidwhiteclover AT nagyistvan hybridnextgenerationtranscriptsequencingbasedapproachtoidentifyallelicandhomeologspecificsinglenucleotidepolymorphismsinallotetraploidwhiteclover AT barthsusanne hybridnextgenerationtranscriptsequencingbasedapproachtoidentifyallelicandhomeologspecificsinglenucleotidepolymorphismsinallotetraploidwhiteclover AT mehennicizjeanne hybridnextgenerationtranscriptsequencingbasedapproachtoidentifyallelicandhomeologspecificsinglenucleotidepolymorphismsinallotetraploidwhiteclover AT abbertonmichaelt hybridnextgenerationtranscriptsequencingbasedapproachtoidentifyallelicandhomeologspecificsinglenucleotidepolymorphismsinallotetraploidwhiteclover AT milbournedan hybridnextgenerationtranscriptsequencingbasedapproachtoidentifyallelicandhomeologspecificsinglenucleotidepolymorphismsinallotetraploidwhiteclover |