Cargando…

Comparative Subsequence Sets Analysis (CoSSA) is a robust approach to identify haplotype specific SNPs; mapping and pedigree analysis of a potato wart disease resistance gene Sen3

BACKGROUND: Standard strategies to identify genomic regions involved in a specific trait variation are often limited by time and resource consuming genotyping methods. Other limiting pre-requisites are the phenotyping of large segregating populations or of diversity panels and the availability and q...

Descripción completa

Detalles Bibliográficos
Autores principales: Prodhomme, Charlotte, Esselink, Danny, Borm, Theo, Visser, Richard G. F., van Eck, Herman J., Vossen, Jack H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6540404/
https://www.ncbi.nlm.nih.gov/pubmed/31160919
http://dx.doi.org/10.1186/s13007-019-0445-5
_version_ 1783422608955408384
author Prodhomme, Charlotte
Esselink, Danny
Borm, Theo
Visser, Richard G. F.
van Eck, Herman J.
Vossen, Jack H.
author_facet Prodhomme, Charlotte
Esselink, Danny
Borm, Theo
Visser, Richard G. F.
van Eck, Herman J.
Vossen, Jack H.
author_sort Prodhomme, Charlotte
collection PubMed
description BACKGROUND: Standard strategies to identify genomic regions involved in a specific trait variation are often limited by time and resource consuming genotyping methods. Other limiting pre-requisites are the phenotyping of large segregating populations or of diversity panels and the availability and quality of a closely related reference genome. To overcome these limitations, we designed efficient Comparative Subsequence Sets Analysis (CoSSA) workflows to identify haplotype specific SNPs linked to a trait of interest from Whole Genome Sequencing data. RESULTS: As a model, we used the resistance to Synchytrium endobioticum pathotypes 2, 6 and 18 that co-segregated in a tetraploid full sib population. Genomic DNA from both parents, pedigree genotypes, unrelated potato varieties lacking the wart resistance traits and pools of resistant and susceptible siblings were sequenced. Set algebra and depth filtering of subsequences (k-mers) were used to delete unlinked and common SNPs and to enrich for SNPs from the haplotype(s) harboring the resistance gene(s). Using CoSSA, we identified a major and a minor effect locus. Upon comparison to the reference genome, it was inferred that the major resistance locus, referred to as Sen3, was located on the north arm of chromosome 11 between 1,259,552 and 1,519,485 bp. Furthermore, we could anchor the unanchored superscaffold DMB734 from the potato reference genome to a synthenous interval. CoSSA was also successful in identifying Sen3 in a reference genome independent way thanks to the de novo assembly of paired end reads matching haplotype specific k-mers. The de novo assembly provided more R haplotype specific polymorphisms than the reference genome corresponding region. CoSSA also offers possibilities for pedigree analysis. The origin of Sen3 was traced back until Ora. Finally, the diagnostic power of the haplotype specific markers was shown using a panel of 56 tetraploid varieties. CONCLUSIONS: CoSSA is an efficient, robust and versatile set of workflows for the genetic analysis of a trait of interest using WGS data. Because the WGS data are used without intermediate reads mapping, CoSSA does not require the use of a reference genome. This approach allowed the identification of Sen3 and the design of haplotype specific, diagnostic markers. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13007-019-0445-5) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6540404
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-65404042019-06-03 Comparative Subsequence Sets Analysis (CoSSA) is a robust approach to identify haplotype specific SNPs; mapping and pedigree analysis of a potato wart disease resistance gene Sen3 Prodhomme, Charlotte Esselink, Danny Borm, Theo Visser, Richard G. F. van Eck, Herman J. Vossen, Jack H. Plant Methods Research BACKGROUND: Standard strategies to identify genomic regions involved in a specific trait variation are often limited by time and resource consuming genotyping methods. Other limiting pre-requisites are the phenotyping of large segregating populations or of diversity panels and the availability and quality of a closely related reference genome. To overcome these limitations, we designed efficient Comparative Subsequence Sets Analysis (CoSSA) workflows to identify haplotype specific SNPs linked to a trait of interest from Whole Genome Sequencing data. RESULTS: As a model, we used the resistance to Synchytrium endobioticum pathotypes 2, 6 and 18 that co-segregated in a tetraploid full sib population. Genomic DNA from both parents, pedigree genotypes, unrelated potato varieties lacking the wart resistance traits and pools of resistant and susceptible siblings were sequenced. Set algebra and depth filtering of subsequences (k-mers) were used to delete unlinked and common SNPs and to enrich for SNPs from the haplotype(s) harboring the resistance gene(s). Using CoSSA, we identified a major and a minor effect locus. Upon comparison to the reference genome, it was inferred that the major resistance locus, referred to as Sen3, was located on the north arm of chromosome 11 between 1,259,552 and 1,519,485 bp. Furthermore, we could anchor the unanchored superscaffold DMB734 from the potato reference genome to a synthenous interval. CoSSA was also successful in identifying Sen3 in a reference genome independent way thanks to the de novo assembly of paired end reads matching haplotype specific k-mers. The de novo assembly provided more R haplotype specific polymorphisms than the reference genome corresponding region. CoSSA also offers possibilities for pedigree analysis. The origin of Sen3 was traced back until Ora. Finally, the diagnostic power of the haplotype specific markers was shown using a panel of 56 tetraploid varieties. CONCLUSIONS: CoSSA is an efficient, robust and versatile set of workflows for the genetic analysis of a trait of interest using WGS data. Because the WGS data are used without intermediate reads mapping, CoSSA does not require the use of a reference genome. This approach allowed the identification of Sen3 and the design of haplotype specific, diagnostic markers. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13007-019-0445-5) contains supplementary material, which is available to authorized users. BioMed Central 2019-05-29 /pmc/articles/PMC6540404/ /pubmed/31160919 http://dx.doi.org/10.1186/s13007-019-0445-5 Text en © The Author(s) 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Prodhomme, Charlotte
Esselink, Danny
Borm, Theo
Visser, Richard G. F.
van Eck, Herman J.
Vossen, Jack H.
Comparative Subsequence Sets Analysis (CoSSA) is a robust approach to identify haplotype specific SNPs; mapping and pedigree analysis of a potato wart disease resistance gene Sen3
title Comparative Subsequence Sets Analysis (CoSSA) is a robust approach to identify haplotype specific SNPs; mapping and pedigree analysis of a potato wart disease resistance gene Sen3
title_full Comparative Subsequence Sets Analysis (CoSSA) is a robust approach to identify haplotype specific SNPs; mapping and pedigree analysis of a potato wart disease resistance gene Sen3
title_fullStr Comparative Subsequence Sets Analysis (CoSSA) is a robust approach to identify haplotype specific SNPs; mapping and pedigree analysis of a potato wart disease resistance gene Sen3
title_full_unstemmed Comparative Subsequence Sets Analysis (CoSSA) is a robust approach to identify haplotype specific SNPs; mapping and pedigree analysis of a potato wart disease resistance gene Sen3
title_short Comparative Subsequence Sets Analysis (CoSSA) is a robust approach to identify haplotype specific SNPs; mapping and pedigree analysis of a potato wart disease resistance gene Sen3
title_sort comparative subsequence sets analysis (cossa) is a robust approach to identify haplotype specific snps; mapping and pedigree analysis of a potato wart disease resistance gene sen3
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6540404/
https://www.ncbi.nlm.nih.gov/pubmed/31160919
http://dx.doi.org/10.1186/s13007-019-0445-5
work_keys_str_mv AT prodhommecharlotte comparativesubsequencesetsanalysiscossaisarobustapproachtoidentifyhaplotypespecificsnpsmappingandpedigreeanalysisofapotatowartdiseaseresistancegenesen3
AT esselinkdanny comparativesubsequencesetsanalysiscossaisarobustapproachtoidentifyhaplotypespecificsnpsmappingandpedigreeanalysisofapotatowartdiseaseresistancegenesen3
AT bormtheo comparativesubsequencesetsanalysiscossaisarobustapproachtoidentifyhaplotypespecificsnpsmappingandpedigreeanalysisofapotatowartdiseaseresistancegenesen3
AT visserrichardgf comparativesubsequencesetsanalysiscossaisarobustapproachtoidentifyhaplotypespecificsnpsmappingandpedigreeanalysisofapotatowartdiseaseresistancegenesen3
AT vaneckhermanj comparativesubsequencesetsanalysiscossaisarobustapproachtoidentifyhaplotypespecificsnpsmappingandpedigreeanalysisofapotatowartdiseaseresistancegenesen3
AT vossenjackh comparativesubsequencesetsanalysiscossaisarobustapproachtoidentifyhaplotypespecificsnpsmappingandpedigreeanalysisofapotatowartdiseaseresistancegenesen3