Cargando…

Quantitative analysis of low-density SNP data for parentage assignment and estimation of family contributions to pooled samples

BACKGROUND: While much attention has focused on the development of high-density single nucleotide polymorphism (SNP) assays, the costs of developing and running low-density assays have fallen dramatically. This makes it feasible to develop and apply SNP assays for agricultural species beyond the maj...

Descripción completa

Detalles Bibliográficos
Autores principales: Henshall, John M, Dierens, Leanne, Sellars, Melony J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4244062/
https://www.ncbi.nlm.nih.gov/pubmed/25183297
http://dx.doi.org/10.1186/s12711-014-0051-y
_version_ 1782346179474882560
author Henshall, John M
Dierens, Leanne
Sellars, Melony J
author_facet Henshall, John M
Dierens, Leanne
Sellars, Melony J
author_sort Henshall, John M
collection PubMed
description BACKGROUND: While much attention has focused on the development of high-density single nucleotide polymorphism (SNP) assays, the costs of developing and running low-density assays have fallen dramatically. This makes it feasible to develop and apply SNP assays for agricultural species beyond the major livestock species. Although low-cost low-density assays may not have the accuracy of the high-density assays widely used in human and livestock species, we show that when combined with statistical analysis approaches that use quantitative instead of discrete genotypes, their utility may be improved. The data used in this study are from a 63-SNP marker Sequenom® iPLEX Platinum panel for the Black Tiger shrimp, for which high-density SNP assays are not currently available. RESULTS: For quantitative genotypes that could be estimated, in 5% of cases the most likely genotype for an individual at a SNP had a probability of less than 0.99. Matrix formulations of maximum likelihood equations for parentage assignment were developed for the quantitative genotypes and also for discrete genotypes perturbed by an assumed error term. Assignment rates that were based on maximum likelihood with quantitative genotypes were similar to those based on maximum likelihood with perturbed genotypes but, for more than 50% of cases, the two methods resulted in individuals being assigned to different families. Treating genotypes as quantitative values allows the same analysis framework to be used for pooled samples of DNA from multiple individuals. Resulting correlations between allele frequency estimates from pooled DNA and individual samples were consistently greater than 0.90, and as high as 0.97 for some pools. Estimates of family contributions to the pools based on quantitative genotypes in pooled DNA had a correlation of 0.85 with estimates of contributions from DNA-derived pedigree. CONCLUSIONS: Even with low numbers of SNPs of variable quality, parentage testing and family assignment from pooled samples are sufficiently accurate to provide useful information for a breeding program. Treating genotypes as quantitative values is an alternative to perturbing genotypes using an assumed error distribution, but can produce very different results. An understanding of the distribution of the error is required for SNP genotyping platforms. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12711-014-0051-y) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4244062
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42440622014-11-28 Quantitative analysis of low-density SNP data for parentage assignment and estimation of family contributions to pooled samples Henshall, John M Dierens, Leanne Sellars, Melony J Genet Sel Evol Research BACKGROUND: While much attention has focused on the development of high-density single nucleotide polymorphism (SNP) assays, the costs of developing and running low-density assays have fallen dramatically. This makes it feasible to develop and apply SNP assays for agricultural species beyond the major livestock species. Although low-cost low-density assays may not have the accuracy of the high-density assays widely used in human and livestock species, we show that when combined with statistical analysis approaches that use quantitative instead of discrete genotypes, their utility may be improved. The data used in this study are from a 63-SNP marker Sequenom® iPLEX Platinum panel for the Black Tiger shrimp, for which high-density SNP assays are not currently available. RESULTS: For quantitative genotypes that could be estimated, in 5% of cases the most likely genotype for an individual at a SNP had a probability of less than 0.99. Matrix formulations of maximum likelihood equations for parentage assignment were developed for the quantitative genotypes and also for discrete genotypes perturbed by an assumed error term. Assignment rates that were based on maximum likelihood with quantitative genotypes were similar to those based on maximum likelihood with perturbed genotypes but, for more than 50% of cases, the two methods resulted in individuals being assigned to different families. Treating genotypes as quantitative values allows the same analysis framework to be used for pooled samples of DNA from multiple individuals. Resulting correlations between allele frequency estimates from pooled DNA and individual samples were consistently greater than 0.90, and as high as 0.97 for some pools. Estimates of family contributions to the pools based on quantitative genotypes in pooled DNA had a correlation of 0.85 with estimates of contributions from DNA-derived pedigree. CONCLUSIONS: Even with low numbers of SNPs of variable quality, parentage testing and family assignment from pooled samples are sufficiently accurate to provide useful information for a breeding program. Treating genotypes as quantitative values is an alternative to perturbing genotypes using an assumed error distribution, but can produce very different results. An understanding of the distribution of the error is required for SNP genotyping platforms. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12711-014-0051-y) contains supplementary material, which is available to authorized users. BioMed Central 2014-09-02 /pmc/articles/PMC4244062/ /pubmed/25183297 http://dx.doi.org/10.1186/s12711-014-0051-y Text en © Henshall et al.; licensee BioMed Central 2014 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited.
spellingShingle Research
Henshall, John M
Dierens, Leanne
Sellars, Melony J
Quantitative analysis of low-density SNP data for parentage assignment and estimation of family contributions to pooled samples
title Quantitative analysis of low-density SNP data for parentage assignment and estimation of family contributions to pooled samples
title_full Quantitative analysis of low-density SNP data for parentage assignment and estimation of family contributions to pooled samples
title_fullStr Quantitative analysis of low-density SNP data for parentage assignment and estimation of family contributions to pooled samples
title_full_unstemmed Quantitative analysis of low-density SNP data for parentage assignment and estimation of family contributions to pooled samples
title_short Quantitative analysis of low-density SNP data for parentage assignment and estimation of family contributions to pooled samples
title_sort quantitative analysis of low-density snp data for parentage assignment and estimation of family contributions to pooled samples
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4244062/
https://www.ncbi.nlm.nih.gov/pubmed/25183297
http://dx.doi.org/10.1186/s12711-014-0051-y
work_keys_str_mv AT henshalljohnm quantitativeanalysisoflowdensitysnpdataforparentageassignmentandestimationoffamilycontributionstopooledsamples
AT dierensleanne quantitativeanalysisoflowdensitysnpdataforparentageassignmentandestimationoffamilycontributionstopooledsamples
AT sellarsmelonyj quantitativeanalysisoflowdensitysnpdataforparentageassignmentandestimationoffamilycontributionstopooledsamples