Cargando…
Quantitative analysis of low-density SNP data for parentage assignment and estimation of family contributions to pooled samples
BACKGROUND: While much attention has focused on the development of high-density single nucleotide polymorphism (SNP) assays, the costs of developing and running low-density assays have fallen dramatically. This makes it feasible to develop and apply SNP assays for agricultural species beyond the maj...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4244062/ https://www.ncbi.nlm.nih.gov/pubmed/25183297 http://dx.doi.org/10.1186/s12711-014-0051-y |
_version_ | 1782346179474882560 |
---|---|
author | Henshall, John M Dierens, Leanne Sellars, Melony J |
author_facet | Henshall, John M Dierens, Leanne Sellars, Melony J |
author_sort | Henshall, John M |
collection | PubMed |
description | BACKGROUND: While much attention has focused on the development of high-density single nucleotide polymorphism (SNP) assays, the costs of developing and running low-density assays have fallen dramatically. This makes it feasible to develop and apply SNP assays for agricultural species beyond the major livestock species. Although low-cost low-density assays may not have the accuracy of the high-density assays widely used in human and livestock species, we show that when combined with statistical analysis approaches that use quantitative instead of discrete genotypes, their utility may be improved. The data used in this study are from a 63-SNP marker Sequenom® iPLEX Platinum panel for the Black Tiger shrimp, for which high-density SNP assays are not currently available. RESULTS: For quantitative genotypes that could be estimated, in 5% of cases the most likely genotype for an individual at a SNP had a probability of less than 0.99. Matrix formulations of maximum likelihood equations for parentage assignment were developed for the quantitative genotypes and also for discrete genotypes perturbed by an assumed error term. Assignment rates that were based on maximum likelihood with quantitative genotypes were similar to those based on maximum likelihood with perturbed genotypes but, for more than 50% of cases, the two methods resulted in individuals being assigned to different families. Treating genotypes as quantitative values allows the same analysis framework to be used for pooled samples of DNA from multiple individuals. Resulting correlations between allele frequency estimates from pooled DNA and individual samples were consistently greater than 0.90, and as high as 0.97 for some pools. Estimates of family contributions to the pools based on quantitative genotypes in pooled DNA had a correlation of 0.85 with estimates of contributions from DNA-derived pedigree. CONCLUSIONS: Even with low numbers of SNPs of variable quality, parentage testing and family assignment from pooled samples are sufficiently accurate to provide useful information for a breeding program. Treating genotypes as quantitative values is an alternative to perturbing genotypes using an assumed error distribution, but can produce very different results. An understanding of the distribution of the error is required for SNP genotyping platforms. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12711-014-0051-y) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4244062 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-42440622014-11-28 Quantitative analysis of low-density SNP data for parentage assignment and estimation of family contributions to pooled samples Henshall, John M Dierens, Leanne Sellars, Melony J Genet Sel Evol Research BACKGROUND: While much attention has focused on the development of high-density single nucleotide polymorphism (SNP) assays, the costs of developing and running low-density assays have fallen dramatically. This makes it feasible to develop and apply SNP assays for agricultural species beyond the major livestock species. Although low-cost low-density assays may not have the accuracy of the high-density assays widely used in human and livestock species, we show that when combined with statistical analysis approaches that use quantitative instead of discrete genotypes, their utility may be improved. The data used in this study are from a 63-SNP marker Sequenom® iPLEX Platinum panel for the Black Tiger shrimp, for which high-density SNP assays are not currently available. RESULTS: For quantitative genotypes that could be estimated, in 5% of cases the most likely genotype for an individual at a SNP had a probability of less than 0.99. Matrix formulations of maximum likelihood equations for parentage assignment were developed for the quantitative genotypes and also for discrete genotypes perturbed by an assumed error term. Assignment rates that were based on maximum likelihood with quantitative genotypes were similar to those based on maximum likelihood with perturbed genotypes but, for more than 50% of cases, the two methods resulted in individuals being assigned to different families. Treating genotypes as quantitative values allows the same analysis framework to be used for pooled samples of DNA from multiple individuals. Resulting correlations between allele frequency estimates from pooled DNA and individual samples were consistently greater than 0.90, and as high as 0.97 for some pools. Estimates of family contributions to the pools based on quantitative genotypes in pooled DNA had a correlation of 0.85 with estimates of contributions from DNA-derived pedigree. CONCLUSIONS: Even with low numbers of SNPs of variable quality, parentage testing and family assignment from pooled samples are sufficiently accurate to provide useful information for a breeding program. Treating genotypes as quantitative values is an alternative to perturbing genotypes using an assumed error distribution, but can produce very different results. An understanding of the distribution of the error is required for SNP genotyping platforms. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12711-014-0051-y) contains supplementary material, which is available to authorized users. BioMed Central 2014-09-02 /pmc/articles/PMC4244062/ /pubmed/25183297 http://dx.doi.org/10.1186/s12711-014-0051-y Text en © Henshall et al.; licensee BioMed Central 2014 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. |
spellingShingle | Research Henshall, John M Dierens, Leanne Sellars, Melony J Quantitative analysis of low-density SNP data for parentage assignment and estimation of family contributions to pooled samples |
title | Quantitative analysis of low-density SNP data for parentage assignment and estimation of family contributions to pooled samples |
title_full | Quantitative analysis of low-density SNP data for parentage assignment and estimation of family contributions to pooled samples |
title_fullStr | Quantitative analysis of low-density SNP data for parentage assignment and estimation of family contributions to pooled samples |
title_full_unstemmed | Quantitative analysis of low-density SNP data for parentage assignment and estimation of family contributions to pooled samples |
title_short | Quantitative analysis of low-density SNP data for parentage assignment and estimation of family contributions to pooled samples |
title_sort | quantitative analysis of low-density snp data for parentage assignment and estimation of family contributions to pooled samples |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4244062/ https://www.ncbi.nlm.nih.gov/pubmed/25183297 http://dx.doi.org/10.1186/s12711-014-0051-y |
work_keys_str_mv | AT henshalljohnm quantitativeanalysisoflowdensitysnpdataforparentageassignmentandestimationoffamilycontributionstopooledsamples AT dierensleanne quantitativeanalysisoflowdensitysnpdataforparentageassignmentandestimationoffamilycontributionstopooledsamples AT sellarsmelonyj quantitativeanalysisoflowdensitysnpdataforparentageassignmentandestimationoffamilycontributionstopooledsamples |