Cargando…
Cost-effective genome-wide estimation of allele frequencies from pooled DNA in Atlantic salmon (Salmo salar L.)
BACKGROUND: New sequencing technologies have tremendously increased the number of known molecular markers (single nucleotide polymorphisms; SNPs) in a variety of species. Concurrently, improvements to genotyping technology have now made it possible to efficiently genotype large numbers of genome-wid...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3575319/ https://www.ncbi.nlm.nih.gov/pubmed/23324082 http://dx.doi.org/10.1186/1471-2164-14-12 |
_version_ | 1782259700798062592 |
---|---|
author | Ozerov, Mikhail Vasemägi, Anti Wennevik, Vidar Niemelä, Eero Prusov, Sergey Kent, Matthew Vähä, Juha-Pekka |
author_facet | Ozerov, Mikhail Vasemägi, Anti Wennevik, Vidar Niemelä, Eero Prusov, Sergey Kent, Matthew Vähä, Juha-Pekka |
author_sort | Ozerov, Mikhail |
collection | PubMed |
description | BACKGROUND: New sequencing technologies have tremendously increased the number of known molecular markers (single nucleotide polymorphisms; SNPs) in a variety of species. Concurrently, improvements to genotyping technology have now made it possible to efficiently genotype large numbers of genome-wide distributed SNPs enabling genome wide association studies (GWAS). However, genotyping significant numbers of individuals with large number of SNPs remains prohibitively expensive for many research groups. A possible solution to this problem is to determine allele frequencies from pooled DNA samples, such ‘allelotyping’ has been presented as a cost-effective alternative to individual genotyping and has become popular in human GWAS. In this article we have tested the effectiveness of DNA pooling to obtain accurate allele frequency estimates for Atlantic salmon (Salmo salar L.) populations using an Illumina SNP-chip. RESULTS: In total, 56 Atlantic salmon DNA pools from 14 populations were analyzed on an Atlantic salmon SNP-chip containing probes for 5568 SNP markers, 3928 of which were bi-allelic. We developed an efficient quality control filter which enables exclusion of loci showing high error rate and minor allele frequency (MAF) close to zero. After applying multiple quality control filters we obtained allele frequency estimates for 3631 bi-allelic loci. We observed high concordance (r > 0.99) between allele frequency estimates derived from individual genotyping and DNA pools. Our results also indicate that even relatively small DNA pools (35 individuals) can provide accurate allele frequency estimates for a given sample. CONCLUSIONS: Despite of higher level of variation associated with array replicates compared to pool construction, we suggest that both sources of variation should be taken into account. This study demonstrates that DNA pooling allows fast and high-throughput determination of allele frequencies in Atlantic salmon enabling cost-efficient identification of informative markers for discrimination of populations at various geographical scales, as well as identification of loci controlling ecologically and economically important traits. |
format | Online Article Text |
id | pubmed-3575319 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-35753192013-02-22 Cost-effective genome-wide estimation of allele frequencies from pooled DNA in Atlantic salmon (Salmo salar L.) Ozerov, Mikhail Vasemägi, Anti Wennevik, Vidar Niemelä, Eero Prusov, Sergey Kent, Matthew Vähä, Juha-Pekka BMC Genomics Methodology Article BACKGROUND: New sequencing technologies have tremendously increased the number of known molecular markers (single nucleotide polymorphisms; SNPs) in a variety of species. Concurrently, improvements to genotyping technology have now made it possible to efficiently genotype large numbers of genome-wide distributed SNPs enabling genome wide association studies (GWAS). However, genotyping significant numbers of individuals with large number of SNPs remains prohibitively expensive for many research groups. A possible solution to this problem is to determine allele frequencies from pooled DNA samples, such ‘allelotyping’ has been presented as a cost-effective alternative to individual genotyping and has become popular in human GWAS. In this article we have tested the effectiveness of DNA pooling to obtain accurate allele frequency estimates for Atlantic salmon (Salmo salar L.) populations using an Illumina SNP-chip. RESULTS: In total, 56 Atlantic salmon DNA pools from 14 populations were analyzed on an Atlantic salmon SNP-chip containing probes for 5568 SNP markers, 3928 of which were bi-allelic. We developed an efficient quality control filter which enables exclusion of loci showing high error rate and minor allele frequency (MAF) close to zero. After applying multiple quality control filters we obtained allele frequency estimates for 3631 bi-allelic loci. We observed high concordance (r > 0.99) between allele frequency estimates derived from individual genotyping and DNA pools. Our results also indicate that even relatively small DNA pools (35 individuals) can provide accurate allele frequency estimates for a given sample. CONCLUSIONS: Despite of higher level of variation associated with array replicates compared to pool construction, we suggest that both sources of variation should be taken into account. This study demonstrates that DNA pooling allows fast and high-throughput determination of allele frequencies in Atlantic salmon enabling cost-efficient identification of informative markers for discrimination of populations at various geographical scales, as well as identification of loci controlling ecologically and economically important traits. BioMed Central 2013-01-16 /pmc/articles/PMC3575319/ /pubmed/23324082 http://dx.doi.org/10.1186/1471-2164-14-12 Text en Copyright ©2013 Ozerov et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Article Ozerov, Mikhail Vasemägi, Anti Wennevik, Vidar Niemelä, Eero Prusov, Sergey Kent, Matthew Vähä, Juha-Pekka Cost-effective genome-wide estimation of allele frequencies from pooled DNA in Atlantic salmon (Salmo salar L.) |
title | Cost-effective genome-wide estimation of allele frequencies from pooled DNA in Atlantic salmon (Salmo salar L.) |
title_full | Cost-effective genome-wide estimation of allele frequencies from pooled DNA in Atlantic salmon (Salmo salar L.) |
title_fullStr | Cost-effective genome-wide estimation of allele frequencies from pooled DNA in Atlantic salmon (Salmo salar L.) |
title_full_unstemmed | Cost-effective genome-wide estimation of allele frequencies from pooled DNA in Atlantic salmon (Salmo salar L.) |
title_short | Cost-effective genome-wide estimation of allele frequencies from pooled DNA in Atlantic salmon (Salmo salar L.) |
title_sort | cost-effective genome-wide estimation of allele frequencies from pooled dna in atlantic salmon (salmo salar l.) |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3575319/ https://www.ncbi.nlm.nih.gov/pubmed/23324082 http://dx.doi.org/10.1186/1471-2164-14-12 |
work_keys_str_mv | AT ozerovmikhail costeffectivegenomewideestimationofallelefrequenciesfrompooleddnainatlanticsalmonsalmosalarl AT vasemagianti costeffectivegenomewideestimationofallelefrequenciesfrompooleddnainatlanticsalmonsalmosalarl AT wennevikvidar costeffectivegenomewideestimationofallelefrequenciesfrompooleddnainatlanticsalmonsalmosalarl AT niemelaeero costeffectivegenomewideestimationofallelefrequenciesfrompooleddnainatlanticsalmonsalmosalarl AT prusovsergey costeffectivegenomewideestimationofallelefrequenciesfrompooleddnainatlanticsalmonsalmosalarl AT kentmatthew costeffectivegenomewideestimationofallelefrequenciesfrompooleddnainatlanticsalmonsalmosalarl AT vahajuhapekka costeffectivegenomewideestimationofallelefrequenciesfrompooleddnainatlanticsalmonsalmosalarl |