Cargando…

Cost-effective genome-wide estimation of allele frequencies from pooled DNA in Atlantic salmon (Salmo salar L.)

BACKGROUND: New sequencing technologies have tremendously increased the number of known molecular markers (single nucleotide polymorphisms; SNPs) in a variety of species. Concurrently, improvements to genotyping technology have now made it possible to efficiently genotype large numbers of genome-wid...

Descripción completa

Detalles Bibliográficos
Autores principales: Ozerov, Mikhail, Vasemägi, Anti, Wennevik, Vidar, Niemelä, Eero, Prusov, Sergey, Kent, Matthew, Vähä, Juha-Pekka
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3575319/
https://www.ncbi.nlm.nih.gov/pubmed/23324082
http://dx.doi.org/10.1186/1471-2164-14-12
_version_ 1782259700798062592
author Ozerov, Mikhail
Vasemägi, Anti
Wennevik, Vidar
Niemelä, Eero
Prusov, Sergey
Kent, Matthew
Vähä, Juha-Pekka
author_facet Ozerov, Mikhail
Vasemägi, Anti
Wennevik, Vidar
Niemelä, Eero
Prusov, Sergey
Kent, Matthew
Vähä, Juha-Pekka
author_sort Ozerov, Mikhail
collection PubMed
description BACKGROUND: New sequencing technologies have tremendously increased the number of known molecular markers (single nucleotide polymorphisms; SNPs) in a variety of species. Concurrently, improvements to genotyping technology have now made it possible to efficiently genotype large numbers of genome-wide distributed SNPs enabling genome wide association studies (GWAS). However, genotyping significant numbers of individuals with large number of SNPs remains prohibitively expensive for many research groups. A possible solution to this problem is to determine allele frequencies from pooled DNA samples, such ‘allelotyping’ has been presented as a cost-effective alternative to individual genotyping and has become popular in human GWAS. In this article we have tested the effectiveness of DNA pooling to obtain accurate allele frequency estimates for Atlantic salmon (Salmo salar L.) populations using an Illumina SNP-chip. RESULTS: In total, 56 Atlantic salmon DNA pools from 14 populations were analyzed on an Atlantic salmon SNP-chip containing probes for 5568 SNP markers, 3928 of which were bi-allelic. We developed an efficient quality control filter which enables exclusion of loci showing high error rate and minor allele frequency (MAF) close to zero. After applying multiple quality control filters we obtained allele frequency estimates for 3631 bi-allelic loci. We observed high concordance (r > 0.99) between allele frequency estimates derived from individual genotyping and DNA pools. Our results also indicate that even relatively small DNA pools (35 individuals) can provide accurate allele frequency estimates for a given sample. CONCLUSIONS: Despite of higher level of variation associated with array replicates compared to pool construction, we suggest that both sources of variation should be taken into account. This study demonstrates that DNA pooling allows fast and high-throughput determination of allele frequencies in Atlantic salmon enabling cost-efficient identification of informative markers for discrimination of populations at various geographical scales, as well as identification of loci controlling ecologically and economically important traits.
format Online
Article
Text
id pubmed-3575319
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35753192013-02-22 Cost-effective genome-wide estimation of allele frequencies from pooled DNA in Atlantic salmon (Salmo salar L.) Ozerov, Mikhail Vasemägi, Anti Wennevik, Vidar Niemelä, Eero Prusov, Sergey Kent, Matthew Vähä, Juha-Pekka BMC Genomics Methodology Article BACKGROUND: New sequencing technologies have tremendously increased the number of known molecular markers (single nucleotide polymorphisms; SNPs) in a variety of species. Concurrently, improvements to genotyping technology have now made it possible to efficiently genotype large numbers of genome-wide distributed SNPs enabling genome wide association studies (GWAS). However, genotyping significant numbers of individuals with large number of SNPs remains prohibitively expensive for many research groups. A possible solution to this problem is to determine allele frequencies from pooled DNA samples, such ‘allelotyping’ has been presented as a cost-effective alternative to individual genotyping and has become popular in human GWAS. In this article we have tested the effectiveness of DNA pooling to obtain accurate allele frequency estimates for Atlantic salmon (Salmo salar L.) populations using an Illumina SNP-chip. RESULTS: In total, 56 Atlantic salmon DNA pools from 14 populations were analyzed on an Atlantic salmon SNP-chip containing probes for 5568 SNP markers, 3928 of which were bi-allelic. We developed an efficient quality control filter which enables exclusion of loci showing high error rate and minor allele frequency (MAF) close to zero. After applying multiple quality control filters we obtained allele frequency estimates for 3631 bi-allelic loci. We observed high concordance (r > 0.99) between allele frequency estimates derived from individual genotyping and DNA pools. Our results also indicate that even relatively small DNA pools (35 individuals) can provide accurate allele frequency estimates for a given sample. CONCLUSIONS: Despite of higher level of variation associated with array replicates compared to pool construction, we suggest that both sources of variation should be taken into account. This study demonstrates that DNA pooling allows fast and high-throughput determination of allele frequencies in Atlantic salmon enabling cost-efficient identification of informative markers for discrimination of populations at various geographical scales, as well as identification of loci controlling ecologically and economically important traits. BioMed Central 2013-01-16 /pmc/articles/PMC3575319/ /pubmed/23324082 http://dx.doi.org/10.1186/1471-2164-14-12 Text en Copyright ©2013 Ozerov et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Ozerov, Mikhail
Vasemägi, Anti
Wennevik, Vidar
Niemelä, Eero
Prusov, Sergey
Kent, Matthew
Vähä, Juha-Pekka
Cost-effective genome-wide estimation of allele frequencies from pooled DNA in Atlantic salmon (Salmo salar L.)
title Cost-effective genome-wide estimation of allele frequencies from pooled DNA in Atlantic salmon (Salmo salar L.)
title_full Cost-effective genome-wide estimation of allele frequencies from pooled DNA in Atlantic salmon (Salmo salar L.)
title_fullStr Cost-effective genome-wide estimation of allele frequencies from pooled DNA in Atlantic salmon (Salmo salar L.)
title_full_unstemmed Cost-effective genome-wide estimation of allele frequencies from pooled DNA in Atlantic salmon (Salmo salar L.)
title_short Cost-effective genome-wide estimation of allele frequencies from pooled DNA in Atlantic salmon (Salmo salar L.)
title_sort cost-effective genome-wide estimation of allele frequencies from pooled dna in atlantic salmon (salmo salar l.)
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3575319/
https://www.ncbi.nlm.nih.gov/pubmed/23324082
http://dx.doi.org/10.1186/1471-2164-14-12
work_keys_str_mv AT ozerovmikhail costeffectivegenomewideestimationofallelefrequenciesfrompooleddnainatlanticsalmonsalmosalarl
AT vasemagianti costeffectivegenomewideestimationofallelefrequenciesfrompooleddnainatlanticsalmonsalmosalarl
AT wennevikvidar costeffectivegenomewideestimationofallelefrequenciesfrompooleddnainatlanticsalmonsalmosalarl
AT niemelaeero costeffectivegenomewideestimationofallelefrequenciesfrompooleddnainatlanticsalmonsalmosalarl
AT prusovsergey costeffectivegenomewideestimationofallelefrequenciesfrompooleddnainatlanticsalmonsalmosalarl
AT kentmatthew costeffectivegenomewideestimationofallelefrequenciesfrompooleddnainatlanticsalmonsalmosalarl
AT vahajuhapekka costeffectivegenomewideestimationofallelefrequenciesfrompooleddnainatlanticsalmonsalmosalarl