Cargando…

SNP detection and prediction of variability between chicken lines using genome resequencing of DNA pools

BACKGROUND: Next-generation sequencing technologies are widely used for detection of millions of Single Nucleotide Polymorphisms (SNPs) and also provide a means of assessing their variation. This information is useful for composing subsets of highly informative SNPs for region-specific or genome-wid...

Descripción completa

Detalles Bibliográficos
Autores principales: Marklund, Stefan, Carlborg, Örjan
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3091782/
https://www.ncbi.nlm.nih.gov/pubmed/21108801
http://dx.doi.org/10.1186/1471-2164-11-665
_version_ 1782203326920654848
author Marklund, Stefan
Carlborg, Örjan
author_facet Marklund, Stefan
Carlborg, Örjan
author_sort Marklund, Stefan
collection PubMed
description BACKGROUND: Next-generation sequencing technologies are widely used for detection of millions of Single Nucleotide Polymorphisms (SNPs) and also provide a means of assessing their variation. This information is useful for composing subsets of highly informative SNPs for region-specific or genome-wide analysis and to identify mutations regulating phenotypic differences within or between populations. In this study, we investigated the sensitivity of SNP detection and introduced the flanking SNPs value (FSV) as a novel measure for predicting SNP-variability using ~5X genome resequencing with ABI SOLID and DNA pools from two chicken lines divergently selected for juvenile bodyweight. RESULTS: Genotyping with a 60 K SNP chip revealed polymorphisms within or between two divergently selected chicken lines for 31 363 SNPs, 48% of which were also detected using resequencing of DNA pools. SNP detection using resequencing was more powerful for positions with larger differences in allele frequency between the lines. About 50% of the SNPs with non-reference allele frequencies in the range 0.5-0.6 and 67% of those with frequencies > 0.9 could be detected. On average, ~3.7 SNPs/kb were detected by resequencing, with about 5% lower density on microchromosomes than on macrochromosomes. There was a positive correlation between the observed between-line SNP variation from the 60 K chip analysis and our proposed FSV score computed from the genome resequencing data. The strongest correlations on macrochromosomes and microchromosomes were observed when the FSV was calculated with total flanking regions of 62 kb (correlation 0.55) and 38 kb (correlation 0.45), respectively. CONCLUSIONS: Genome resequencing with limited coverage (~5X) using pooled DNA samples and three non-reference reads as a threshold for SNP detection, identified 50 - 67% of the 60 K SNPs with a non-reference allele frequency larger than 0.5. The SNP density was around 5% lower on the microchromosomes, most likely because of their higher gene content. Our proposed method to estimate the SNP variation (FSV) uses additional sequence information to better predict SNP informativity. The FSV scores showed higher correlations for SNPs with a larger difference in allele frequency between the populations. The correlation was strongest on macrochromosomes, probably due to a lower recombination rate.
format Text
id pubmed-3091782
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-30917822011-05-12 SNP detection and prediction of variability between chicken lines using genome resequencing of DNA pools Marklund, Stefan Carlborg, Örjan BMC Genomics Research Article BACKGROUND: Next-generation sequencing technologies are widely used for detection of millions of Single Nucleotide Polymorphisms (SNPs) and also provide a means of assessing their variation. This information is useful for composing subsets of highly informative SNPs for region-specific or genome-wide analysis and to identify mutations regulating phenotypic differences within or between populations. In this study, we investigated the sensitivity of SNP detection and introduced the flanking SNPs value (FSV) as a novel measure for predicting SNP-variability using ~5X genome resequencing with ABI SOLID and DNA pools from two chicken lines divergently selected for juvenile bodyweight. RESULTS: Genotyping with a 60 K SNP chip revealed polymorphisms within or between two divergently selected chicken lines for 31 363 SNPs, 48% of which were also detected using resequencing of DNA pools. SNP detection using resequencing was more powerful for positions with larger differences in allele frequency between the lines. About 50% of the SNPs with non-reference allele frequencies in the range 0.5-0.6 and 67% of those with frequencies > 0.9 could be detected. On average, ~3.7 SNPs/kb were detected by resequencing, with about 5% lower density on microchromosomes than on macrochromosomes. There was a positive correlation between the observed between-line SNP variation from the 60 K chip analysis and our proposed FSV score computed from the genome resequencing data. The strongest correlations on macrochromosomes and microchromosomes were observed when the FSV was calculated with total flanking regions of 62 kb (correlation 0.55) and 38 kb (correlation 0.45), respectively. CONCLUSIONS: Genome resequencing with limited coverage (~5X) using pooled DNA samples and three non-reference reads as a threshold for SNP detection, identified 50 - 67% of the 60 K SNPs with a non-reference allele frequency larger than 0.5. The SNP density was around 5% lower on the microchromosomes, most likely because of their higher gene content. Our proposed method to estimate the SNP variation (FSV) uses additional sequence information to better predict SNP informativity. The FSV scores showed higher correlations for SNPs with a larger difference in allele frequency between the populations. The correlation was strongest on macrochromosomes, probably due to a lower recombination rate. BioMed Central 2010-11-25 /pmc/articles/PMC3091782/ /pubmed/21108801 http://dx.doi.org/10.1186/1471-2164-11-665 Text en Copyright ©2010 Marklund and Carlborg; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Marklund, Stefan
Carlborg, Örjan
SNP detection and prediction of variability between chicken lines using genome resequencing of DNA pools
title SNP detection and prediction of variability between chicken lines using genome resequencing of DNA pools
title_full SNP detection and prediction of variability between chicken lines using genome resequencing of DNA pools
title_fullStr SNP detection and prediction of variability between chicken lines using genome resequencing of DNA pools
title_full_unstemmed SNP detection and prediction of variability between chicken lines using genome resequencing of DNA pools
title_short SNP detection and prediction of variability between chicken lines using genome resequencing of DNA pools
title_sort snp detection and prediction of variability between chicken lines using genome resequencing of dna pools
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3091782/
https://www.ncbi.nlm.nih.gov/pubmed/21108801
http://dx.doi.org/10.1186/1471-2164-11-665
work_keys_str_mv AT marklundstefan snpdetectionandpredictionofvariabilitybetweenchickenlinesusinggenomeresequencingofdnapools
AT carlborgorjan snpdetectionandpredictionofvariabilitybetweenchickenlinesusinggenomeresequencingofdnapools