
Polymorphism discovery and allele frequency estimation using high-throughput DNA sequencing of target-enriched pooled DNA samples

BACKGROUND: The central role of the somatotrophic axis in animal post-natal growth, development and fertility is well established. Therefore, the identification of genetic variants affecting quantitative traits within this axis is an attractive goal. However, large sample numbers are a pre-requisite...


Bibliographic Details
Main authors: Mullen, Michael P, Creevey, Christopher J, Berry, Donagh P, McCabe, Matt S, Magee, David A, Howard, Dawn J, Killeen, Aideen P, Park, Stephen D, McGettigan, Paul A, Lucy, Matt C, MacHugh, David E, Waters, Sinead M
Format: Online Article Text
Language: English
Published: BioMed Central 2012
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3315736/
https://www.ncbi.nlm.nih.gov/pubmed/22235840
http://dx.doi.org/10.1186/1471-2164-13-16
author Mullen, Michael P
Creevey, Christopher J
Berry, Donagh P
McCabe, Matt S
Magee, David A
Howard, Dawn J
Killeen, Aideen P
Park, Stephen D
McGettigan, Paul A
Lucy, Matt C
MacHugh, David E
Waters, Sinead M
author_sort Mullen, Michael P
collection PubMed
description BACKGROUND: The central role of the somatotrophic axis in animal post-natal growth, development and fertility is well established. Therefore, the identification of genetic variants affecting quantitative traits within this axis is an attractive goal. However, large sample numbers are a pre-requisite for the identification of genetic variants underlying complex traits and, although technologies are improving rapidly, high-throughput sequencing of large numbers of complete individual genomes remains prohibitively expensive. Therefore, using a pooled DNA approach coupled with target enrichment and high-throughput sequencing, the aim of this study was to identify polymorphisms and estimate allele frequency differences across 83 candidate genes of the somatotrophic axis, in 150 Holstein-Friesian dairy bulls divided into two groups divergent for genetic merit for fertility. RESULTS: In total, 4,135 SNPs and 893 indels were identified during the resequencing of the 83 candidate genes. Nineteen percent (n = 952) of variants were located within 5' and 3' UTRs. Seventy-two percent (n = 3,612) were intronic and 9% (n = 464) were exonic, including 65 indels and 236 SNPs resulting in non-synonymous substitutions (NSS). Significant (P < 0.01) mean allele frequency differentials between the low and high fertility groups were observed for 720 SNPs (58 NSS). Allele frequencies for 43 of the SNPs were also determined by genotyping the 150 individual animals (Sequenom® MassARRAY). No significant differences (P > 0.1) were observed between the two methods for any of the 43 SNPs across both pools (i.e., 86 tests in total). CONCLUSIONS: The results of the current study support previous findings of the use of DNA sample pooling and high-throughput sequencing as a viable strategy for polymorphism discovery and allele frequency estimation. 
Using this approach, we have characterised the genetic variation within genes of the somatotrophic axis and related pathways, central to mammalian post-natal growth and development and subsequent lactogenesis and fertility. We have identified a large number of variants segregating at significantly different frequencies between cattle groups divergent for calving interval, plausibly harbouring causative variants contributing to heritable variation. To our knowledge, this is the first report describing sequencing of targeted genomic regions in any livestock species using groups with divergent phenotypes for an economically important trait.
format Online
Article
Text
id pubmed-3315736
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-3315736 2012-03-31 Polymorphism discovery and allele frequency estimation using high-throughput DNA sequencing of target-enriched pooled DNA samples Mullen, Michael P Creevey, Christopher J Berry, Donagh P McCabe, Matt S Magee, David A Howard, Dawn J Killeen, Aideen P Park, Stephen D McGettigan, Paul A Lucy, Matt C MacHugh, David E Waters, Sinead M BMC Genomics Research Article BACKGROUND: The central role of the somatotrophic axis in animal post-natal growth, development and fertility is well established. Therefore, the identification of genetic variants affecting quantitative traits within this axis is an attractive goal. However, large sample numbers are a pre-requisite for the identification of genetic variants underlying complex traits and, although technologies are improving rapidly, high-throughput sequencing of large numbers of complete individual genomes remains prohibitively expensive. Therefore, using a pooled DNA approach coupled with target enrichment and high-throughput sequencing, the aim of this study was to identify polymorphisms and estimate allele frequency differences across 83 candidate genes of the somatotrophic axis, in 150 Holstein-Friesian dairy bulls divided into two groups divergent for genetic merit for fertility. RESULTS: In total, 4,135 SNPs and 893 indels were identified during the resequencing of the 83 candidate genes. Nineteen percent (n = 952) of variants were located within 5' and 3' UTRs. Seventy-two percent (n = 3,612) were intronic and 9% (n = 464) were exonic, including 65 indels and 236 SNPs resulting in non-synonymous substitutions (NSS). Significant (P < 0.01) mean allele frequency differentials between the low and high fertility groups were observed for 720 SNPs (58 NSS). Allele frequencies for 43 of the SNPs were also determined by genotyping the 150 individual animals (Sequenom® MassARRAY). 
No significant differences (P > 0.1) were observed between the two methods for any of the 43 SNPs across both pools (i.e., 86 tests in total). CONCLUSIONS: The results of the current study support previous findings of the use of DNA sample pooling and high-throughput sequencing as a viable strategy for polymorphism discovery and allele frequency estimation. Using this approach, we have characterised the genetic variation within genes of the somatotrophic axis and related pathways, central to mammalian post-natal growth and development and subsequent lactogenesis and fertility. We have identified a large number of variants segregating at significantly different frequencies between cattle groups divergent for calving interval, plausibly harbouring causative variants contributing to heritable variation. To our knowledge, this is the first report describing sequencing of targeted genomic regions in any livestock species using groups with divergent phenotypes for an economically important trait. BioMed Central 2012-01-11 /pmc/articles/PMC3315736/ /pubmed/22235840 http://dx.doi.org/10.1186/1471-2164-13-16 Text en Copyright ©2012 Mullen et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title Polymorphism discovery and allele frequency estimation using high-throughput DNA sequencing of target-enriched pooled DNA samples
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3315736/
https://www.ncbi.nlm.nih.gov/pubmed/22235840
http://dx.doi.org/10.1186/1471-2164-13-16