Polymorphism discovery and allele frequency estimation using high-throughput DNA sequencing of target-enriched pooled DNA samples
BACKGROUND: The central role of the somatotrophic axis in animal post-natal growth, development and fertility is well established. Therefore, the identification of genetic variants affecting quantitative traits within this axis is an attractive goal. However, large sample numbers are a pre-requisite...
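Main authors: | Mullen, Michael P; Creevey, Christopher J; Berry, Donagh P; McCabe, Matt S; Magee, David A; Howard, Dawn J; Killeen, Aideen P; Park, Stephen D; McGettigan, Paul A; Lucy, Matt C; MacHugh, David E; Waters, Sinead M |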
---|---|
Format: | Online Article Text |
Language: | English |
Published: | BioMed Central 2012 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3315736/ https://www.ncbi.nlm.nih.gov/pubmed/22235840 http://dx.doi.org/10.1186/1471-2164-13-16 |
_version_ | 1782228281812058112 |
---|---|
author | Mullen, Michael P; Creevey, Christopher J; Berry, Donagh P; McCabe, Matt S; Magee, David A; Howard, Dawn J; Killeen, Aideen P; Park, Stephen D; McGettigan, Paul A; Lucy, Matt C; MacHugh, David E; Waters, Sinead M |
author_sort | Mullen, Michael P |
collection | PubMed |
description | BACKGROUND: The central role of the somatotrophic axis in animal post-natal growth, development and fertility is well established. Therefore, the identification of genetic variants affecting quantitative traits within this axis is an attractive goal. However, large sample numbers are a pre-requisite for the identification of genetic variants underlying complex traits and although technologies are improving rapidly, high-throughput sequencing of large numbers of complete individual genomes remains prohibitively expensive. Therefore using a pooled DNA approach coupled with target enrichment and high-throughput sequencing, the aim of this study was to identify polymorphisms and estimate allele frequency differences across 83 candidate genes of the somatotrophic axis, in 150 Holstein-Friesian dairy bulls divided into two groups divergent for genetic merit for fertility. RESULTS: In total, 4,135 SNPs and 893 indels were identified during the resequencing of the 83 candidate genes. Nineteen percent (n = 952) of variants were located within 5' and 3' UTRs. Seventy-two percent (n = 3,612) were intronic and 9% (n = 464) were exonic, including 65 indels and 236 SNPs resulting in non-synonymous substitutions (NSS). Significant (P < 0.01) mean allele frequency differentials between the low and high fertility groups were observed for 720 SNPs (58 NSS). Allele frequencies for 43 of the SNPs were also determined by genotyping the 150 individual animals (Sequenom® MassARRAY). No significant differences (P > 0.1) were observed between the two methods for any of the 43 SNPs across both pools (i.e., 86 tests in total). CONCLUSIONS: The results of the current study support previous findings of the use of DNA sample pooling and high-throughput sequencing as a viable strategy for polymorphism discovery and allele frequency estimation. Using this approach we have characterised the genetic variation within genes of the somatotrophic axis and related pathways, central to mammalian post-natal growth and development and subsequent lactogenesis and fertility. We have identified a large number of variants segregating at significantly different frequencies between cattle groups divergent for calving interval plausibly harbouring causative variants contributing to heritable variation. To our knowledge, this is the first report describing sequencing of targeted genomic regions in any livestock species using groups with divergent phenotypes for an economically important trait. |
format | Online Article Text |
id | pubmed-3315736 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-3315736 2012-03-31 Polymorphism discovery and allele frequency estimation using high-throughput DNA sequencing of target-enriched pooled DNA samples BMC Genomics Research Article BioMed Central 2012-01-11 /pmc/articles/PMC3315736/ /pubmed/22235840 http://dx.doi.org/10.1186/1471-2164-13-16 Text en Copyright ©2012 Mullen et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
title | Polymorphism discovery and allele frequency estimation using high-throughput DNA sequencing of target-enriched pooled DNA samples |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3315736/ https://www.ncbi.nlm.nih.gov/pubmed/22235840 http://dx.doi.org/10.1186/1471-2164-13-16 |
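The variant counts quoted in the abstract can be cross-checked with a few lines of arithmetic, and the idea of comparing allele frequencies between two groups can be illustrated with a generic test. The following is a minimal sketch only: the function names `region_shares` and `two_proportion_z` are hypothetical, the allele counts in the usage lines are invented for illustration, and the two-proportion z-test is a textbook stand-in, since the record does not say which statistical test the authors used.

```python
import math

# Counts quoted in the record's abstract.
SNPS = 4135
INDELS = 893
BY_REGION = {"UTR (5' and 3')": 952, "intronic": 3612, "exonic": 464}


def region_shares(by_region):
    """Return each region's share of all variants as a rounded percentage."""
    total = sum(by_region.values())
    return {name: round(100 * n / total) for name, n in by_region.items()}


def two_proportion_z(alt_a, n_a, alt_b, n_b):
    """Two-sided z-test for a difference between two allele frequencies.

    alt_* are counts of one allele and n_* are total allele counts per group.
    Assumption: a generic textbook test, not necessarily the study's method.
    """
    p_a, p_b = alt_a / n_a, alt_b / n_b
    pooled = (alt_a + alt_b) / (n_a + n_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    if se == 0:
        # Both groups are fixed for the same allele: no detectable difference.
        return 0.0, 1.0
    z = (p_a - p_b) / se
    # Two-sided p-value from the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


if __name__ == "__main__":
    # SNPs plus indels should equal the sum of the per-region counts.
    print(SNPS + INDELS, sum(BY_REGION.values()))
    print(region_shares(BY_REGION))
    # Hypothetical allele counts, for illustration only.
    print(two_proportion_z(90, 150, 60, 150))
```

Both totals come to 5,028 variants, and the rounded shares match the 19%, 72% and 9% given in the abstract.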