Cargando…

Significance testing and genomic inflation factor using high‐density genotypes or whole‐genome sequence data

Significance testing for genome‐wide association study (GWAS) with increasing SNP density up to whole‐genome sequence data (WGS) is not straightforward, because of strong LD between SNP and population stratification. Therefore, the objective of this study was to investigate genomic control and diffe...

Descripción completa

Detalles Bibliográficos
Autores principales: van den Berg, Sanne, Vandenplas, Jérémie, van Eeuwijk, Fred A., Lopes, Marcos S., Veerkamp, Roel F.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6900143/
https://www.ncbi.nlm.nih.gov/pubmed/31215703
http://dx.doi.org/10.1111/jbg.12419
_version_ 1783477290451075072
author van den Berg, Sanne
Vandenplas, Jérémie
van Eeuwijk, Fred A.
Lopes, Marcos S.
Veerkamp, Roel F.
author_facet van den Berg, Sanne
Vandenplas, Jérémie
van Eeuwijk, Fred A.
Lopes, Marcos S.
Veerkamp, Roel F.
author_sort van den Berg, Sanne
collection PubMed
description Significance testing for genome‐wide association study (GWAS) with increasing SNP density up to whole‐genome sequence data (WGS) is not straightforward, because of strong LD between SNP and population stratification. Therefore, the objective of this study was to investigate genomic control and different significance testing procedures using data from a commercial pig breeding scheme. A GWAS was performed in GCTA with data of 4,964 Large White pigs using medium density, high density or imputed whole‐genome sequence data, fitting a genomic relationship matrix based on a leave‐one–chromosome‐out approach to account for population structure. Subsequently, genomic inflation factors were assessed on whole‐genome level and the chromosome level. To establish a significance threshold, permutation testing, Bonferroni corrections using either the total number of SNPs or the number of independent chromosome fragments, and false discovery rates (FDR) using either the Benjamini–Hochberg procedure or the Benjamini and Yekutieli procedure were evaluated. We found that genomic inflation factors did not differ between different density genotypes but do differ between chromosomes. Also, the leave‐one‐chromosome‐out approach for GWAS or using the pedigree relationships did not account appropriately for population stratification and gave strong genomic inflation. Regarding different procedures for significance testing, when the aim is to find QTL regions that are associated with a trait of interest, we recommend applying the FDR following the Benjamini and Yekutieli approach to establish a significance threshold that is adjusted for multiple testing. When the aim is to pinpoint a specific mutation, the more conservative Bonferroni correction based on the total number of SNPs is more appropriate, till an appropriate method is established to adjust for the number of independent tests.
format Online
Article
Text
id pubmed-6900143
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-69001432019-12-20 Significance testing and genomic inflation factor using high‐density genotypes or whole‐genome sequence data van den Berg, Sanne Vandenplas, Jérémie van Eeuwijk, Fred A. Lopes, Marcos S. Veerkamp, Roel F. J Anim Breed Genet Original Articles Significance testing for genome‐wide association study (GWAS) with increasing SNP density up to whole‐genome sequence data (WGS) is not straightforward, because of strong LD between SNP and population stratification. Therefore, the objective of this study was to investigate genomic control and different significance testing procedures using data from a commercial pig breeding scheme. A GWAS was performed in GCTA with data of 4,964 Large White pigs using medium density, high density or imputed whole‐genome sequence data, fitting a genomic relationship matrix based on a leave‐one–chromosome‐out approach to account for population structure. Subsequently, genomic inflation factors were assessed on whole‐genome level and the chromosome level. To establish a significance threshold, permutation testing, Bonferroni corrections using either the total number of SNPs or the number of independent chromosome fragments, and false discovery rates (FDR) using either the Benjamini–Hochberg procedure or the Benjamini and Yekutieli procedure were evaluated. We found that genomic inflation factors did not differ between different density genotypes but do differ between chromosomes. Also, the leave‐one‐chromosome‐out approach for GWAS or using the pedigree relationships did not account appropriately for population stratification and gave strong genomic inflation. Regarding different procedures for significance testing, when the aim is to find QTL regions that are associated with a trait of interest, we recommend applying the FDR following the Benjamini and Yekutieli approach to establish a significance threshold that is adjusted for multiple testing. When the aim is to pinpoint a specific mutation, the more conservative Bonferroni correction based on the total number of SNPs is more appropriate, till an appropriate method is established to adjust for the number of independent tests. John Wiley and Sons Inc. 2019-06-19 2019-11 /pmc/articles/PMC6900143/ /pubmed/31215703 http://dx.doi.org/10.1111/jbg.12419 Text en © 2019 The Authors. Journal of Animal Breeding and Genetics Published by Blackwell Verlag GmbH This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Articles
van den Berg, Sanne
Vandenplas, Jérémie
van Eeuwijk, Fred A.
Lopes, Marcos S.
Veerkamp, Roel F.
Significance testing and genomic inflation factor using high‐density genotypes or whole‐genome sequence data
title Significance testing and genomic inflation factor using high‐density genotypes or whole‐genome sequence data
title_full Significance testing and genomic inflation factor using high‐density genotypes or whole‐genome sequence data
title_fullStr Significance testing and genomic inflation factor using high‐density genotypes or whole‐genome sequence data
title_full_unstemmed Significance testing and genomic inflation factor using high‐density genotypes or whole‐genome sequence data
title_short Significance testing and genomic inflation factor using high‐density genotypes or whole‐genome sequence data
title_sort significance testing and genomic inflation factor using high‐density genotypes or whole‐genome sequence data
topic Original Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6900143/
https://www.ncbi.nlm.nih.gov/pubmed/31215703
http://dx.doi.org/10.1111/jbg.12419
work_keys_str_mv AT vandenbergsanne significancetestingandgenomicinflationfactorusinghighdensitygenotypesorwholegenomesequencedata
AT vandenplasjeremie significancetestingandgenomicinflationfactorusinghighdensitygenotypesorwholegenomesequencedata
AT vaneeuwijkfreda significancetestingandgenomicinflationfactorusinghighdensitygenotypesorwholegenomesequencedata
AT lopesmarcoss significancetestingandgenomicinflationfactorusinghighdensitygenotypesorwholegenomesequencedata
AT veerkamproelf significancetestingandgenomicinflationfactorusinghighdensitygenotypesorwholegenomesequencedata