Cargando…
Significance testing and genomic inflation factor using high‐density genotypes or whole‐genome sequence data
Significance testing for genome‐wide association study (GWAS) with increasing SNP density up to whole‐genome sequence data (WGS) is not straightforward, because of strong LD between SNP and population stratification. Therefore, the objective of this study was to investigate genomic control and diffe...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6900143/ https://www.ncbi.nlm.nih.gov/pubmed/31215703 http://dx.doi.org/10.1111/jbg.12419 |
_version_ | 1783477290451075072 |
---|---|
author | van den Berg, Sanne Vandenplas, Jérémie van Eeuwijk, Fred A. Lopes, Marcos S. Veerkamp, Roel F. |
author_facet | van den Berg, Sanne Vandenplas, Jérémie van Eeuwijk, Fred A. Lopes, Marcos S. Veerkamp, Roel F. |
author_sort | van den Berg, Sanne |
collection | PubMed |
description | Significance testing for genome‐wide association study (GWAS) with increasing SNP density up to whole‐genome sequence data (WGS) is not straightforward, because of strong LD between SNP and population stratification. Therefore, the objective of this study was to investigate genomic control and different significance testing procedures using data from a commercial pig breeding scheme. A GWAS was performed in GCTA with data of 4,964 Large White pigs using medium density, high density or imputed whole‐genome sequence data, fitting a genomic relationship matrix based on a leave‐one–chromosome‐out approach to account for population structure. Subsequently, genomic inflation factors were assessed on whole‐genome level and the chromosome level. To establish a significance threshold, permutation testing, Bonferroni corrections using either the total number of SNPs or the number of independent chromosome fragments, and false discovery rates (FDR) using either the Benjamini–Hochberg procedure or the Benjamini and Yekutieli procedure were evaluated. We found that genomic inflation factors did not differ between different density genotypes but do differ between chromosomes. Also, the leave‐one‐chromosome‐out approach for GWAS or using the pedigree relationships did not account appropriately for population stratification and gave strong genomic inflation. Regarding different procedures for significance testing, when the aim is to find QTL regions that are associated with a trait of interest, we recommend applying the FDR following the Benjamini and Yekutieli approach to establish a significance threshold that is adjusted for multiple testing. When the aim is to pinpoint a specific mutation, the more conservative Bonferroni correction based on the total number of SNPs is more appropriate, till an appropriate method is established to adjust for the number of independent tests. |
format | Online Article Text |
id | pubmed-6900143 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-69001432019-12-20 Significance testing and genomic inflation factor using high‐density genotypes or whole‐genome sequence data van den Berg, Sanne Vandenplas, Jérémie van Eeuwijk, Fred A. Lopes, Marcos S. Veerkamp, Roel F. J Anim Breed Genet Original Articles Significance testing for genome‐wide association study (GWAS) with increasing SNP density up to whole‐genome sequence data (WGS) is not straightforward, because of strong LD between SNP and population stratification. Therefore, the objective of this study was to investigate genomic control and different significance testing procedures using data from a commercial pig breeding scheme. A GWAS was performed in GCTA with data of 4,964 Large White pigs using medium density, high density or imputed whole‐genome sequence data, fitting a genomic relationship matrix based on a leave‐one–chromosome‐out approach to account for population structure. Subsequently, genomic inflation factors were assessed on whole‐genome level and the chromosome level. To establish a significance threshold, permutation testing, Bonferroni corrections using either the total number of SNPs or the number of independent chromosome fragments, and false discovery rates (FDR) using either the Benjamini–Hochberg procedure or the Benjamini and Yekutieli procedure were evaluated. We found that genomic inflation factors did not differ between different density genotypes but do differ between chromosomes. Also, the leave‐one‐chromosome‐out approach for GWAS or using the pedigree relationships did not account appropriately for population stratification and gave strong genomic inflation. Regarding different procedures for significance testing, when the aim is to find QTL regions that are associated with a trait of interest, we recommend applying the FDR following the Benjamini and Yekutieli approach to establish a significance threshold that is adjusted for multiple testing. When the aim is to pinpoint a specific mutation, the more conservative Bonferroni correction based on the total number of SNPs is more appropriate, till an appropriate method is established to adjust for the number of independent tests. John Wiley and Sons Inc. 2019-06-19 2019-11 /pmc/articles/PMC6900143/ /pubmed/31215703 http://dx.doi.org/10.1111/jbg.12419 Text en © 2019 The Authors. Journal of Animal Breeding and Genetics Published by Blackwell Verlag GmbH This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Articles van den Berg, Sanne Vandenplas, Jérémie van Eeuwijk, Fred A. Lopes, Marcos S. Veerkamp, Roel F. Significance testing and genomic inflation factor using high‐density genotypes or whole‐genome sequence data |
title | Significance testing and genomic inflation factor using high‐density genotypes or whole‐genome sequence data |
title_full | Significance testing and genomic inflation factor using high‐density genotypes or whole‐genome sequence data |
title_fullStr | Significance testing and genomic inflation factor using high‐density genotypes or whole‐genome sequence data |
title_full_unstemmed | Significance testing and genomic inflation factor using high‐density genotypes or whole‐genome sequence data |
title_short | Significance testing and genomic inflation factor using high‐density genotypes or whole‐genome sequence data |
title_sort | significance testing and genomic inflation factor using high‐density genotypes or whole‐genome sequence data |
topic | Original Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6900143/ https://www.ncbi.nlm.nih.gov/pubmed/31215703 http://dx.doi.org/10.1111/jbg.12419 |
work_keys_str_mv | AT vandenbergsanne significancetestingandgenomicinflationfactorusinghighdensitygenotypesorwholegenomesequencedata AT vandenplasjeremie significancetestingandgenomicinflationfactorusinghighdensitygenotypesorwholegenomesequencedata AT vaneeuwijkfreda significancetestingandgenomicinflationfactorusinghighdensitygenotypesorwholegenomesequencedata AT lopesmarcoss significancetestingandgenomicinflationfactorusinghighdensitygenotypesorwholegenomesequencedata AT veerkamproelf significancetestingandgenomicinflationfactorusinghighdensitygenotypesorwholegenomesequencedata |