Cargando…
A comparative analysis of family-based and population-based association tests using whole genome sequence data
The revolution in next-generation sequencing has made obtaining both common and rare high-quality sequence variants across the entire genome feasible. Because researchers are now faced with the analytical challenges of handling a massive amount of genetic variant information from sequencing studies,...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4143682/ https://www.ncbi.nlm.nih.gov/pubmed/25519381 http://dx.doi.org/10.1186/1753-6561-8-S1-S33 |
_version_ | 1782331937915928576 |
---|---|
author | Zhou, Jin J Yip, Wai-Ki Cho, Michael H Qiao, Dandi McDonald, Merry-Lynn N Laird, Nan M |
author_facet | Zhou, Jin J Yip, Wai-Ki Cho, Michael H Qiao, Dandi McDonald, Merry-Lynn N Laird, Nan M |
author_sort | Zhou, Jin J |
collection | PubMed |
description | The revolution in next-generation sequencing has made obtaining both common and rare high-quality sequence variants across the entire genome feasible. Because researchers are now faced with the analytical challenges of handling a massive amount of genetic variant information from sequencing studies, numerous methods have been developed to assess the impact of both common and rare variants on disease traits. In this report, whole genome sequencing data from Genetic Analysis Workshop 18 was used to compare the power of several methods, considering both family-based and population-based designs, to detect association with variants in the MAP4 gene region and on chromosome 3 with blood pressure. To prioritize variants across the genome for testing, variants were first functionally assessed using prediction algorithms and expression quantitative trait loci (eQTLs) data. Four set-based tests in the family-based association tests (FBAT) framework--FBAT-v, FBAT-lmm, FBAT-m, and FBAT-l--were used to analyze 20 pedigrees, and 2 variance component tests, sequence kernel association test (SKAT) and genome-wide complex trait analysis (GCTA), were used with 142 unrelated individuals in the sample. Both set-based and variance-component-based tests had high power and an adequate type I error rate. Of the various FBATs, FBAT-l demonstrated superior performance, indicating the potential for it to be used in rare-variant analysis. The updated FBAT package is available at: http://www.hsph.harvard.edu/fbat/. |
format | Online Article Text |
id | pubmed-4143682 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-41436822014-09-02 A comparative analysis of family-based and population-based association tests using whole genome sequence data Zhou, Jin J Yip, Wai-Ki Cho, Michael H Qiao, Dandi McDonald, Merry-Lynn N Laird, Nan M BMC Proc Proceedings The revolution in next-generation sequencing has made obtaining both common and rare high-quality sequence variants across the entire genome feasible. Because researchers are now faced with the analytical challenges of handling a massive amount of genetic variant information from sequencing studies, numerous methods have been developed to assess the impact of both common and rare variants on disease traits. In this report, whole genome sequencing data from Genetic Analysis Workshop 18 was used to compare the power of several methods, considering both family-based and population-based designs, to detect association with variants in the MAP4 gene region and on chromosome 3 with blood pressure. To prioritize variants across the genome for testing, variants were first functionally assessed using prediction algorithms and expression quantitative trait loci (eQTLs) data. Four set-based tests in the family-based association tests (FBAT) framework--FBAT-v, FBAT-lmm, FBAT-m, and FBAT-l--were used to analyze 20 pedigrees, and 2 variance component tests, sequence kernel association test (SKAT) and genome-wide complex trait analysis (GCTA), were used with 142 unrelated individuals in the sample. Both set-based and variance-component-based tests had high power and an adequate type I error rate. Of the various FBATs, FBAT-l demonstrated superior performance, indicating the potential for it to be used in rare-variant analysis. The updated FBAT package is available at: http://www.hsph.harvard.edu/fbat/. BioMed Central 2014-06-17 /pmc/articles/PMC4143682/ /pubmed/25519381 http://dx.doi.org/10.1186/1753-6561-8-S1-S33 Text en Copyright © 2014 Zhou et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Proceedings Zhou, Jin J Yip, Wai-Ki Cho, Michael H Qiao, Dandi McDonald, Merry-Lynn N Laird, Nan M A comparative analysis of family-based and population-based association tests using whole genome sequence data |
title | A comparative analysis of family-based and population-based association tests using whole genome sequence data |
title_full | A comparative analysis of family-based and population-based association tests using whole genome sequence data |
title_fullStr | A comparative analysis of family-based and population-based association tests using whole genome sequence data |
title_full_unstemmed | A comparative analysis of family-based and population-based association tests using whole genome sequence data |
title_short | A comparative analysis of family-based and population-based association tests using whole genome sequence data |
title_sort | comparative analysis of family-based and population-based association tests using whole genome sequence data |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4143682/ https://www.ncbi.nlm.nih.gov/pubmed/25519381 http://dx.doi.org/10.1186/1753-6561-8-S1-S33 |
work_keys_str_mv | AT zhoujinj acomparativeanalysisoffamilybasedandpopulationbasedassociationtestsusingwholegenomesequencedata AT yipwaiki acomparativeanalysisoffamilybasedandpopulationbasedassociationtestsusingwholegenomesequencedata AT chomichaelh acomparativeanalysisoffamilybasedandpopulationbasedassociationtestsusingwholegenomesequencedata AT qiaodandi acomparativeanalysisoffamilybasedandpopulationbasedassociationtestsusingwholegenomesequencedata AT mcdonaldmerrylynnn acomparativeanalysisoffamilybasedandpopulationbasedassociationtestsusingwholegenomesequencedata AT lairdnanm acomparativeanalysisoffamilybasedandpopulationbasedassociationtestsusingwholegenomesequencedata AT zhoujinj comparativeanalysisoffamilybasedandpopulationbasedassociationtestsusingwholegenomesequencedata AT yipwaiki comparativeanalysisoffamilybasedandpopulationbasedassociationtestsusingwholegenomesequencedata AT chomichaelh comparativeanalysisoffamilybasedandpopulationbasedassociationtestsusingwholegenomesequencedata AT qiaodandi comparativeanalysisoffamilybasedandpopulationbasedassociationtestsusingwholegenomesequencedata AT mcdonaldmerrylynnn comparativeanalysisoffamilybasedandpopulationbasedassociationtestsusingwholegenomesequencedata AT lairdnanm comparativeanalysisoffamilybasedandpopulationbasedassociationtestsusingwholegenomesequencedata |