Cargando…

A comparative analysis of family-based and population-based association tests using whole genome sequence data

The revolution in next-generation sequencing has made obtaining both common and rare high-quality sequence variants across the entire genome feasible. Because researchers are now faced with the analytical challenges of handling a massive amount of genetic variant information from sequencing studies,...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Jin J, Yip, Wai-Ki, Cho, Michael H, Qiao, Dandi, McDonald, Merry-Lynn N, Laird, Nan M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4143682/
https://www.ncbi.nlm.nih.gov/pubmed/25519381
http://dx.doi.org/10.1186/1753-6561-8-S1-S33
_version_ 1782331937915928576
author Zhou, Jin J
Yip, Wai-Ki
Cho, Michael H
Qiao, Dandi
McDonald, Merry-Lynn N
Laird, Nan M
author_facet Zhou, Jin J
Yip, Wai-Ki
Cho, Michael H
Qiao, Dandi
McDonald, Merry-Lynn N
Laird, Nan M
author_sort Zhou, Jin J
collection PubMed
description The revolution in next-generation sequencing has made obtaining both common and rare high-quality sequence variants across the entire genome feasible. Because researchers are now faced with the analytical challenges of handling a massive amount of genetic variant information from sequencing studies, numerous methods have been developed to assess the impact of both common and rare variants on disease traits. In this report, whole genome sequencing data from Genetic Analysis Workshop 18 was used to compare the power of several methods, considering both family-based and population-based designs, to detect association with variants in the MAP4 gene region and on chromosome 3 with blood pressure. To prioritize variants across the genome for testing, variants were first functionally assessed using prediction algorithms and expression quantitative trait loci (eQTLs) data. Four set-based tests in the family-based association tests (FBAT) framework--FBAT-v, FBAT-lmm, FBAT-m, and FBAT-l--were used to analyze 20 pedigrees, and 2 variance component tests, sequence kernel association test (SKAT) and genome-wide complex trait analysis (GCTA), were used with 142 unrelated individuals in the sample. Both set-based and variance-component-based tests had high power and an adequate type I error rate. Of the various FBATs, FBAT-l demonstrated superior performance, indicating the potential for it to be used in rare-variant analysis. The updated FBAT package is available at: http://www.hsph.harvard.edu/fbat/.
format Online
Article
Text
id pubmed-4143682
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-41436822014-09-02 A comparative analysis of family-based and population-based association tests using whole genome sequence data Zhou, Jin J Yip, Wai-Ki Cho, Michael H Qiao, Dandi McDonald, Merry-Lynn N Laird, Nan M BMC Proc Proceedings The revolution in next-generation sequencing has made obtaining both common and rare high-quality sequence variants across the entire genome feasible. Because researchers are now faced with the analytical challenges of handling a massive amount of genetic variant information from sequencing studies, numerous methods have been developed to assess the impact of both common and rare variants on disease traits. In this report, whole genome sequencing data from Genetic Analysis Workshop 18 was used to compare the power of several methods, considering both family-based and population-based designs, to detect association with variants in the MAP4 gene region and on chromosome 3 with blood pressure. To prioritize variants across the genome for testing, variants were first functionally assessed using prediction algorithms and expression quantitative trait loci (eQTLs) data. Four set-based tests in the family-based association tests (FBAT) framework--FBAT-v, FBAT-lmm, FBAT-m, and FBAT-l--were used to analyze 20 pedigrees, and 2 variance component tests, sequence kernel association test (SKAT) and genome-wide complex trait analysis (GCTA), were used with 142 unrelated individuals in the sample. Both set-based and variance-component-based tests had high power and an adequate type I error rate. Of the various FBATs, FBAT-l demonstrated superior performance, indicating the potential for it to be used in rare-variant analysis. The updated FBAT package is available at: http://www.hsph.harvard.edu/fbat/. BioMed Central 2014-06-17 /pmc/articles/PMC4143682/ /pubmed/25519381 http://dx.doi.org/10.1186/1753-6561-8-S1-S33 Text en Copyright © 2014 Zhou et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Proceedings
Zhou, Jin J
Yip, Wai-Ki
Cho, Michael H
Qiao, Dandi
McDonald, Merry-Lynn N
Laird, Nan M
A comparative analysis of family-based and population-based association tests using whole genome sequence data
title A comparative analysis of family-based and population-based association tests using whole genome sequence data
title_full A comparative analysis of family-based and population-based association tests using whole genome sequence data
title_fullStr A comparative analysis of family-based and population-based association tests using whole genome sequence data
title_full_unstemmed A comparative analysis of family-based and population-based association tests using whole genome sequence data
title_short A comparative analysis of family-based and population-based association tests using whole genome sequence data
title_sort comparative analysis of family-based and population-based association tests using whole genome sequence data
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4143682/
https://www.ncbi.nlm.nih.gov/pubmed/25519381
http://dx.doi.org/10.1186/1753-6561-8-S1-S33
work_keys_str_mv AT zhoujinj acomparativeanalysisoffamilybasedandpopulationbasedassociationtestsusingwholegenomesequencedata
AT yipwaiki acomparativeanalysisoffamilybasedandpopulationbasedassociationtestsusingwholegenomesequencedata
AT chomichaelh acomparativeanalysisoffamilybasedandpopulationbasedassociationtestsusingwholegenomesequencedata
AT qiaodandi acomparativeanalysisoffamilybasedandpopulationbasedassociationtestsusingwholegenomesequencedata
AT mcdonaldmerrylynnn acomparativeanalysisoffamilybasedandpopulationbasedassociationtestsusingwholegenomesequencedata
AT lairdnanm acomparativeanalysisoffamilybasedandpopulationbasedassociationtestsusingwholegenomesequencedata
AT zhoujinj comparativeanalysisoffamilybasedandpopulationbasedassociationtestsusingwholegenomesequencedata
AT yipwaiki comparativeanalysisoffamilybasedandpopulationbasedassociationtestsusingwholegenomesequencedata
AT chomichaelh comparativeanalysisoffamilybasedandpopulationbasedassociationtestsusingwholegenomesequencedata
AT qiaodandi comparativeanalysisoffamilybasedandpopulationbasedassociationtestsusingwholegenomesequencedata
AT mcdonaldmerrylynnn comparativeanalysisoffamilybasedandpopulationbasedassociationtestsusingwholegenomesequencedata
AT lairdnanm comparativeanalysisoffamilybasedandpopulationbasedassociationtestsusingwholegenomesequencedata