Cargando…

Genetic association tests in family samples for multi-category phenotypes

BACKGROUND: Advancements in statistical methods and sequencing technology have led to numerous novel discoveries in human genetics in the past two decades. Among phenotypes of interest, most attention has been given to studying genetic associations with continuous or binary traits. Efficient statist...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Shuai, Meigs, James B., Dupuis, Josée
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8642939/
https://www.ncbi.nlm.nih.gov/pubmed/34863089
http://dx.doi.org/10.1186/s12864-021-08107-x
_version_ 1784609775384788992
author Wang, Shuai
Meigs, James B.
Dupuis, Josée
author_facet Wang, Shuai
Meigs, James B.
Dupuis, Josée
author_sort Wang, Shuai
collection PubMed
description BACKGROUND: Advancements in statistical methods and sequencing technology have led to numerous novel discoveries in human genetics in the past two decades. Among phenotypes of interest, most attention has been given to studying genetic associations with continuous or binary traits. Efficient statistical methods have been proposed and are available for both types of traits under different study designs. However, for multinomial categorical traits in related samples, there is a lack of efficient statistical methods and software. RESULTS: We propose an efficient score test to analyze a multinomial trait in family samples, in the context of genome-wide association/sequencing studies. An alternative Wald statistic is also proposed. We also extend the methodology to be applicable to ordinal traits. We performed extensive simulation studies to evaluate the type-I error of the score test, Wald test compared to the multinomial logistic regression for unrelated samples, under different allele frequency and study designs. We also evaluate the power of these methods. Results show that both the score and Wald tests have a well-controlled type-I error rate, but the multinomial logistic regression has an inflated type-I error rate when applied to family samples. We illustrated the application of the score test with an application to the Framingham Heart Study to uncover genetic variants associated with diabesity, a multi-category phenotype. CONCLUSION: Both proposed tests have correct type-I error rate and similar power. However, because the Wald statistics rely on computer-intensive estimation, it is less efficient than the score test in terms of applications to large-scale genetic association studies. We provide computer implementation for both multinomial and ordinal traits. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-021-08107-x.
format Online
Article
Text
id pubmed-8642939
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-86429392021-12-06 Genetic association tests in family samples for multi-category phenotypes Wang, Shuai Meigs, James B. Dupuis, Josée BMC Genomics Research Article BACKGROUND: Advancements in statistical methods and sequencing technology have led to numerous novel discoveries in human genetics in the past two decades. Among phenotypes of interest, most attention has been given to studying genetic associations with continuous or binary traits. Efficient statistical methods have been proposed and are available for both types of traits under different study designs. However, for multinomial categorical traits in related samples, there is a lack of efficient statistical methods and software. RESULTS: We propose an efficient score test to analyze a multinomial trait in family samples, in the context of genome-wide association/sequencing studies. An alternative Wald statistic is also proposed. We also extend the methodology to be applicable to ordinal traits. We performed extensive simulation studies to evaluate the type-I error of the score test, Wald test compared to the multinomial logistic regression for unrelated samples, under different allele frequency and study designs. We also evaluate the power of these methods. Results show that both the score and Wald tests have a well-controlled type-I error rate, but the multinomial logistic regression has an inflated type-I error rate when applied to family samples. We illustrated the application of the score test with an application to the Framingham Heart Study to uncover genetic variants associated with diabesity, a multi-category phenotype. CONCLUSION: Both proposed tests have correct type-I error rate and similar power. However, because the Wald statistics rely on computer-intensive estimation, it is less efficient than the score test in terms of applications to large-scale genetic association studies. We provide computer implementation for both multinomial and ordinal traits. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-021-08107-x. BioMed Central 2021-12-04 /pmc/articles/PMC8642939/ /pubmed/34863089 http://dx.doi.org/10.1186/s12864-021-08107-x Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Wang, Shuai
Meigs, James B.
Dupuis, Josée
Genetic association tests in family samples for multi-category phenotypes
title Genetic association tests in family samples for multi-category phenotypes
title_full Genetic association tests in family samples for multi-category phenotypes
title_fullStr Genetic association tests in family samples for multi-category phenotypes
title_full_unstemmed Genetic association tests in family samples for multi-category phenotypes
title_short Genetic association tests in family samples for multi-category phenotypes
title_sort genetic association tests in family samples for multi-category phenotypes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8642939/
https://www.ncbi.nlm.nih.gov/pubmed/34863089
http://dx.doi.org/10.1186/s12864-021-08107-x
work_keys_str_mv AT wangshuai geneticassociationtestsinfamilysamplesformulticategoryphenotypes
AT meigsjamesb geneticassociationtestsinfamilysamplesformulticategoryphenotypes
AT dupuisjosee geneticassociationtestsinfamilysamplesformulticategoryphenotypes