Cargando…
AGNEP: An Agglomerative Nesting Clustering Algorithm for Phenotypic Dimension Reduction in Joint Analysis of Multiple Phenotypes
Genome-wide association study (GWAS) has identified thousands of genetic variants associated with complex traits and diseases. Compared with analyzing a single phenotype at a time, the joint analysis of multiple phenotypes can improve statistical power by taking into account the information from phe...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8107386/ https://www.ncbi.nlm.nih.gov/pubmed/33981331 http://dx.doi.org/10.3389/fgene.2021.648831 |
_version_ | 1783689938053627904 |
---|---|
author | Liu, Fengrong Zhou, Ziyang Cai, Mingzhi Wen, Yangjun Zhang, Jin |
author_facet | Liu, Fengrong Zhou, Ziyang Cai, Mingzhi Wen, Yangjun Zhang, Jin |
author_sort | Liu, Fengrong |
collection | PubMed |
description | Genome-wide association study (GWAS) has identified thousands of genetic variants associated with complex traits and diseases. Compared with analyzing a single phenotype at a time, the joint analysis of multiple phenotypes can improve statistical power by taking into account the information from phenotypes. However, most established joint algorithms ignore the different level of correlations between multiple phenotypes; instead of that, they simultaneously analyze all phenotypes in a genetic model. Thus, they may fail to capture the genetic structure of phenotypes and consequently reduce the statistical power. In this study, we develop a novel method agglomerative nesting clustering algorithm for phenotypic dimension reduction analysis (AGNEP) to jointly analyze multiple phenotypes for GWAS. First, AGNEP uses an agglomerative nesting clustering algorithm to group correlated phenotypes and then applies principal component analysis (PCA) to generate representative phenotypes for each group. Finally, multivariate analysis is employed to test associations between genetic variants and the representative phenotypes rather than all phenotypes. We perform three simulation experiments with various genetic structures and a real dataset analysis for 19 Arabidopsis phenotypes. Compared to established methods, AGNEP is more powerful in terms of statistical power, computing time, and the number of quantitative trait nucleotides (QTNs). The analysis of the Arabidopsis real dataset further illustrates the efficiency of AGNEP for detecting QTNs, which are confirmed by The Arabidopsis Information Resource gene bank. |
format | Online Article Text |
id | pubmed-8107386 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-81073862021-05-11 AGNEP: An Agglomerative Nesting Clustering Algorithm for Phenotypic Dimension Reduction in Joint Analysis of Multiple Phenotypes Liu, Fengrong Zhou, Ziyang Cai, Mingzhi Wen, Yangjun Zhang, Jin Front Genet Genetics Genome-wide association study (GWAS) has identified thousands of genetic variants associated with complex traits and diseases. Compared with analyzing a single phenotype at a time, the joint analysis of multiple phenotypes can improve statistical power by taking into account the information from phenotypes. However, most established joint algorithms ignore the different level of correlations between multiple phenotypes; instead of that, they simultaneously analyze all phenotypes in a genetic model. Thus, they may fail to capture the genetic structure of phenotypes and consequently reduce the statistical power. In this study, we develop a novel method agglomerative nesting clustering algorithm for phenotypic dimension reduction analysis (AGNEP) to jointly analyze multiple phenotypes for GWAS. First, AGNEP uses an agglomerative nesting clustering algorithm to group correlated phenotypes and then applies principal component analysis (PCA) to generate representative phenotypes for each group. Finally, multivariate analysis is employed to test associations between genetic variants and the representative phenotypes rather than all phenotypes. We perform three simulation experiments with various genetic structures and a real dataset analysis for 19 Arabidopsis phenotypes. Compared to established methods, AGNEP is more powerful in terms of statistical power, computing time, and the number of quantitative trait nucleotides (QTNs). The analysis of the Arabidopsis real dataset further illustrates the efficiency of AGNEP for detecting QTNs, which are confirmed by The Arabidopsis Information Resource gene bank. Frontiers Media S.A. 2021-04-26 /pmc/articles/PMC8107386/ /pubmed/33981331 http://dx.doi.org/10.3389/fgene.2021.648831 Text en Copyright © 2021 Liu, Zhou, Cai, Wen and Zhang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Liu, Fengrong Zhou, Ziyang Cai, Mingzhi Wen, Yangjun Zhang, Jin AGNEP: An Agglomerative Nesting Clustering Algorithm for Phenotypic Dimension Reduction in Joint Analysis of Multiple Phenotypes |
title | AGNEP: An Agglomerative Nesting Clustering Algorithm for Phenotypic Dimension Reduction in Joint Analysis of Multiple Phenotypes |
title_full | AGNEP: An Agglomerative Nesting Clustering Algorithm for Phenotypic Dimension Reduction in Joint Analysis of Multiple Phenotypes |
title_fullStr | AGNEP: An Agglomerative Nesting Clustering Algorithm for Phenotypic Dimension Reduction in Joint Analysis of Multiple Phenotypes |
title_full_unstemmed | AGNEP: An Agglomerative Nesting Clustering Algorithm for Phenotypic Dimension Reduction in Joint Analysis of Multiple Phenotypes |
title_short | AGNEP: An Agglomerative Nesting Clustering Algorithm for Phenotypic Dimension Reduction in Joint Analysis of Multiple Phenotypes |
title_sort | agnep: an agglomerative nesting clustering algorithm for phenotypic dimension reduction in joint analysis of multiple phenotypes |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8107386/ https://www.ncbi.nlm.nih.gov/pubmed/33981331 http://dx.doi.org/10.3389/fgene.2021.648831 |
work_keys_str_mv | AT liufengrong agnepanagglomerativenestingclusteringalgorithmforphenotypicdimensionreductioninjointanalysisofmultiplephenotypes AT zhouziyang agnepanagglomerativenestingclusteringalgorithmforphenotypicdimensionreductioninjointanalysisofmultiplephenotypes AT caimingzhi agnepanagglomerativenestingclusteringalgorithmforphenotypicdimensionreductioninjointanalysisofmultiplephenotypes AT wenyangjun agnepanagglomerativenestingclusteringalgorithmforphenotypicdimensionreductioninjointanalysisofmultiplephenotypes AT zhangjin agnepanagglomerativenestingclusteringalgorithmforphenotypicdimensionreductioninjointanalysisofmultiplephenotypes |