Cargando…
A quadratically regularized functional canonical correlation analysis for identifying the global structure of pleiotropy with NGS data
Investigating the pleiotropic effects of genetic variants can increase statistical power, provide important information to achieve deep understanding of the complex genetic structures of disease, and offer powerful tools for designing effective treatments with fewer side effects. However, the curren...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5659802/ https://www.ncbi.nlm.nih.gov/pubmed/29040274 http://dx.doi.org/10.1371/journal.pcbi.1005788 |
_version_ | 1783274211864739840 |
---|---|
author | Lin, Nan Zhu, Yun Fan, Ruzong Xiong, Momiao |
author_facet | Lin, Nan Zhu, Yun Fan, Ruzong Xiong, Momiao |
author_sort | Lin, Nan |
collection | PubMed |
description | Investigating the pleiotropic effects of genetic variants can increase statistical power, provide important information to achieve deep understanding of the complex genetic structures of disease, and offer powerful tools for designing effective treatments with fewer side effects. However, the current multiple phenotype association analysis paradigm lacks breadth (number of phenotypes and genetic variants jointly analyzed at the same time) and depth (hierarchical structure of phenotype and genotypes). A key issue for high dimensional pleiotropic analysis is to effectively extract informative internal representation and features from high dimensional genotype and phenotype data. To explore correlation information of genetic variants, effectively reduce data dimensions, and overcome critical barriers in advancing the development of novel statistical methods and computational algorithms for genetic pleiotropic analysis, we proposed a new statistic method referred to as a quadratically regularized functional CCA (QRFCCA) for association analysis which combines three approaches: (1) quadratically regularized matrix factorization, (2) functional data analysis and (3) canonical correlation analysis (CCA). Large-scale simulations show that the QRFCCA has a much higher power than that of the ten competing statistics while retaining the appropriate type 1 errors. To further evaluate performance, the QRFCCA and ten other statistics are applied to the whole genome sequencing dataset from the TwinsUK study. We identify a total of 79 genes with rare variants and 67 genes with common variants significantly associated with the 46 traits using QRFCCA. The results show that the QRFCCA substantially outperforms the ten other statistics. |
format | Online Article Text |
id | pubmed-5659802 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-56598022017-11-09 A quadratically regularized functional canonical correlation analysis for identifying the global structure of pleiotropy with NGS data Lin, Nan Zhu, Yun Fan, Ruzong Xiong, Momiao PLoS Comput Biol Research Article Investigating the pleiotropic effects of genetic variants can increase statistical power, provide important information to achieve deep understanding of the complex genetic structures of disease, and offer powerful tools for designing effective treatments with fewer side effects. However, the current multiple phenotype association analysis paradigm lacks breadth (number of phenotypes and genetic variants jointly analyzed at the same time) and depth (hierarchical structure of phenotype and genotypes). A key issue for high dimensional pleiotropic analysis is to effectively extract informative internal representation and features from high dimensional genotype and phenotype data. To explore correlation information of genetic variants, effectively reduce data dimensions, and overcome critical barriers in advancing the development of novel statistical methods and computational algorithms for genetic pleiotropic analysis, we proposed a new statistic method referred to as a quadratically regularized functional CCA (QRFCCA) for association analysis which combines three approaches: (1) quadratically regularized matrix factorization, (2) functional data analysis and (3) canonical correlation analysis (CCA). Large-scale simulations show that the QRFCCA has a much higher power than that of the ten competing statistics while retaining the appropriate type 1 errors. To further evaluate performance, the QRFCCA and ten other statistics are applied to the whole genome sequencing dataset from the TwinsUK study. We identify a total of 79 genes with rare variants and 67 genes with common variants significantly associated with the 46 traits using QRFCCA. The results show that the QRFCCA substantially outperforms the ten other statistics. Public Library of Science 2017-10-17 /pmc/articles/PMC5659802/ /pubmed/29040274 http://dx.doi.org/10.1371/journal.pcbi.1005788 Text en © 2017 Lin et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Lin, Nan Zhu, Yun Fan, Ruzong Xiong, Momiao A quadratically regularized functional canonical correlation analysis for identifying the global structure of pleiotropy with NGS data |
title | A quadratically regularized functional canonical correlation analysis for identifying the global structure of pleiotropy with NGS data |
title_full | A quadratically regularized functional canonical correlation analysis for identifying the global structure of pleiotropy with NGS data |
title_fullStr | A quadratically regularized functional canonical correlation analysis for identifying the global structure of pleiotropy with NGS data |
title_full_unstemmed | A quadratically regularized functional canonical correlation analysis for identifying the global structure of pleiotropy with NGS data |
title_short | A quadratically regularized functional canonical correlation analysis for identifying the global structure of pleiotropy with NGS data |
title_sort | quadratically regularized functional canonical correlation analysis for identifying the global structure of pleiotropy with ngs data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5659802/ https://www.ncbi.nlm.nih.gov/pubmed/29040274 http://dx.doi.org/10.1371/journal.pcbi.1005788 |
work_keys_str_mv | AT linnan aquadraticallyregularizedfunctionalcanonicalcorrelationanalysisforidentifyingtheglobalstructureofpleiotropywithngsdata AT zhuyun aquadraticallyregularizedfunctionalcanonicalcorrelationanalysisforidentifyingtheglobalstructureofpleiotropywithngsdata AT fanruzong aquadraticallyregularizedfunctionalcanonicalcorrelationanalysisforidentifyingtheglobalstructureofpleiotropywithngsdata AT xiongmomiao aquadraticallyregularizedfunctionalcanonicalcorrelationanalysisforidentifyingtheglobalstructureofpleiotropywithngsdata AT linnan quadraticallyregularizedfunctionalcanonicalcorrelationanalysisforidentifyingtheglobalstructureofpleiotropywithngsdata AT zhuyun quadraticallyregularizedfunctionalcanonicalcorrelationanalysisforidentifyingtheglobalstructureofpleiotropywithngsdata AT fanruzong quadraticallyregularizedfunctionalcanonicalcorrelationanalysisforidentifyingtheglobalstructureofpleiotropywithngsdata AT xiongmomiao quadraticallyregularizedfunctionalcanonicalcorrelationanalysisforidentifyingtheglobalstructureofpleiotropywithngsdata |