Cargando…

Pathways-Driven Sparse Regression Identifies Pathways and Genes Associated with High-Density Lipoprotein Cholesterol in Two Asian Cohorts

Standard approaches to data analysis in genome-wide association studies (GWAS) ignore any potential functional relationships between gene variants. In contrast gene pathways analysis uses prior information on functional structure within the genome to identify pathways associated with a trait of inte...

Descripción completa

Detalles Bibliográficos
Autores principales: Silver, Matt, Chen, Peng, Li, Ruoying, Cheng, Ching-Yu, Wong, Tien-Yin, Tai, E-Shyong, Teo, Yik-Ying, Montana, Giovanni
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3836716/
https://www.ncbi.nlm.nih.gov/pubmed/24278029
http://dx.doi.org/10.1371/journal.pgen.1003939
_version_ 1782292334480719872
author Silver, Matt
Chen, Peng
Li, Ruoying
Cheng, Ching-Yu
Wong, Tien-Yin
Tai, E-Shyong
Teo, Yik-Ying
Montana, Giovanni
author_facet Silver, Matt
Chen, Peng
Li, Ruoying
Cheng, Ching-Yu
Wong, Tien-Yin
Tai, E-Shyong
Teo, Yik-Ying
Montana, Giovanni
author_sort Silver, Matt
collection PubMed
description Standard approaches to data analysis in genome-wide association studies (GWAS) ignore any potential functional relationships between gene variants. In contrast gene pathways analysis uses prior information on functional structure within the genome to identify pathways associated with a trait of interest. In a second step, important single nucleotide polymorphisms (SNPs) or genes may be identified within associated pathways. The pathways approach is motivated by the fact that genes do not act alone, but instead have effects that are likely to be mediated through their interaction in gene pathways. Where this is the case, pathways approaches may reveal aspects of a trait's genetic architecture that would otherwise be missed when considering SNPs in isolation. Most pathways methods begin by testing SNPs one at a time, and so fail to capitalise on the potential advantages inherent in a multi-SNP, joint modelling approach. Here, we describe a dual-level, sparse regression model for the simultaneous identification of pathways and genes associated with a quantitative trait. Our method takes account of various factors specific to the joint modelling of pathways with genome-wide data, including widespread correlation between genetic predictors, and the fact that variants may overlap multiple pathways. We use a resampling strategy that exploits finite sample variability to provide robust rankings for pathways and genes. We test our method through simulation, and use it to perform pathways-driven gene selection in a search for pathways and genes associated with variation in serum high-density lipoprotein cholesterol levels in two separate GWAS cohorts of Asian adults. By comparing results from both cohorts we identify a number of candidate pathways including those associated with cardiomyopathy, and T cell receptor and PPAR signalling. Highlighted genes include those associated with the L-type calcium channel, adenylate cyclase, integrin, laminin, MAPK signalling and immune function.
format Online
Article
Text
id pubmed-3836716
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-38367162013-11-25 Pathways-Driven Sparse Regression Identifies Pathways and Genes Associated with High-Density Lipoprotein Cholesterol in Two Asian Cohorts Silver, Matt Chen, Peng Li, Ruoying Cheng, Ching-Yu Wong, Tien-Yin Tai, E-Shyong Teo, Yik-Ying Montana, Giovanni PLoS Genet Research Article Standard approaches to data analysis in genome-wide association studies (GWAS) ignore any potential functional relationships between gene variants. In contrast gene pathways analysis uses prior information on functional structure within the genome to identify pathways associated with a trait of interest. In a second step, important single nucleotide polymorphisms (SNPs) or genes may be identified within associated pathways. The pathways approach is motivated by the fact that genes do not act alone, but instead have effects that are likely to be mediated through their interaction in gene pathways. Where this is the case, pathways approaches may reveal aspects of a trait's genetic architecture that would otherwise be missed when considering SNPs in isolation. Most pathways methods begin by testing SNPs one at a time, and so fail to capitalise on the potential advantages inherent in a multi-SNP, joint modelling approach. Here, we describe a dual-level, sparse regression model for the simultaneous identification of pathways and genes associated with a quantitative trait. Our method takes account of various factors specific to the joint modelling of pathways with genome-wide data, including widespread correlation between genetic predictors, and the fact that variants may overlap multiple pathways. We use a resampling strategy that exploits finite sample variability to provide robust rankings for pathways and genes. We test our method through simulation, and use it to perform pathways-driven gene selection in a search for pathways and genes associated with variation in serum high-density lipoprotein cholesterol levels in two separate GWAS cohorts of Asian adults. By comparing results from both cohorts we identify a number of candidate pathways including those associated with cardiomyopathy, and T cell receptor and PPAR signalling. Highlighted genes include those associated with the L-type calcium channel, adenylate cyclase, integrin, laminin, MAPK signalling and immune function. Public Library of Science 2013-11-21 /pmc/articles/PMC3836716/ /pubmed/24278029 http://dx.doi.org/10.1371/journal.pgen.1003939 Text en © 2013 Silver et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Silver, Matt
Chen, Peng
Li, Ruoying
Cheng, Ching-Yu
Wong, Tien-Yin
Tai, E-Shyong
Teo, Yik-Ying
Montana, Giovanni
Pathways-Driven Sparse Regression Identifies Pathways and Genes Associated with High-Density Lipoprotein Cholesterol in Two Asian Cohorts
title Pathways-Driven Sparse Regression Identifies Pathways and Genes Associated with High-Density Lipoprotein Cholesterol in Two Asian Cohorts
title_full Pathways-Driven Sparse Regression Identifies Pathways and Genes Associated with High-Density Lipoprotein Cholesterol in Two Asian Cohorts
title_fullStr Pathways-Driven Sparse Regression Identifies Pathways and Genes Associated with High-Density Lipoprotein Cholesterol in Two Asian Cohorts
title_full_unstemmed Pathways-Driven Sparse Regression Identifies Pathways and Genes Associated with High-Density Lipoprotein Cholesterol in Two Asian Cohorts
title_short Pathways-Driven Sparse Regression Identifies Pathways and Genes Associated with High-Density Lipoprotein Cholesterol in Two Asian Cohorts
title_sort pathways-driven sparse regression identifies pathways and genes associated with high-density lipoprotein cholesterol in two asian cohorts
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3836716/
https://www.ncbi.nlm.nih.gov/pubmed/24278029
http://dx.doi.org/10.1371/journal.pgen.1003939
work_keys_str_mv AT silvermatt pathwaysdrivensparseregressionidentifiespathwaysandgenesassociatedwithhighdensitylipoproteincholesterolintwoasiancohorts
AT chenpeng pathwaysdrivensparseregressionidentifiespathwaysandgenesassociatedwithhighdensitylipoproteincholesterolintwoasiancohorts
AT liruoying pathwaysdrivensparseregressionidentifiespathwaysandgenesassociatedwithhighdensitylipoproteincholesterolintwoasiancohorts
AT chengchingyu pathwaysdrivensparseregressionidentifiespathwaysandgenesassociatedwithhighdensitylipoproteincholesterolintwoasiancohorts
AT wongtienyin pathwaysdrivensparseregressionidentifiespathwaysandgenesassociatedwithhighdensitylipoproteincholesterolintwoasiancohorts
AT taieshyong pathwaysdrivensparseregressionidentifiespathwaysandgenesassociatedwithhighdensitylipoproteincholesterolintwoasiancohorts
AT teoyikying pathwaysdrivensparseregressionidentifiespathwaysandgenesassociatedwithhighdensitylipoproteincholesterolintwoasiancohorts
AT montanagiovanni pathwaysdrivensparseregressionidentifiespathwaysandgenesassociatedwithhighdensitylipoproteincholesterolintwoasiancohorts