Cargando…

Integrating multiple genomic data: sparse representation based biomarker selection for blood pressure

BACKGROUND: Although many genes have been implicated as hypertension candidates, to date, few studies have integrated different types of genomic data for the purpose of biomarker selection. METHODS: Applying a newly proposed sparse representation based variable selection (SRVS) method to the Genetic...

Descripción completa

Detalles Bibliográficos
Autores principales: Cao, Hongbao, Guo, Wei, Qin, Haide, Xu, Mengyuan, Lehrman, Benjamin, Tao, Yu, Shugart, Yin-Yao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5133507/
https://www.ncbi.nlm.nih.gov/pubmed/27980650
http://dx.doi.org/10.1186/s12919-016-0044-7
Descripción
Sumario:BACKGROUND: Although many genes have been implicated as hypertension candidates, to date, few studies have integrated different types of genomic data for the purpose of biomarker selection. METHODS: Applying a newly proposed sparse representation based variable selection (SRVS) method to the Genetic Analysis Workshop19 data, we analyzed a combined data set consisting of 11522 gene expressions and 354893 single-nucleotide polymorphisms (SNPs) from 397 subjects (case/control: 151/246), with the aim to identify potential biomarkers for blood pressure using both gene expression measures and SNP data. RESULTS: Among the top 1000 variables (SNPs/gene expressions = 575/425) selected, the bioinformatics analysis showed that 302 were plausibly associated with blood pressure. In addition, we identified 173 variables that were associated with body weight and 84 associated with left ventricular contractility. Together, 55.9 % of the top 1000 variables showed associations with blood pressure related phenotypes(SNP/gene expression =348/211). CONCLUSIONS: Our results support the feasibility of the SRVS algorithm in integrating multiple data sets of different structure for comprehensive analysis.