Cargando…

Genomic convergence and network analysis approach to identify candidate genes in Alzheimer's disease

BACKGROUND: Alzheimer’s disease (AD) is one of the leading genetically complex and heterogeneous disorder that is influenced by both genetic and environmental factors. The underlying risk factors remain largely unclear for this heterogeneous disorder. In recent years, high throughput methodologies,...

Descripción completa

Detalles Bibliográficos
Autores principales: Talwar, Puneet, Silla, Yumnam, Grover, Sandeep, Gupta, Meenal, Agarwal, Rachna, Kushwaha, Suman, Kukreti, Ritushree
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4028079/
https://www.ncbi.nlm.nih.gov/pubmed/24628925
http://dx.doi.org/10.1186/1471-2164-15-199
_version_ 1782317031510507520
author Talwar, Puneet
Silla, Yumnam
Grover, Sandeep
Gupta, Meenal
Agarwal, Rachna
Kushwaha, Suman
Kukreti, Ritushree
author_facet Talwar, Puneet
Silla, Yumnam
Grover, Sandeep
Gupta, Meenal
Agarwal, Rachna
Kushwaha, Suman
Kukreti, Ritushree
author_sort Talwar, Puneet
collection PubMed
description BACKGROUND: Alzheimer’s disease (AD) is one of the leading genetically complex and heterogeneous disorder that is influenced by both genetic and environmental factors. The underlying risk factors remain largely unclear for this heterogeneous disorder. In recent years, high throughput methodologies, such as genome-wide linkage analysis (GWL), genome-wide association (GWA) studies, and genome-wide expression profiling (GWE), have led to the identification of several candidate genes associated with AD. However, due to lack of consistency within their findings, an integrative approach is warranted. Here, we have designed a rank based gene prioritization approach involving convergent analysis of multi-dimensional data and protein-protein interaction (PPI) network modelling. RESULTS: Our approach employs integration of three different AD datasets- GWL,GWA and GWE to identify overlapping candidate genes ranked using a novel cumulative rank score (S(R)) based method followed by prioritization using clusters derived from PPI network. S(R) for each gene is calculated by addition of rank assigned to individual gene based on either p value or score in three datasets. This analysis yielded 108 plausible AD genes. Network modelling by creating PPI using proteins encoded by these genes and their direct interactors resulted in a layered network of 640 proteins. Clustering of these proteins further helped us in identifying 6 significant clusters with 7 proteins (EGFR, ACTB, CDC2, IRAK1, APOE, ABCA1 and AMPH) forming the central hub nodes. Functional annotation of 108 genes revealed their role in several biological activities such as neurogenesis, regulation of MAP kinase activity, response to calcium ion, endocytosis paralleling the AD specific attributes. Finally, 3 potential biochemical biomarkers were found from the overlap of 108 AD proteins with proteins from CSF and plasma proteome. EGFR and ACTB were found to be the two most significant AD risk genes. CONCLUSIONS: With the assumption that common genetic signals obtained from different methodological platforms might serve as robust AD risk markers than candidates identified using single dimension approach, here we demonstrated an integrated genomic convergence approach for disease candidate gene prioritization from heterogeneous data sources linked to AD. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-199) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4028079
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-40280792014-05-30 Genomic convergence and network analysis approach to identify candidate genes in Alzheimer's disease Talwar, Puneet Silla, Yumnam Grover, Sandeep Gupta, Meenal Agarwal, Rachna Kushwaha, Suman Kukreti, Ritushree BMC Genomics Research Article BACKGROUND: Alzheimer’s disease (AD) is one of the leading genetically complex and heterogeneous disorder that is influenced by both genetic and environmental factors. The underlying risk factors remain largely unclear for this heterogeneous disorder. In recent years, high throughput methodologies, such as genome-wide linkage analysis (GWL), genome-wide association (GWA) studies, and genome-wide expression profiling (GWE), have led to the identification of several candidate genes associated with AD. However, due to lack of consistency within their findings, an integrative approach is warranted. Here, we have designed a rank based gene prioritization approach involving convergent analysis of multi-dimensional data and protein-protein interaction (PPI) network modelling. RESULTS: Our approach employs integration of three different AD datasets- GWL,GWA and GWE to identify overlapping candidate genes ranked using a novel cumulative rank score (S(R)) based method followed by prioritization using clusters derived from PPI network. S(R) for each gene is calculated by addition of rank assigned to individual gene based on either p value or score in three datasets. This analysis yielded 108 plausible AD genes. Network modelling by creating PPI using proteins encoded by these genes and their direct interactors resulted in a layered network of 640 proteins. Clustering of these proteins further helped us in identifying 6 significant clusters with 7 proteins (EGFR, ACTB, CDC2, IRAK1, APOE, ABCA1 and AMPH) forming the central hub nodes. Functional annotation of 108 genes revealed their role in several biological activities such as neurogenesis, regulation of MAP kinase activity, response to calcium ion, endocytosis paralleling the AD specific attributes. Finally, 3 potential biochemical biomarkers were found from the overlap of 108 AD proteins with proteins from CSF and plasma proteome. EGFR and ACTB were found to be the two most significant AD risk genes. CONCLUSIONS: With the assumption that common genetic signals obtained from different methodological platforms might serve as robust AD risk markers than candidates identified using single dimension approach, here we demonstrated an integrated genomic convergence approach for disease candidate gene prioritization from heterogeneous data sources linked to AD. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-199) contains supplementary material, which is available to authorized users. BioMed Central 2014-03-15 /pmc/articles/PMC4028079/ /pubmed/24628925 http://dx.doi.org/10.1186/1471-2164-15-199 Text en © Talwar et al.; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited.
spellingShingle Research Article
Talwar, Puneet
Silla, Yumnam
Grover, Sandeep
Gupta, Meenal
Agarwal, Rachna
Kushwaha, Suman
Kukreti, Ritushree
Genomic convergence and network analysis approach to identify candidate genes in Alzheimer's disease
title Genomic convergence and network analysis approach to identify candidate genes in Alzheimer's disease
title_full Genomic convergence and network analysis approach to identify candidate genes in Alzheimer's disease
title_fullStr Genomic convergence and network analysis approach to identify candidate genes in Alzheimer's disease
title_full_unstemmed Genomic convergence and network analysis approach to identify candidate genes in Alzheimer's disease
title_short Genomic convergence and network analysis approach to identify candidate genes in Alzheimer's disease
title_sort genomic convergence and network analysis approach to identify candidate genes in alzheimer's disease
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4028079/
https://www.ncbi.nlm.nih.gov/pubmed/24628925
http://dx.doi.org/10.1186/1471-2164-15-199
work_keys_str_mv AT talwarpuneet genomicconvergenceandnetworkanalysisapproachtoidentifycandidategenesinalzheimersdisease
AT sillayumnam genomicconvergenceandnetworkanalysisapproachtoidentifycandidategenesinalzheimersdisease
AT groversandeep genomicconvergenceandnetworkanalysisapproachtoidentifycandidategenesinalzheimersdisease
AT guptameenal genomicconvergenceandnetworkanalysisapproachtoidentifycandidategenesinalzheimersdisease
AT agarwalrachna genomicconvergenceandnetworkanalysisapproachtoidentifycandidategenesinalzheimersdisease
AT kushwahasuman genomicconvergenceandnetworkanalysisapproachtoidentifycandidategenesinalzheimersdisease
AT kukretiritushree genomicconvergenceandnetworkanalysisapproachtoidentifycandidategenesinalzheimersdisease