Cargando…

Data Integration Workflow for Search of Disease Driving Genes and Genetic Variants

Comprehensive characterization of a gene's impact on phenotypes requires knowledge of the context of the gene. To address this issue we introduce a systematic data integration method Candidate Genes and SNPs (CANGES) that links SNP and linkage disequilibrium data to pathway- and protein-protein...

Descripción completa

Detalles Bibliográficos
Autores principales: Karinen, Sirkku, Heikkinen, Tuomas, Nevanlinna, Heli, Hautaniemi, Sampsa
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3075259/
https://www.ncbi.nlm.nih.gov/pubmed/21533266
http://dx.doi.org/10.1371/journal.pone.0018636
_version_ 1782201760782221312
author Karinen, Sirkku
Heikkinen, Tuomas
Nevanlinna, Heli
Hautaniemi, Sampsa
author_facet Karinen, Sirkku
Heikkinen, Tuomas
Nevanlinna, Heli
Hautaniemi, Sampsa
author_sort Karinen, Sirkku
collection PubMed
description Comprehensive characterization of a gene's impact on phenotypes requires knowledge of the context of the gene. To address this issue we introduce a systematic data integration method Candidate Genes and SNPs (CANGES) that links SNP and linkage disequilibrium data to pathway- and protein-protein interaction information. It can be used as a knowledge discovery tool for the search of disease associated causative variants from genome-wide studies as well as to generate new hypotheses on synergistically functioning genes. We demonstrate the utility of CANGES by integrating pathway and protein-protein interaction data to identify putative functional variants for (i) the p53 gene and (ii) three glioblastoma multiforme (GBM) associated risk genes. For the GBM case, we further integrate the CANGES results with clinical and genome-wide data for 209 GBM patients and identify genes having effects on GBM patient survival. Our results show that selecting a focused set of genes can result in information beyond the traditional genome-wide association approaches. Taken together, holistic approach to identify possible interacting genes and SNPs with CANGES provides a means to rapidly identify networks for any set of genes and generate novel hypotheses. CANGES is available in http://csbi.ltdk.helsinki.fi/CANGES/
format Text
id pubmed-3075259
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-30752592011-04-29 Data Integration Workflow for Search of Disease Driving Genes and Genetic Variants Karinen, Sirkku Heikkinen, Tuomas Nevanlinna, Heli Hautaniemi, Sampsa PLoS One Research Article Comprehensive characterization of a gene's impact on phenotypes requires knowledge of the context of the gene. To address this issue we introduce a systematic data integration method Candidate Genes and SNPs (CANGES) that links SNP and linkage disequilibrium data to pathway- and protein-protein interaction information. It can be used as a knowledge discovery tool for the search of disease associated causative variants from genome-wide studies as well as to generate new hypotheses on synergistically functioning genes. We demonstrate the utility of CANGES by integrating pathway and protein-protein interaction data to identify putative functional variants for (i) the p53 gene and (ii) three glioblastoma multiforme (GBM) associated risk genes. For the GBM case, we further integrate the CANGES results with clinical and genome-wide data for 209 GBM patients and identify genes having effects on GBM patient survival. Our results show that selecting a focused set of genes can result in information beyond the traditional genome-wide association approaches. Taken together, holistic approach to identify possible interacting genes and SNPs with CANGES provides a means to rapidly identify networks for any set of genes and generate novel hypotheses. CANGES is available in http://csbi.ltdk.helsinki.fi/CANGES/ Public Library of Science 2011-04-12 /pmc/articles/PMC3075259/ /pubmed/21533266 http://dx.doi.org/10.1371/journal.pone.0018636 Text en Karinen et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Karinen, Sirkku
Heikkinen, Tuomas
Nevanlinna, Heli
Hautaniemi, Sampsa
Data Integration Workflow for Search of Disease Driving Genes and Genetic Variants
title Data Integration Workflow for Search of Disease Driving Genes and Genetic Variants
title_full Data Integration Workflow for Search of Disease Driving Genes and Genetic Variants
title_fullStr Data Integration Workflow for Search of Disease Driving Genes and Genetic Variants
title_full_unstemmed Data Integration Workflow for Search of Disease Driving Genes and Genetic Variants
title_short Data Integration Workflow for Search of Disease Driving Genes and Genetic Variants
title_sort data integration workflow for search of disease driving genes and genetic variants
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3075259/
https://www.ncbi.nlm.nih.gov/pubmed/21533266
http://dx.doi.org/10.1371/journal.pone.0018636
work_keys_str_mv AT karinensirkku dataintegrationworkflowforsearchofdiseasedrivinggenesandgeneticvariants
AT heikkinentuomas dataintegrationworkflowforsearchofdiseasedrivinggenesandgeneticvariants
AT nevanlinnaheli dataintegrationworkflowforsearchofdiseasedrivinggenesandgeneticvariants
AT hautaniemisampsa dataintegrationworkflowforsearchofdiseasedrivinggenesandgeneticvariants