Cargando…

Network Analysis of Differential Expression for the Identification of Disease-Causing Genes

Genetic studies (in particular linkage and association studies) identify chromosomal regions involved in a disease or phenotype of interest, but those regions often contain many candidate genes, only a few of which can be followed-up for biological validation. Recently, computational methods to iden...

Descripción completa

Detalles Bibliográficos
Autores principales: Nitsch, Daniela, Tranchevent, Léon-Charles, Thienpont, Bernard, Thorrez, Lieven, Van Esch, Hilde, Devriendt, Koenraad, Moreau, Yves
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2677677/
https://www.ncbi.nlm.nih.gov/pubmed/19436755
http://dx.doi.org/10.1371/journal.pone.0005526
_version_ 1782166792914862080
author Nitsch, Daniela
Tranchevent, Léon-Charles
Thienpont, Bernard
Thorrez, Lieven
Van Esch, Hilde
Devriendt, Koenraad
Moreau, Yves
author_facet Nitsch, Daniela
Tranchevent, Léon-Charles
Thienpont, Bernard
Thorrez, Lieven
Van Esch, Hilde
Devriendt, Koenraad
Moreau, Yves
author_sort Nitsch, Daniela
collection PubMed
description Genetic studies (in particular linkage and association studies) identify chromosomal regions involved in a disease or phenotype of interest, but those regions often contain many candidate genes, only a few of which can be followed-up for biological validation. Recently, computational methods to identify (prioritize) the most promising candidates within a region have been proposed, but they are usually not applicable to cases where little is known about the phenotype (no or few confirmed disease genes, fragmentary understanding of the biological cascades involved). We seek to overcome this limitation by replacing knowledge about the biological process by experimental data on differential gene expression between affected and healthy individuals. Considering the problem from the perspective of a gene/protein network, we assess a candidate gene by considering the level of differential expression in its neighborhood under the assumption that strong candidates will tend to be surrounded by differentially expressed neighbors. We define a notion of soft neighborhood where each gene is given a contributing weight, which decreases with the distance from the candidate gene on the protein network. To account for multiple paths between genes, we define the distance using the Laplacian exponential diffusion kernel. We score candidates by aggregating the differential expression of neighbors weighted as a function of distance. Through a randomization procedure, we rank candidates by p-values. We illustrate our approach on four monogenic diseases and successfully prioritize the known disease causing genes.
format Text
id pubmed-2677677
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-26776772009-05-13 Network Analysis of Differential Expression for the Identification of Disease-Causing Genes Nitsch, Daniela Tranchevent, Léon-Charles Thienpont, Bernard Thorrez, Lieven Van Esch, Hilde Devriendt, Koenraad Moreau, Yves PLoS One Research Article Genetic studies (in particular linkage and association studies) identify chromosomal regions involved in a disease or phenotype of interest, but those regions often contain many candidate genes, only a few of which can be followed-up for biological validation. Recently, computational methods to identify (prioritize) the most promising candidates within a region have been proposed, but they are usually not applicable to cases where little is known about the phenotype (no or few confirmed disease genes, fragmentary understanding of the biological cascades involved). We seek to overcome this limitation by replacing knowledge about the biological process by experimental data on differential gene expression between affected and healthy individuals. Considering the problem from the perspective of a gene/protein network, we assess a candidate gene by considering the level of differential expression in its neighborhood under the assumption that strong candidates will tend to be surrounded by differentially expressed neighbors. We define a notion of soft neighborhood where each gene is given a contributing weight, which decreases with the distance from the candidate gene on the protein network. To account for multiple paths between genes, we define the distance using the Laplacian exponential diffusion kernel. We score candidates by aggregating the differential expression of neighbors weighted as a function of distance. Through a randomization procedure, we rank candidates by p-values. We illustrate our approach on four monogenic diseases and successfully prioritize the known disease causing genes. Public Library of Science 2009-05-13 /pmc/articles/PMC2677677/ /pubmed/19436755 http://dx.doi.org/10.1371/journal.pone.0005526 Text en Nitsch et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Nitsch, Daniela
Tranchevent, Léon-Charles
Thienpont, Bernard
Thorrez, Lieven
Van Esch, Hilde
Devriendt, Koenraad
Moreau, Yves
Network Analysis of Differential Expression for the Identification of Disease-Causing Genes
title Network Analysis of Differential Expression for the Identification of Disease-Causing Genes
title_full Network Analysis of Differential Expression for the Identification of Disease-Causing Genes
title_fullStr Network Analysis of Differential Expression for the Identification of Disease-Causing Genes
title_full_unstemmed Network Analysis of Differential Expression for the Identification of Disease-Causing Genes
title_short Network Analysis of Differential Expression for the Identification of Disease-Causing Genes
title_sort network analysis of differential expression for the identification of disease-causing genes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2677677/
https://www.ncbi.nlm.nih.gov/pubmed/19436755
http://dx.doi.org/10.1371/journal.pone.0005526
work_keys_str_mv AT nitschdaniela networkanalysisofdifferentialexpressionfortheidentificationofdiseasecausinggenes
AT trancheventleoncharles networkanalysisofdifferentialexpressionfortheidentificationofdiseasecausinggenes
AT thienpontbernard networkanalysisofdifferentialexpressionfortheidentificationofdiseasecausinggenes
AT thorrezlieven networkanalysisofdifferentialexpressionfortheidentificationofdiseasecausinggenes
AT vaneschhilde networkanalysisofdifferentialexpressionfortheidentificationofdiseasecausinggenes
AT devriendtkoenraad networkanalysisofdifferentialexpressionfortheidentificationofdiseasecausinggenes
AT moreauyves networkanalysisofdifferentialexpressionfortheidentificationofdiseasecausinggenes