Cargando…

Disease gene prediction for molecularly uncharacterized diseases

Network medicine approaches have been largely successful at increasing our knowledge of molecularly characterized diseases. Given a set of disease genes associated with a disease, neighbourhood-based methods and random walkers exploit the interactome allowing the prediction of further genes for that...

Descripción completa

Detalles Bibliográficos
Autores principales: Cáceres, Juan J., Paccanaro, Alberto
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6636748/
https://www.ncbi.nlm.nih.gov/pubmed/31276496
http://dx.doi.org/10.1371/journal.pcbi.1007078
_version_ 1783436118397550592
author Cáceres, Juan J.
Paccanaro, Alberto
author_facet Cáceres, Juan J.
Paccanaro, Alberto
author_sort Cáceres, Juan J.
collection PubMed
description Network medicine approaches have been largely successful at increasing our knowledge of molecularly characterized diseases. Given a set of disease genes associated with a disease, neighbourhood-based methods and random walkers exploit the interactome allowing the prediction of further genes for that disease. In general, however, diseases with no known molecular basis constitute a challenge. Here we present a novel network approach to prioritize gene-disease associations that is able to also predict genes for diseases with no known molecular basis. Our method, which we have called Cardigan (ChARting DIsease Gene AssociatioNs), uses semi-supervised learning and exploits a measure of similarity between disease phenotypes. We evaluated its performance at predicting genes for both molecularly characterized and uncharacterized diseases in OMIM, using both weighted and binary interactomes, and compared it with state-of-the-art methods. Our tests, which use datasets collected at different points in time to replicate the dynamics of the disease gene discovery process, prove that Cardigan is able to accurately predict disease genes for molecularly uncharacterized diseases. Additionally, standard leave-one-out cross validation tests show how our approach outperforms state-of-the-art methods at predicting genes for molecularly characterized diseases by 14%-65%. Cardigan can also be used for disease module prediction, where it outperforms state-of-the-art methods by 87%-299%.
format Online
Article
Text
id pubmed-6636748
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-66367482019-07-25 Disease gene prediction for molecularly uncharacterized diseases Cáceres, Juan J. Paccanaro, Alberto PLoS Comput Biol Research Article Network medicine approaches have been largely successful at increasing our knowledge of molecularly characterized diseases. Given a set of disease genes associated with a disease, neighbourhood-based methods and random walkers exploit the interactome allowing the prediction of further genes for that disease. In general, however, diseases with no known molecular basis constitute a challenge. Here we present a novel network approach to prioritize gene-disease associations that is able to also predict genes for diseases with no known molecular basis. Our method, which we have called Cardigan (ChARting DIsease Gene AssociatioNs), uses semi-supervised learning and exploits a measure of similarity between disease phenotypes. We evaluated its performance at predicting genes for both molecularly characterized and uncharacterized diseases in OMIM, using both weighted and binary interactomes, and compared it with state-of-the-art methods. Our tests, which use datasets collected at different points in time to replicate the dynamics of the disease gene discovery process, prove that Cardigan is able to accurately predict disease genes for molecularly uncharacterized diseases. Additionally, standard leave-one-out cross validation tests show how our approach outperforms state-of-the-art methods at predicting genes for molecularly characterized diseases by 14%-65%. Cardigan can also be used for disease module prediction, where it outperforms state-of-the-art methods by 87%-299%. Public Library of Science 2019-07-05 /pmc/articles/PMC6636748/ /pubmed/31276496 http://dx.doi.org/10.1371/journal.pcbi.1007078 Text en © 2019 Cáceres, Paccanaro http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Cáceres, Juan J.
Paccanaro, Alberto
Disease gene prediction for molecularly uncharacterized diseases
title Disease gene prediction for molecularly uncharacterized diseases
title_full Disease gene prediction for molecularly uncharacterized diseases
title_fullStr Disease gene prediction for molecularly uncharacterized diseases
title_full_unstemmed Disease gene prediction for molecularly uncharacterized diseases
title_short Disease gene prediction for molecularly uncharacterized diseases
title_sort disease gene prediction for molecularly uncharacterized diseases
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6636748/
https://www.ncbi.nlm.nih.gov/pubmed/31276496
http://dx.doi.org/10.1371/journal.pcbi.1007078
work_keys_str_mv AT caceresjuanj diseasegenepredictionformolecularlyuncharacterizeddiseases
AT paccanaroalberto diseasegenepredictionformolecularlyuncharacterizeddiseases