Cargando…

An Integrated Data Driven Approach to Drug Repositioning Using Gene-Disease Associations

Drug development is both increasing in cost whilst decreasing in productivity. There is a general acceptance that the current paradigm of R&D needs to change. One alternative approach is drug repositioning. With target-based approaches utilised heavily in the field of drug discovery, it becomes...

Descripción completa

Detalles Bibliográficos
Autores principales:	Mullen, Joseph, Cockell, Simon J., Woollard, Peter, Wipat, Anil
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2016
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4873016/ https://www.ncbi.nlm.nih.gov/pubmed/27196054 http://dx.doi.org/10.1371/journal.pone.0155811

_version_	1782432821365702656
author	Mullen, Joseph Cockell, Simon J. Woollard, Peter Wipat, Anil
author_facet	Mullen, Joseph Cockell, Simon J. Woollard, Peter Wipat, Anil
author_sort	Mullen, Joseph
collection	PubMed
description	Drug development is both increasing in cost whilst decreasing in productivity. There is a general acceptance that the current paradigm of R&D needs to change. One alternative approach is drug repositioning. With target-based approaches utilised heavily in the field of drug discovery, it becomes increasingly necessary to have a systematic method to rank gene-disease associations. Although methods already exist to collect, integrate and score these associations, they are often not a reliable reflection of expert knowledge. Furthermore, the amount of data available in all areas covered by bioinformatics is increasing dramatically year on year. It thus makes sense to move away from more generalised hypothesis driven approaches to research to one that allows data to generate their own hypothesis. We introduce an integrated, data driven approach to drug repositioning. We first apply a Bayesian statistics approach to rank 309,885 gene-disease associations using existing knowledge. Ranked associations are then integrated with other biological data to produce a semantically-rich drug discovery network. Using this network, we show how our approach identifies diseases of the central nervous system (CNS) to be an area of interest. CNS disorders are identified due to the low numbers of such disorders that currently have marketed treatments, in comparison to other therapeutic areas. We then systematically mine our network for semantic subgraphs that allow us to infer drug-disease relations that are not captured in the network. We identify and rank 275,934 drug-disease has_indication associations after filtering those that are more likely to be side effects, whilst commenting on the top ranked associations in more detail. The dataset has been created in Neo4j and is available for download at https://bitbucket.org/ncl-intbio/genediseaserepositioning along with a Java implementation of the searching algorithm.
format	Online Article Text
id	pubmed-4873016
institution	National Center for Biotechnology Information
language	English
publishDate	2016
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-48730162016-06-09 An Integrated Data Driven Approach to Drug Repositioning Using Gene-Disease Associations Mullen, Joseph Cockell, Simon J. Woollard, Peter Wipat, Anil PLoS One Research Article Drug development is both increasing in cost whilst decreasing in productivity. There is a general acceptance that the current paradigm of R&D needs to change. One alternative approach is drug repositioning. With target-based approaches utilised heavily in the field of drug discovery, it becomes increasingly necessary to have a systematic method to rank gene-disease associations. Although methods already exist to collect, integrate and score these associations, they are often not a reliable reflection of expert knowledge. Furthermore, the amount of data available in all areas covered by bioinformatics is increasing dramatically year on year. It thus makes sense to move away from more generalised hypothesis driven approaches to research to one that allows data to generate their own hypothesis. We introduce an integrated, data driven approach to drug repositioning. We first apply a Bayesian statistics approach to rank 309,885 gene-disease associations using existing knowledge. Ranked associations are then integrated with other biological data to produce a semantically-rich drug discovery network. Using this network, we show how our approach identifies diseases of the central nervous system (CNS) to be an area of interest. CNS disorders are identified due to the low numbers of such disorders that currently have marketed treatments, in comparison to other therapeutic areas. We then systematically mine our network for semantic subgraphs that allow us to infer drug-disease relations that are not captured in the network. We identify and rank 275,934 drug-disease has_indication associations after filtering those that are more likely to be side effects, whilst commenting on the top ranked associations in more detail. The dataset has been created in Neo4j and is available for download at https://bitbucket.org/ncl-intbio/genediseaserepositioning along with a Java implementation of the searching algorithm. Public Library of Science 2016-05-19 /pmc/articles/PMC4873016/ /pubmed/27196054 http://dx.doi.org/10.1371/journal.pone.0155811 Text en © 2016 Mullen et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Mullen, Joseph Cockell, Simon J. Woollard, Peter Wipat, Anil An Integrated Data Driven Approach to Drug Repositioning Using Gene-Disease Associations
title	An Integrated Data Driven Approach to Drug Repositioning Using Gene-Disease Associations
title_full	An Integrated Data Driven Approach to Drug Repositioning Using Gene-Disease Associations
title_fullStr	An Integrated Data Driven Approach to Drug Repositioning Using Gene-Disease Associations
title_full_unstemmed	An Integrated Data Driven Approach to Drug Repositioning Using Gene-Disease Associations
title_short	An Integrated Data Driven Approach to Drug Repositioning Using Gene-Disease Associations
title_sort	integrated data driven approach to drug repositioning using gene-disease associations
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4873016/ https://www.ncbi.nlm.nih.gov/pubmed/27196054 http://dx.doi.org/10.1371/journal.pone.0155811
work_keys_str_mv	AT mullenjoseph anintegrateddatadrivenapproachtodrugrepositioningusinggenediseaseassociations AT cockellsimonj anintegrateddatadrivenapproachtodrugrepositioningusinggenediseaseassociations AT woollardpeter anintegrateddatadrivenapproachtodrugrepositioningusinggenediseaseassociations AT wipatanil anintegrateddatadrivenapproachtodrugrepositioningusinggenediseaseassociations AT mullenjoseph integrateddatadrivenapproachtodrugrepositioningusinggenediseaseassociations AT cockellsimonj integrateddatadrivenapproachtodrugrepositioningusinggenediseaseassociations AT woollardpeter integrateddatadrivenapproachtodrugrepositioningusinggenediseaseassociations AT wipatanil integrateddatadrivenapproachtodrugrepositioningusinggenediseaseassociations

An Integrated Data Driven Approach to Drug Repositioning Using Gene-Disease Associations

Ejemplares similares