Cargando…

Drug2Gene: an exhaustive resource to explore effectively the drug-target relation network

BACKGROUND: Information about drug-target relations is at the heart of drug discovery. There are now dozens of databases providing drug-target interaction data with varying scope, and focus. Therefore, and due to the large chemical space, the overlap of the different data sets is surprisingly small....

Descripción completa

Detalles Bibliográficos
Autores principales: Roider, Helge G, Pavlova, Nadia, Kirov, Ivaylo, Slavov, Stoyan, Slavov, Todor, Uzunov, Zlatyo, Weiss, Bertram
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4234465/
https://www.ncbi.nlm.nih.gov/pubmed/24618344
http://dx.doi.org/10.1186/1471-2105-15-68
_version_ 1782344865586085888
author Roider, Helge G
Pavlova, Nadia
Kirov, Ivaylo
Slavov, Stoyan
Slavov, Todor
Uzunov, Zlatyo
Weiss, Bertram
author_facet Roider, Helge G
Pavlova, Nadia
Kirov, Ivaylo
Slavov, Stoyan
Slavov, Todor
Uzunov, Zlatyo
Weiss, Bertram
author_sort Roider, Helge G
collection PubMed
description BACKGROUND: Information about drug-target relations is at the heart of drug discovery. There are now dozens of databases providing drug-target interaction data with varying scope, and focus. Therefore, and due to the large chemical space, the overlap of the different data sets is surprisingly small. As searching through these sources manually is cumbersome, time-consuming and error-prone, integrating all the data is highly desirable. Despite a few attempts, integration has been hampered by the diversity of descriptions of compounds, and by the fact that the reported activity values, coming from different data sets, are not always directly comparable due to usage of different metrics or data formats. DESCRIPTION: We have built Drug2Gene, a knowledge base, which combines the compound/drug-gene/protein information from 19 publicly available databases. A key feature is our rigorous unification and standardization process which makes the data truly comparable on a large scale, allowing for the first time effective data mining in such a large knowledge corpus. As of version 3.2, Drug2Gene contains 4,372,290 unified relations between compounds and their targets most of which include reported bioactivity data. We extend this set with putative (i.e. homology-inferred) relations where sufficient sequence homology between proteins suggests they may bind to similar compounds. Drug2Gene provides powerful search functionalities, very flexible export procedures, and a user-friendly web interface. CONCLUSIONS: Drug2Gene v3.2 has become a mature and comprehensive knowledge base providing unified, standardized drug-target related information gathered from publicly available data sources. It can be used to integrate proprietary data sets with publicly available data sets. Its main goal is to be a ‘one-stop shop’ to identify tool compounds targeting a given gene product or for finding all known targets of a drug. Drug2Gene with its integrated data set of public compound-target relations is freely accessible without restrictions at http://www.drug2gene.com.
format Online
Article
Text
id pubmed-4234465
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42344652014-11-18 Drug2Gene: an exhaustive resource to explore effectively the drug-target relation network Roider, Helge G Pavlova, Nadia Kirov, Ivaylo Slavov, Stoyan Slavov, Todor Uzunov, Zlatyo Weiss, Bertram BMC Bioinformatics Database BACKGROUND: Information about drug-target relations is at the heart of drug discovery. There are now dozens of databases providing drug-target interaction data with varying scope, and focus. Therefore, and due to the large chemical space, the overlap of the different data sets is surprisingly small. As searching through these sources manually is cumbersome, time-consuming and error-prone, integrating all the data is highly desirable. Despite a few attempts, integration has been hampered by the diversity of descriptions of compounds, and by the fact that the reported activity values, coming from different data sets, are not always directly comparable due to usage of different metrics or data formats. DESCRIPTION: We have built Drug2Gene, a knowledge base, which combines the compound/drug-gene/protein information from 19 publicly available databases. A key feature is our rigorous unification and standardization process which makes the data truly comparable on a large scale, allowing for the first time effective data mining in such a large knowledge corpus. As of version 3.2, Drug2Gene contains 4,372,290 unified relations between compounds and their targets most of which include reported bioactivity data. We extend this set with putative (i.e. homology-inferred) relations where sufficient sequence homology between proteins suggests they may bind to similar compounds. Drug2Gene provides powerful search functionalities, very flexible export procedures, and a user-friendly web interface. CONCLUSIONS: Drug2Gene v3.2 has become a mature and comprehensive knowledge base providing unified, standardized drug-target related information gathered from publicly available data sources. It can be used to integrate proprietary data sets with publicly available data sets. Its main goal is to be a ‘one-stop shop’ to identify tool compounds targeting a given gene product or for finding all known targets of a drug. Drug2Gene with its integrated data set of public compound-target relations is freely accessible without restrictions at http://www.drug2gene.com. BioMed Central 2014-03-11 /pmc/articles/PMC4234465/ /pubmed/24618344 http://dx.doi.org/10.1186/1471-2105-15-68 Text en Copyright © 2014 Roider et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Database
Roider, Helge G
Pavlova, Nadia
Kirov, Ivaylo
Slavov, Stoyan
Slavov, Todor
Uzunov, Zlatyo
Weiss, Bertram
Drug2Gene: an exhaustive resource to explore effectively the drug-target relation network
title Drug2Gene: an exhaustive resource to explore effectively the drug-target relation network
title_full Drug2Gene: an exhaustive resource to explore effectively the drug-target relation network
title_fullStr Drug2Gene: an exhaustive resource to explore effectively the drug-target relation network
title_full_unstemmed Drug2Gene: an exhaustive resource to explore effectively the drug-target relation network
title_short Drug2Gene: an exhaustive resource to explore effectively the drug-target relation network
title_sort drug2gene: an exhaustive resource to explore effectively the drug-target relation network
topic Database
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4234465/
https://www.ncbi.nlm.nih.gov/pubmed/24618344
http://dx.doi.org/10.1186/1471-2105-15-68
work_keys_str_mv AT roiderhelgeg drug2geneanexhaustiveresourcetoexploreeffectivelythedrugtargetrelationnetwork
AT pavlovanadia drug2geneanexhaustiveresourcetoexploreeffectivelythedrugtargetrelationnetwork
AT kirovivaylo drug2geneanexhaustiveresourcetoexploreeffectivelythedrugtargetrelationnetwork
AT slavovstoyan drug2geneanexhaustiveresourcetoexploreeffectivelythedrugtargetrelationnetwork
AT slavovtodor drug2geneanexhaustiveresourcetoexploreeffectivelythedrugtargetrelationnetwork
AT uzunovzlatyo drug2geneanexhaustiveresourcetoexploreeffectivelythedrugtargetrelationnetwork
AT weissbertram drug2geneanexhaustiveresourcetoexploreeffectivelythedrugtargetrelationnetwork