Cargando…

Effects of protein interaction data integration, representation and reliability on the use of network properties for drug target prediction

BACKGROUND: Previous studies have noted that drug targets appear to be associated with higher-degree or higher-centrality proteins in interaction networks. These studies explicitly or tacitly make choices of different source databases, data integration strategies, representation of proteins and comp...

Descripción completa

Detalles Bibliográficos
Autores principales: Mora, Antonio, Donaldson, Ian M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3534413/
https://www.ncbi.nlm.nih.gov/pubmed/23146171
http://dx.doi.org/10.1186/1471-2105-13-294
_version_ 1782475334362333184
author Mora, Antonio
Donaldson, Ian M
author_facet Mora, Antonio
Donaldson, Ian M
author_sort Mora, Antonio
collection PubMed
description BACKGROUND: Previous studies have noted that drug targets appear to be associated with higher-degree or higher-centrality proteins in interaction networks. These studies explicitly or tacitly make choices of different source databases, data integration strategies, representation of proteins and complexes, and data reliability assumptions. Here we examined how the use of different data integration and representation techniques, or different notions of reliability, may affect the efficacy of degree and centrality as features in drug target prediction. RESULTS: Fifty percent of drug targets have a degree of less than nine, and ninety-five percent have a degree of less than ninety. We found that drug targets are over-represented in higher degree bins – this relationship is only seen for the consolidated interactome and it is not dependent on n-ary interaction data or its representation. Degree acts as a weak predictive feature for drug-target status and using more reliable subsets of the data does not increase this performance. However, performance does increase if only cancer-related drug targets are considered. We also note that a protein’s membership in pathway records can act as a predictive feature that is better than degree and that high-centrality may be an indicator of a drug that is more likely to be withdrawn. CONCLUSIONS: These results show that protein interaction data integration and cleaning is an important consideration when incorporating network properties as predictive features for drug-target status. The provided scripts and data sets offer a starting point for further studies and cross-comparison of methods.
format Online
Article
Text
id pubmed-3534413
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35344132013-01-03 Effects of protein interaction data integration, representation and reliability on the use of network properties for drug target prediction Mora, Antonio Donaldson, Ian M BMC Bioinformatics Research Article BACKGROUND: Previous studies have noted that drug targets appear to be associated with higher-degree or higher-centrality proteins in interaction networks. These studies explicitly or tacitly make choices of different source databases, data integration strategies, representation of proteins and complexes, and data reliability assumptions. Here we examined how the use of different data integration and representation techniques, or different notions of reliability, may affect the efficacy of degree and centrality as features in drug target prediction. RESULTS: Fifty percent of drug targets have a degree of less than nine, and ninety-five percent have a degree of less than ninety. We found that drug targets are over-represented in higher degree bins – this relationship is only seen for the consolidated interactome and it is not dependent on n-ary interaction data or its representation. Degree acts as a weak predictive feature for drug-target status and using more reliable subsets of the data does not increase this performance. However, performance does increase if only cancer-related drug targets are considered. We also note that a protein’s membership in pathway records can act as a predictive feature that is better than degree and that high-centrality may be an indicator of a drug that is more likely to be withdrawn. CONCLUSIONS: These results show that protein interaction data integration and cleaning is an important consideration when incorporating network properties as predictive features for drug-target status. The provided scripts and data sets offer a starting point for further studies and cross-comparison of methods. BioMed Central 2012-11-12 /pmc/articles/PMC3534413/ /pubmed/23146171 http://dx.doi.org/10.1186/1471-2105-13-294 Text en Copyright ©2012 Mora and Donaldson; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Mora, Antonio
Donaldson, Ian M
Effects of protein interaction data integration, representation and reliability on the use of network properties for drug target prediction
title Effects of protein interaction data integration, representation and reliability on the use of network properties for drug target prediction
title_full Effects of protein interaction data integration, representation and reliability on the use of network properties for drug target prediction
title_fullStr Effects of protein interaction data integration, representation and reliability on the use of network properties for drug target prediction
title_full_unstemmed Effects of protein interaction data integration, representation and reliability on the use of network properties for drug target prediction
title_short Effects of protein interaction data integration, representation and reliability on the use of network properties for drug target prediction
title_sort effects of protein interaction data integration, representation and reliability on the use of network properties for drug target prediction
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3534413/
https://www.ncbi.nlm.nih.gov/pubmed/23146171
http://dx.doi.org/10.1186/1471-2105-13-294
work_keys_str_mv AT moraantonio effectsofproteininteractiondataintegrationrepresentationandreliabilityontheuseofnetworkpropertiesfordrugtargetprediction
AT donaldsonianm effectsofproteininteractiondataintegrationrepresentationandreliabilityontheuseofnetworkpropertiesfordrugtargetprediction