
Machine learning prediction of oncology drug targets based on protein and network properties

BACKGROUND: The selection and prioritization of drug targets is a central problem in drug discovery. Computational approaches can leverage the growing volume of large-scale human genomics and proteomics data to identify targets in silico, reducing the cost and time needed. RESULTS: We...


Bibliographic Details
Main authors:	Dezső, Zoltán; Ceccarelli, Michele
Format:	Online Article Text
Language:	English
Published:	BioMed Central 2020
Subjects:	Methodology Article
Online access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7071582/ https://www.ncbi.nlm.nih.gov/pubmed/32171238 http://dx.doi.org/10.1186/s12859-020-3442-9

_version_	1783506234547109888
author	Dezső, Zoltán Ceccarelli, Michele
author_facet	Dezső, Zoltán Ceccarelli, Michele
author_sort	Dezső, Zoltán
collection	PubMed
description	BACKGROUND: The selection and prioritization of drug targets is a central problem in drug discovery. Computational approaches can leverage the growing volume of large-scale human genomics and proteomics data to identify targets in silico, reducing the cost and time needed. RESULTS: We developed a machine learning approach that scores proteins, generating a druggability score for novel targets. In our model we incorporated 70 protein features, including properties derived from the sequence, features characterizing protein function, and network properties derived from the protein-protein interaction network. The advantage of this approach is that it is unbiased: even less-studied proteins with limited information about their function can score well, as most of the features are independent of the accumulated literature. We built models on a training set consisting of targets with approved drugs and a negative set of non-drug targets. The machine learning techniques help to identify the most important combination of features differentiating validated targets from non-targets. We validated our predictions on an independent set of clinical trial drug targets, achieving high accuracy, characterized by an area under the curve (AUC) of 0.89. Our most predictive features included biological function of proteins, network centrality measures, protein essentiality, tissue specificity, localization and solvent accessibility. Our predictions, based on a small set of 102 validated oncology targets, recovered the majority of known drug targets and identified a novel set of proteins as drug target candidates. CONCLUSIONS: We developed a machine learning approach to prioritize proteins according to their similarity to approved drug targets. We have shown that the proposed method is highly predictive on a validation dataset consisting of 277 clinical trial drug targets, confirming that our computational approach is an efficient and cost-effective tool for drug target discovery and prioritization. Our predictions were based on oncology targets and cancer-relevant biological functions, resulting in significantly higher scores for targets of oncology clinical trial drugs than for targets of trial drugs for other indications. Our approach can be used to make indication-specific drug-target predictions by combining generic druggability features with indication-specific biological functions.
format	Online Article Text
id	pubmed-7071582
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-7071582 2020-03-18 Machine learning prediction of oncology drug targets based on protein and network properties Dezső, Zoltán Ceccarelli, Michele BMC Bioinformatics Methodology Article BioMed Central 2020-03-14 /pmc/articles/PMC7071582/ /pubmed/32171238 http://dx.doi.org/10.1186/s12859-020-3442-9 Text en © The Author(s). 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle	Methodology Article Dezső, Zoltán Ceccarelli, Michele Machine learning prediction of oncology drug targets based on protein and network properties
title	Machine learning prediction of oncology drug targets based on protein and network properties
title_sort	machine learning prediction of oncology drug targets based on protein and network properties
topic	Methodology Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7071582/ https://www.ncbi.nlm.nih.gov/pubmed/32171238 http://dx.doi.org/10.1186/s12859-020-3442-9
work_keys_str_mv	AT dezsozoltan machinelearningpredictionofoncologydrugtargetsbasedonproteinandnetworkproperties AT ceccarellimichele machinelearningpredictionofoncologydrugtargetsbasedonproteinandnetworkproperties
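
The description field above reports validation accuracy as an area under the curve (AUC) of 0.89. As a rough illustration of what that metric measures, the sketch below computes AUC as the fraction of (target, non-target) pairs in which the target receives the higher score. The function name `roc_auc` and the toy labels and scores are hypothetical and invented for this example; they are not the article's code or data.

```python
from typing import Sequence


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Pairwise AUC: probability that a random positive outscores a random negative.

    Ties between a positive and a negative count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Made-up druggability scores, for illustration only (1 = target, 0 = non-target).
toy_labels = [1, 1, 1, 0, 0, 0, 0]
toy_scores = [0.9, 0.8, 0.4, 0.7, 0.3, 0.2, 0.1]
toy_auc = roc_auc(toy_labels, toy_scores)
print(f"toy AUC = {toy_auc:.3f}")
```

An AUC of 0.5 corresponds to random ranking and 1.0 to perfect separation. The pairwise loop is quadratic in the number of examples, which is fine for a sketch; a rank-based formulation would scale better for large sets.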
