Cargando…

Drug Target Identification with Machine Learning: How to Choose Negative Examples

Identification of the protein targets of hit molecules is essential in the drug discovery process. Target prediction with machine learning algorithms can help accelerate this search, limiting the number of required experiments. However, Drug-Target Interactions databases used for training present hi...

Descripción completa

Detalles Bibliográficos
Autores principales:	Najm, Matthieu, Azencott, Chloé-Agathe, Playe, Benoit, Stoven, Véronique
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8151112/ https://www.ncbi.nlm.nih.gov/pubmed/34066072 http://dx.doi.org/10.3390/ijms22105118

_version_	1783698306876047360
author	Najm, Matthieu Azencott, Chloé-Agathe Playe, Benoit Stoven, Véronique
author_facet	Najm, Matthieu Azencott, Chloé-Agathe Playe, Benoit Stoven, Véronique
author_sort	Najm, Matthieu
collection	PubMed
description	Identification of the protein targets of hit molecules is essential in the drug discovery process. Target prediction with machine learning algorithms can help accelerate this search, limiting the number of required experiments. However, Drug-Target Interactions databases used for training present high statistical bias, leading to a high number of false positives, thus increasing time and cost of experimental validation campaigns. To minimize the number of false positives among predicted targets, we propose a new scheme for choosing negative examples, so that each protein and each drug appears an equal number of times in positive and negative examples. We artificially reproduce the process of target identification for three specific drugs, and more globally for 200 approved drugs. For the detailed three drug examples, and for the larger set of 200 drugs, training with the proposed scheme for the choice of negative examples improved target prediction results: the average number of false positives among the top ranked predicted targets decreased, and overall, the rank of the true targets was improved.Our method corrects databases’ statistical bias and reduces the number of false positive predictions, and therefore the number of useless experiments potentially undertaken.
format	Online Article Text
id	pubmed-8151112
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-81511122021-05-27 Drug Target Identification with Machine Learning: How to Choose Negative Examples Najm, Matthieu Azencott, Chloé-Agathe Playe, Benoit Stoven, Véronique Int J Mol Sci Article Identification of the protein targets of hit molecules is essential in the drug discovery process. Target prediction with machine learning algorithms can help accelerate this search, limiting the number of required experiments. However, Drug-Target Interactions databases used for training present high statistical bias, leading to a high number of false positives, thus increasing time and cost of experimental validation campaigns. To minimize the number of false positives among predicted targets, we propose a new scheme for choosing negative examples, so that each protein and each drug appears an equal number of times in positive and negative examples. We artificially reproduce the process of target identification for three specific drugs, and more globally for 200 approved drugs. For the detailed three drug examples, and for the larger set of 200 drugs, training with the proposed scheme for the choice of negative examples improved target prediction results: the average number of false positives among the top ranked predicted targets decreased, and overall, the rank of the true targets was improved.Our method corrects databases’ statistical bias and reduces the number of false positive predictions, and therefore the number of useless experiments potentially undertaken. MDPI 2021-05-12 /pmc/articles/PMC8151112/ /pubmed/34066072 http://dx.doi.org/10.3390/ijms22105118 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Najm, Matthieu Azencott, Chloé-Agathe Playe, Benoit Stoven, Véronique Drug Target Identification with Machine Learning: How to Choose Negative Examples
title	Drug Target Identification with Machine Learning: How to Choose Negative Examples
title_full	Drug Target Identification with Machine Learning: How to Choose Negative Examples
title_fullStr	Drug Target Identification with Machine Learning: How to Choose Negative Examples
title_full_unstemmed	Drug Target Identification with Machine Learning: How to Choose Negative Examples
title_short	Drug Target Identification with Machine Learning: How to Choose Negative Examples
title_sort	drug target identification with machine learning: how to choose negative examples
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8151112/ https://www.ncbi.nlm.nih.gov/pubmed/34066072 http://dx.doi.org/10.3390/ijms22105118
work_keys_str_mv	AT najmmatthieu drugtargetidentificationwithmachinelearninghowtochoosenegativeexamples AT azencottchloeagathe drugtargetidentificationwithmachinelearninghowtochoosenegativeexamples AT playebenoit drugtargetidentificationwithmachinelearninghowtochoosenegativeexamples AT stovenveronique drugtargetidentificationwithmachinelearninghowtochoosenegativeexamples

Drug Target Identification with Machine Learning: How to Choose Negative Examples

Ejemplares similares