
Integrative relational machine-learning for understanding drug side-effect profiles

BACKGROUND: Drug side effects represent a common reason for stopping drug development during clinical trials. Improving our ability to understand drug side effects is necessary to reduce attrition rates during drug development as well as the risk of discovering novel side effects in available drugs....


Bibliographic Details
Main authors:	Bresso, Emmanuel; Grisoni, Renaud; Marchetti, Gino; Karaboga, Arnaud Sinan; Souchet; Devignes, Marie-Dominique; Smaïl-Tabbone, Malika
Format:	Online Article Text
Language:	English
Published:	BioMed Central 2013
Subjects:	Research Article
Online access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3710241/ https://www.ncbi.nlm.nih.gov/pubmed/23802887 http://dx.doi.org/10.1186/1471-2105-14-207

author	Bresso, Emmanuel; Grisoni, Renaud; Marchetti, Gino; Karaboga, Arnaud Sinan; Souchet; Devignes, Marie-Dominique; Smaïl-Tabbone, Malika
author_sort	Bresso, Emmanuel
collection	PubMed
description	BACKGROUND: Drug side effects represent a common reason for stopping drug development during clinical trials. Improving our ability to understand drug side effects is necessary to reduce attrition rates during drug development as well as the risk of discovering novel side effects in available drugs. Today, most investigations deal with isolated side effects and overlook possible redundancy and their frequent co-occurrence. RESULTS: In this work, drug annotations are collected from the SIDER and DrugBank databases. Terms describing individual side effects reported in SIDER are clustered with a semantic similarity measure into term clusters (TCs). Maximal frequent itemsets are extracted from the resulting drug × TC binary table, leading to the identification of what we call side-effect profiles (SEPs). A SEP is defined as the longest combination of TCs that is shared by a significant number of drugs. Frequent SEPs are explored on the basis of integrated drug and target descriptors using two machine-learning methods: decision trees and inductive-logic programming. Although both methods yield explicit models, the inductive-logic-programming method performs relational learning and is able to exploit not only drug properties but also background knowledge. Learning efficiency is evaluated by cross-validation and direct testing with new molecules. Comparison of the two machine-learning methods shows that the inductive-logic-programming method displays a greater sensitivity than decision trees and successfully exploits background knowledge such as functional annotations and pathways of drug targets, thereby producing rich and expressive rules. All models and theories are available on a dedicated web site. CONCLUSIONS: Side-effect profiles covering a significant number of drugs have been extracted from a drug × side-effect association table. Integration of background knowledge concerning both chemical and biological spaces has been combined with a relational learning method for discovering rules which explicitly characterize drug-SEP associations. These rules are successfully used for predicting SEPs associated with new drugs.
format	Online Article Text
id	pubmed-3710241
institution	National Center for Biotechnology Information
language	English
publishDate	2013
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-3710241 2013-07-13 Integrative relational machine-learning for understanding drug side-effect profiles Bresso, Emmanuel Grisoni, Renaud Marchetti, Gino Karaboga, Arnaud Sinan Souchet Devignes, Marie-Dominique Smaïl-Tabbone, Malika BMC Bioinformatics Research Article BioMed Central 2013-06-26 /pmc/articles/PMC3710241/ /pubmed/23802887 http://dx.doi.org/10.1186/1471-2105-14-207 Text en Copyright © 2013 Bresso et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title	Integrative relational machine-learning for understanding drug side-effect profiles
topic	Research Article
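
The abstract in this record defines a side-effect profile (SEP) as a maximal frequent itemset over a drug × TC binary table. The following is a minimal sketch of that extraction step, not the authors' implementation: the drug names, TC labels, and the `min_support` threshold are all hypothetical, and the brute-force enumeration is only practical for toy tables.

```python
from itertools import combinations

# Toy drug x TC binary table (hypothetical data, not taken from SIDER):
# each drug maps to the set of term clusters (TCs) it is annotated with.
drug_tcs = {
    "drug_a": {"TC1", "TC2", "TC3"},
    "drug_b": {"TC1", "TC2", "TC3"},
    "drug_c": {"TC1", "TC2"},
    "drug_d": {"TC2", "TC4"},
    "drug_e": {"TC4"},
}


def support(itemset, table):
    """Number of drugs whose TC set contains every TC in `itemset`."""
    return sum(1 for tcs in table.values() if itemset <= tcs)


def maximal_frequent_itemsets(table, min_support):
    """Brute-force enumeration; exponential in the number of TCs."""
    items = sorted(set().union(*table.values()))
    frequent = [
        frozenset(combo)
        for size in range(1, len(items) + 1)
        for combo in combinations(items, size)
        if support(frozenset(combo), table) >= min_support
    ]
    # Keep only itemsets that have no frequent proper superset.
    return [s for s in frequent if not any(s < other for other in frequent)]


# min_support=2 is an arbitrary stand-in for "a significant number of drugs".
seps = maximal_frequent_itemsets(drug_tcs, min_support=2)
for sep in sorted(seps, key=sorted):
    print(sorted(sep), support(sep, drug_tcs))
```

On this toy table the combination TC1, TC2, TC3 is kept as one profile, while its frequent subsets (for example TC1 with TC2) are discarded because a longer frequent combination contains them. A real table would call for a dedicated maximal-itemset miner rather than full enumeration.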

