Cargando…
Interpreting linear support vector machine models with heat map molecule coloring
BACKGROUND: Model-based virtual screening plays an important role in the early drug discovery stage. The outcomes of high-throughput screenings are a valuable source for machine learning algorithms to infer such models. Besides a strong performance, the interpretability of a machine learning model i...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3076244/ https://www.ncbi.nlm.nih.gov/pubmed/21439031 http://dx.doi.org/10.1186/1758-2946-3-11 |
_version_ | 1782201817296273408 |
---|---|
author | Rosenbaum, Lars Hinselmann, Georg Jahn, Andreas Zell, Andreas |
author_facet | Rosenbaum, Lars Hinselmann, Georg Jahn, Andreas Zell, Andreas |
author_sort | Rosenbaum, Lars |
collection | PubMed |
description | BACKGROUND: Model-based virtual screening plays an important role in the early drug discovery stage. The outcomes of high-throughput screenings are a valuable source for machine learning algorithms to infer such models. Besides a strong performance, the interpretability of a machine learning model is a desired property to guide the optimization of a compound in later drug discovery stages. Linear support vector machines showed to have a convincing performance on large-scale data sets. The goal of this study is to present a heat map molecule coloring technique to interpret linear support vector machine models. Based on the weights of a linear model, the visualization approach colors each atom and bond of a compound according to its importance for activity. RESULTS: We evaluated our approach on a toxicity data set, a chromosome aberration data set, and the maximum unbiased validation data sets. The experiments show that our method sensibly visualizes structure-property and structure-activity relationships of a linear support vector machine model. The coloring of ligands in the binding pocket of several crystal structures of a maximum unbiased validation data set target indicates that our approach assists to determine the correct ligand orientation in the binding pocket. Additionally, the heat map coloring enables the identification of substructures important for the binding of an inhibitor. CONCLUSIONS: In combination with heat map coloring, linear support vector machine models can help to guide the modification of a compound in later stages of drug discovery. Particularly substructures identified as important by our method might be a starting point for optimization of a lead compound. The heat map coloring should be considered as complementary to structure based modeling approaches. As such, it helps to get a better understanding of the binding mode of an inhibitor. |
format | Text |
id | pubmed-3076244 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-30762442011-04-14 Interpreting linear support vector machine models with heat map molecule coloring Rosenbaum, Lars Hinselmann, Georg Jahn, Andreas Zell, Andreas J Cheminform Research Article BACKGROUND: Model-based virtual screening plays an important role in the early drug discovery stage. The outcomes of high-throughput screenings are a valuable source for machine learning algorithms to infer such models. Besides a strong performance, the interpretability of a machine learning model is a desired property to guide the optimization of a compound in later drug discovery stages. Linear support vector machines showed to have a convincing performance on large-scale data sets. The goal of this study is to present a heat map molecule coloring technique to interpret linear support vector machine models. Based on the weights of a linear model, the visualization approach colors each atom and bond of a compound according to its importance for activity. RESULTS: We evaluated our approach on a toxicity data set, a chromosome aberration data set, and the maximum unbiased validation data sets. The experiments show that our method sensibly visualizes structure-property and structure-activity relationships of a linear support vector machine model. The coloring of ligands in the binding pocket of several crystal structures of a maximum unbiased validation data set target indicates that our approach assists to determine the correct ligand orientation in the binding pocket. Additionally, the heat map coloring enables the identification of substructures important for the binding of an inhibitor. CONCLUSIONS: In combination with heat map coloring, linear support vector machine models can help to guide the modification of a compound in later stages of drug discovery. Particularly substructures identified as important by our method might be a starting point for optimization of a lead compound. The heat map coloring should be considered as complementary to structure based modeling approaches. As such, it helps to get a better understanding of the binding mode of an inhibitor. BioMed Central 2011-03-25 /pmc/articles/PMC3076244/ /pubmed/21439031 http://dx.doi.org/10.1186/1758-2946-3-11 Text en Copyright ©2011 Rosenbaum et al; licensee Chemistry Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Rosenbaum, Lars Hinselmann, Georg Jahn, Andreas Zell, Andreas Interpreting linear support vector machine models with heat map molecule coloring |
title | Interpreting linear support vector machine models with heat map molecule coloring |
title_full | Interpreting linear support vector machine models with heat map molecule coloring |
title_fullStr | Interpreting linear support vector machine models with heat map molecule coloring |
title_full_unstemmed | Interpreting linear support vector machine models with heat map molecule coloring |
title_short | Interpreting linear support vector machine models with heat map molecule coloring |
title_sort | interpreting linear support vector machine models with heat map molecule coloring |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3076244/ https://www.ncbi.nlm.nih.gov/pubmed/21439031 http://dx.doi.org/10.1186/1758-2946-3-11 |
work_keys_str_mv | AT rosenbaumlars interpretinglinearsupportvectormachinemodelswithheatmapmoleculecoloring AT hinselmanngeorg interpretinglinearsupportvectormachinemodelswithheatmapmoleculecoloring AT jahnandreas interpretinglinearsupportvectormachinemodelswithheatmapmoleculecoloring AT zellandreas interpretinglinearsupportvectormachinemodelswithheatmapmoleculecoloring |