Cargando…

On QSAR-based cardiotoxicity modeling with the expressiveness-enhanced graph learning model and dual-threshold scheme

Introduction: Given the direct association with malignant ventricular arrhythmias, cardiotoxicity is a major concern in drug design. In the past decades, computational models based on the quantitative structure–activity relationship have been proposed to screen out cardiotoxic compounds and have sho...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Huijia, Zhu, Guangxian, Izu, Leighton T., Chen-Izu, Ye, Ono, Naoaki, Altaf-Ul-Amin, MD, Kanaya, Shigehiko, Huang, Ming
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10203956/
https://www.ncbi.nlm.nih.gov/pubmed/37228825
http://dx.doi.org/10.3389/fphys.2023.1156286
_version_ 1785045742482620416
author Wang, Huijia
Zhu, Guangxian
Izu, Leighton T.
Chen-Izu, Ye
Ono, Naoaki
Altaf-Ul-Amin, MD
Kanaya, Shigehiko
Huang, Ming
author_facet Wang, Huijia
Zhu, Guangxian
Izu, Leighton T.
Chen-Izu, Ye
Ono, Naoaki
Altaf-Ul-Amin, MD
Kanaya, Shigehiko
Huang, Ming
author_sort Wang, Huijia
collection PubMed
description Introduction: Given the direct association with malignant ventricular arrhythmias, cardiotoxicity is a major concern in drug design. In the past decades, computational models based on the quantitative structure–activity relationship have been proposed to screen out cardiotoxic compounds and have shown promising results. The combination of molecular fingerprint and the machine learning model shows stable performance for a wide spectrum of problems; however, not long after the advent of the graph neural network (GNN) deep learning model and its variant (e.g., graph transformer), it has become the principal way of quantitative structure–activity relationship-based modeling for its high flexibility in feature extraction and decision rule generation. Despite all these progresses, the expressiveness (the ability of a program to identify non-isomorphic graph structures) of the GNN model is bounded by the WL isomorphism test, and a suitable thresholding scheme that relates directly to the sensitivity and credibility of a model is still an open question. Methods: In this research, we further improved the expressiveness of the GNN model by introducing the substructure-aware bias by the graph subgraph transformer network model. Moreover, to propose the most appropriate thresholding scheme, a comprehensive comparison of the thresholding schemes was conducted. Results: Based on these improvements, the best model attains performance with 90.4% precision, 90.4% recall, and 90.5% F1-score with a dual-threshold scheme (active: [Formula: see text] ; non-active: [Formula: see text] ). The improved pipeline (graph subgraph transformer network model and thresholding scheme) also shows its advantages in terms of the activity cliff problem and model interpretability.
format Online
Article
Text
id pubmed-10203956
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-102039562023-05-24 On QSAR-based cardiotoxicity modeling with the expressiveness-enhanced graph learning model and dual-threshold scheme Wang, Huijia Zhu, Guangxian Izu, Leighton T. Chen-Izu, Ye Ono, Naoaki Altaf-Ul-Amin, MD Kanaya, Shigehiko Huang, Ming Front Physiol Physiology Introduction: Given the direct association with malignant ventricular arrhythmias, cardiotoxicity is a major concern in drug design. In the past decades, computational models based on the quantitative structure–activity relationship have been proposed to screen out cardiotoxic compounds and have shown promising results. The combination of molecular fingerprint and the machine learning model shows stable performance for a wide spectrum of problems; however, not long after the advent of the graph neural network (GNN) deep learning model and its variant (e.g., graph transformer), it has become the principal way of quantitative structure–activity relationship-based modeling for its high flexibility in feature extraction and decision rule generation. Despite all these progresses, the expressiveness (the ability of a program to identify non-isomorphic graph structures) of the GNN model is bounded by the WL isomorphism test, and a suitable thresholding scheme that relates directly to the sensitivity and credibility of a model is still an open question. Methods: In this research, we further improved the expressiveness of the GNN model by introducing the substructure-aware bias by the graph subgraph transformer network model. Moreover, to propose the most appropriate thresholding scheme, a comprehensive comparison of the thresholding schemes was conducted. Results: Based on these improvements, the best model attains performance with 90.4% precision, 90.4% recall, and 90.5% F1-score with a dual-threshold scheme (active: [Formula: see text] ; non-active: [Formula: see text] ). The improved pipeline (graph subgraph transformer network model and thresholding scheme) also shows its advantages in terms of the activity cliff problem and model interpretability. Frontiers Media S.A. 2023-05-09 /pmc/articles/PMC10203956/ /pubmed/37228825 http://dx.doi.org/10.3389/fphys.2023.1156286 Text en Copyright © 2023 Wang, Zhu, Izu, Chen-Izu, Ono, Altaf-Ul-Amin, Kanaya and Huang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Physiology
Wang, Huijia
Zhu, Guangxian
Izu, Leighton T.
Chen-Izu, Ye
Ono, Naoaki
Altaf-Ul-Amin, MD
Kanaya, Shigehiko
Huang, Ming
On QSAR-based cardiotoxicity modeling with the expressiveness-enhanced graph learning model and dual-threshold scheme
title On QSAR-based cardiotoxicity modeling with the expressiveness-enhanced graph learning model and dual-threshold scheme
title_full On QSAR-based cardiotoxicity modeling with the expressiveness-enhanced graph learning model and dual-threshold scheme
title_fullStr On QSAR-based cardiotoxicity modeling with the expressiveness-enhanced graph learning model and dual-threshold scheme
title_full_unstemmed On QSAR-based cardiotoxicity modeling with the expressiveness-enhanced graph learning model and dual-threshold scheme
title_short On QSAR-based cardiotoxicity modeling with the expressiveness-enhanced graph learning model and dual-threshold scheme
title_sort on qsar-based cardiotoxicity modeling with the expressiveness-enhanced graph learning model and dual-threshold scheme
topic Physiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10203956/
https://www.ncbi.nlm.nih.gov/pubmed/37228825
http://dx.doi.org/10.3389/fphys.2023.1156286
work_keys_str_mv AT wanghuijia onqsarbasedcardiotoxicitymodelingwiththeexpressivenessenhancedgraphlearningmodelanddualthresholdscheme
AT zhuguangxian onqsarbasedcardiotoxicitymodelingwiththeexpressivenessenhancedgraphlearningmodelanddualthresholdscheme
AT izuleightont onqsarbasedcardiotoxicitymodelingwiththeexpressivenessenhancedgraphlearningmodelanddualthresholdscheme
AT chenizuye onqsarbasedcardiotoxicitymodelingwiththeexpressivenessenhancedgraphlearningmodelanddualthresholdscheme
AT ononaoaki onqsarbasedcardiotoxicitymodelingwiththeexpressivenessenhancedgraphlearningmodelanddualthresholdscheme
AT altafulaminmd onqsarbasedcardiotoxicitymodelingwiththeexpressivenessenhancedgraphlearningmodelanddualthresholdscheme
AT kanayashigehiko onqsarbasedcardiotoxicitymodelingwiththeexpressivenessenhancedgraphlearningmodelanddualthresholdscheme
AT huangming onqsarbasedcardiotoxicitymodelingwiththeexpressivenessenhancedgraphlearningmodelanddualthresholdscheme