Cargando…

STS-NLSP: A Network-Based Label Space Partition Method for Predicting the Specificity of Membrane Transporter Substrates Using a Hybrid Feature of Structural and Semantic Similarity

Membrane transport proteins play crucial roles in the pharmacokinetics of substrate drugs, the drug resistance in cancer and are vital to the process of drug discovery, development and anti-cancer therapeutics. However, experimental methods to profile a substrate drug against a panel of transporters...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Xiangeng, Zhu, Xiaolei, Ye, Mingzhi, Wang, Yanjing, Li, Cheng-Dong, Xiong, Yi, Wei, Dong-Qing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6851049/
https://www.ncbi.nlm.nih.gov/pubmed/31781551
http://dx.doi.org/10.3389/fbioe.2019.00306
_version_ 1783469556897939456
author Wang, Xiangeng
Zhu, Xiaolei
Ye, Mingzhi
Wang, Yanjing
Li, Cheng-Dong
Xiong, Yi
Wei, Dong-Qing
author_facet Wang, Xiangeng
Zhu, Xiaolei
Ye, Mingzhi
Wang, Yanjing
Li, Cheng-Dong
Xiong, Yi
Wei, Dong-Qing
author_sort Wang, Xiangeng
collection PubMed
description Membrane transport proteins play crucial roles in the pharmacokinetics of substrate drugs, the drug resistance in cancer and are vital to the process of drug discovery, development and anti-cancer therapeutics. However, experimental methods to profile a substrate drug against a panel of transporters to determine its specificity are labor intensive and time consuming. In this article, we aim to develop an in silico multi-label classification approach to predict whether a substrate can specifically recognize one of the 13 categories of drug transporters ranging from ATP-binding cassette to solute carrier families using both structural fingerprints and chemical ontologies information of substrates. The data-driven network-based label space partition (NLSP) method was utilized to construct the model based on a hybrid of similarity-based feature by the integration of 2D fingerprint and semantic similarity. This method builds predictors for each label cluster (possibly intersecting) detected by community detection algorithms and takes union of label sets for a compound as final prediction. NLSP lies into the ensembles of multi-label classifier category in multi-label learning field. We utilized Cramér's V statistics to quantify the label correlations and depicted them via a heatmap. The jackknife tests and iterative stratification based cross-validation method were adopted on a benchmark dataset to evaluate the prediction performance of the proposed models both in multi-label and label-wise manner. Compared with other powerful multi-label methods, ML-kNN, MTSVM, and RAkELd, our multi-label classification model of NLPS-RF (random forest-based NLSP) has proven to be a feasible and effective model, and performed satisfactorily in the predictive task of transporter-substrate specificity. The idea behind NLSP method is intriguing and the power of NLSP remains to be explored for the multi-label learning problems in bioinformatics. The benchmark dataset, intermediate results and python code which can fully reproduce our experiments and results are available at https://github.com/dqwei-lab/STS.
format Online
Article
Text
id pubmed-6851049
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-68510492019-11-28 STS-NLSP: A Network-Based Label Space Partition Method for Predicting the Specificity of Membrane Transporter Substrates Using a Hybrid Feature of Structural and Semantic Similarity Wang, Xiangeng Zhu, Xiaolei Ye, Mingzhi Wang, Yanjing Li, Cheng-Dong Xiong, Yi Wei, Dong-Qing Front Bioeng Biotechnol Bioengineering and Biotechnology Membrane transport proteins play crucial roles in the pharmacokinetics of substrate drugs, the drug resistance in cancer and are vital to the process of drug discovery, development and anti-cancer therapeutics. However, experimental methods to profile a substrate drug against a panel of transporters to determine its specificity are labor intensive and time consuming. In this article, we aim to develop an in silico multi-label classification approach to predict whether a substrate can specifically recognize one of the 13 categories of drug transporters ranging from ATP-binding cassette to solute carrier families using both structural fingerprints and chemical ontologies information of substrates. The data-driven network-based label space partition (NLSP) method was utilized to construct the model based on a hybrid of similarity-based feature by the integration of 2D fingerprint and semantic similarity. This method builds predictors for each label cluster (possibly intersecting) detected by community detection algorithms and takes union of label sets for a compound as final prediction. NLSP lies into the ensembles of multi-label classifier category in multi-label learning field. We utilized Cramér's V statistics to quantify the label correlations and depicted them via a heatmap. The jackknife tests and iterative stratification based cross-validation method were adopted on a benchmark dataset to evaluate the prediction performance of the proposed models both in multi-label and label-wise manner. Compared with other powerful multi-label methods, ML-kNN, MTSVM, and RAkELd, our multi-label classification model of NLPS-RF (random forest-based NLSP) has proven to be a feasible and effective model, and performed satisfactorily in the predictive task of transporter-substrate specificity. The idea behind NLSP method is intriguing and the power of NLSP remains to be explored for the multi-label learning problems in bioinformatics. The benchmark dataset, intermediate results and python code which can fully reproduce our experiments and results are available at https://github.com/dqwei-lab/STS. Frontiers Media S.A. 2019-11-06 /pmc/articles/PMC6851049/ /pubmed/31781551 http://dx.doi.org/10.3389/fbioe.2019.00306 Text en Copyright © 2019 Wang, Zhu, Ye, Wang, Li, Xiong and Wei. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Bioengineering and Biotechnology
Wang, Xiangeng
Zhu, Xiaolei
Ye, Mingzhi
Wang, Yanjing
Li, Cheng-Dong
Xiong, Yi
Wei, Dong-Qing
STS-NLSP: A Network-Based Label Space Partition Method for Predicting the Specificity of Membrane Transporter Substrates Using a Hybrid Feature of Structural and Semantic Similarity
title STS-NLSP: A Network-Based Label Space Partition Method for Predicting the Specificity of Membrane Transporter Substrates Using a Hybrid Feature of Structural and Semantic Similarity
title_full STS-NLSP: A Network-Based Label Space Partition Method for Predicting the Specificity of Membrane Transporter Substrates Using a Hybrid Feature of Structural and Semantic Similarity
title_fullStr STS-NLSP: A Network-Based Label Space Partition Method for Predicting the Specificity of Membrane Transporter Substrates Using a Hybrid Feature of Structural and Semantic Similarity
title_full_unstemmed STS-NLSP: A Network-Based Label Space Partition Method for Predicting the Specificity of Membrane Transporter Substrates Using a Hybrid Feature of Structural and Semantic Similarity
title_short STS-NLSP: A Network-Based Label Space Partition Method for Predicting the Specificity of Membrane Transporter Substrates Using a Hybrid Feature of Structural and Semantic Similarity
title_sort sts-nlsp: a network-based label space partition method for predicting the specificity of membrane transporter substrates using a hybrid feature of structural and semantic similarity
topic Bioengineering and Biotechnology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6851049/
https://www.ncbi.nlm.nih.gov/pubmed/31781551
http://dx.doi.org/10.3389/fbioe.2019.00306
work_keys_str_mv AT wangxiangeng stsnlspanetworkbasedlabelspacepartitionmethodforpredictingthespecificityofmembranetransportersubstratesusingahybridfeatureofstructuralandsemanticsimilarity
AT zhuxiaolei stsnlspanetworkbasedlabelspacepartitionmethodforpredictingthespecificityofmembranetransportersubstratesusingahybridfeatureofstructuralandsemanticsimilarity
AT yemingzhi stsnlspanetworkbasedlabelspacepartitionmethodforpredictingthespecificityofmembranetransportersubstratesusingahybridfeatureofstructuralandsemanticsimilarity
AT wangyanjing stsnlspanetworkbasedlabelspacepartitionmethodforpredictingthespecificityofmembranetransportersubstratesusingahybridfeatureofstructuralandsemanticsimilarity
AT lichengdong stsnlspanetworkbasedlabelspacepartitionmethodforpredictingthespecificityofmembranetransportersubstratesusingahybridfeatureofstructuralandsemanticsimilarity
AT xiongyi stsnlspanetworkbasedlabelspacepartitionmethodforpredictingthespecificityofmembranetransportersubstratesusingahybridfeatureofstructuralandsemanticsimilarity
AT weidongqing stsnlspanetworkbasedlabelspacepartitionmethodforpredictingthespecificityofmembranetransportersubstratesusingahybridfeatureofstructuralandsemanticsimilarity