Cargando…

Prediction of Compound Bioactivities Using Heat-Diffusion Equation

Machine learning is expected to improve low throughput and high assay cost in cell-based phenotypic screening. However, it is still a challenge to apply machine learning to achieving sufficiently complex phenotypic screening due to imbalanced datasets, non-linear prediction, and unpredictability of...

Descripción completa

Detalles Bibliográficos
Autores principales: Hidaka, Tadashi, Imamura, Keiko, Hioki, Takeshi, Takagi, Terufumi, Giga, Yoshikazu, Giga, Mi-Ho, Nishimura, Yoshiteru, Kawahara, Yoshinobu, Hayashi, Satoru, Niki, Takeshi, Fushimi, Makoto, Inoue, Haruhisa
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7733880/
https://www.ncbi.nlm.nih.gov/pubmed/33336198
http://dx.doi.org/10.1016/j.patter.2020.100140
Descripción
Sumario:Machine learning is expected to improve low throughput and high assay cost in cell-based phenotypic screening. However, it is still a challenge to apply machine learning to achieving sufficiently complex phenotypic screening due to imbalanced datasets, non-linear prediction, and unpredictability of new chemotypes. Here, we developed a prediction model based on the heat-diffusion equation (PM-HDE) to address this issue. The algorithm was verified as feasible for virtual compound screening using biotest data of 946 assay systems registered with PubChem. PM-HDE was then applied to actual screening. Based on supervised learning of the data of about 50,000 compounds from biological phenotypic screening with motor neurons derived from ALS-patient-induced pluripotent stem cells, virtual screening of >1.6 million compounds was implemented. We confirmed that PM-HDE enriched the hit compounds and identified new chemotypes. This prediction model could overcome the inflexibility in machine learning, and our approach could provide a novel platform for drug discovery.