Cargando…

Data Efficiency Semi-Supervised Meta-Learning Elucidates Understudied Interspecies Molecular Interactions

The power of deep learning compromises when applied to biological problems with sparsely labeled data and a data distribution shift. We developed a highly data-efficient model-agnostic semi-supervised meta-learning framework DESSML to address these challenges, and applied it to investigate understud...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, You, Xie, Li, Liu, Yang, Xie, Lei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10245663/
https://www.ncbi.nlm.nih.gov/pubmed/37292680
http://dx.doi.org/10.1101/2023.05.17.541172
Descripción
Sumario:The power of deep learning compromises when applied to biological problems with sparsely labeled data and a data distribution shift. We developed a highly data-efficient model-agnostic semi-supervised meta-learning framework DESSML to address these challenges, and applied it to investigate understudied interspecies metabolite-protein interactions (MPI). Knowledge of interspecies MPIs is crucial to understand microbiome-host interactions. However, our understanding of interspecies MPIs is extremely poor due to experimental limitations. The paucity of experimental data also hampers the application of machine learning. DESSML successfully explores unlabeled data and transfers the information of intraspecies chemical-protein interactions to the interspecies MPI predictions. It achieves three times improvement in the prediction-recall over the baseline model. Using DESSML, we reveal novel MPIs that are validated by bioactivity assays and fill in missing links in microbiome-human interactions. DESSML is a general framework to explore previously unrecognized biological domains beyond the reach of present experimental techniques.