Cargando…

Machine learning to design integral membrane channelrhodopsins for efficient eukaryotic expression and plasma membrane localization

There is growing interest in studying and engineering integral membrane proteins (MPs) that play key roles in sensing and regulating cellular response to diverse external signals. A MP must be expressed, correctly inserted and folded in a lipid bilayer, and trafficked to the proper cellular location...

Descripción completa

Detalles Bibliográficos
Autores principales: Bedbrook, Claire N., Yang, Kevin K., Rice, Austin J., Gradinaru, Viviana, Arnold, Frances H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5695628/
https://www.ncbi.nlm.nih.gov/pubmed/29059183
http://dx.doi.org/10.1371/journal.pcbi.1005786
_version_ 1783280344175214592
author Bedbrook, Claire N.
Yang, Kevin K.
Rice, Austin J.
Gradinaru, Viviana
Arnold, Frances H.
author_facet Bedbrook, Claire N.
Yang, Kevin K.
Rice, Austin J.
Gradinaru, Viviana
Arnold, Frances H.
author_sort Bedbrook, Claire N.
collection PubMed
description There is growing interest in studying and engineering integral membrane proteins (MPs) that play key roles in sensing and regulating cellular response to diverse external signals. A MP must be expressed, correctly inserted and folded in a lipid bilayer, and trafficked to the proper cellular location in order to function. The sequence and structural determinants of these processes are complex and highly constrained. Here we describe a predictive, machine-learning approach that captures this complexity to facilitate successful MP engineering and design. Machine learning on carefully-chosen training sequences made by structure-guided SCHEMA recombination has enabled us to accurately predict the rare sequences in a diverse library of channelrhodopsins (ChRs) that express and localize to the plasma membrane of mammalian cells. These light-gated channel proteins of microbial origin are of interest for neuroscience applications, where expression and localization to the plasma membrane is a prerequisite for function. We trained Gaussian process (GP) classification and regression models with expression and localization data from 218 ChR chimeras chosen from a 118,098-variant library designed by SCHEMA recombination of three parent ChRs. We use these GP models to identify ChRs that express and localize well and show that our models can elucidate sequence and structure elements important for these processes. We also used the predictive models to convert a naturally occurring ChR incapable of mammalian localization into one that localizes well.
format Online
Article
Text
id pubmed-5695628
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-56956282017-11-30 Machine learning to design integral membrane channelrhodopsins for efficient eukaryotic expression and plasma membrane localization Bedbrook, Claire N. Yang, Kevin K. Rice, Austin J. Gradinaru, Viviana Arnold, Frances H. PLoS Comput Biol Research Article There is growing interest in studying and engineering integral membrane proteins (MPs) that play key roles in sensing and regulating cellular response to diverse external signals. A MP must be expressed, correctly inserted and folded in a lipid bilayer, and trafficked to the proper cellular location in order to function. The sequence and structural determinants of these processes are complex and highly constrained. Here we describe a predictive, machine-learning approach that captures this complexity to facilitate successful MP engineering and design. Machine learning on carefully-chosen training sequences made by structure-guided SCHEMA recombination has enabled us to accurately predict the rare sequences in a diverse library of channelrhodopsins (ChRs) that express and localize to the plasma membrane of mammalian cells. These light-gated channel proteins of microbial origin are of interest for neuroscience applications, where expression and localization to the plasma membrane is a prerequisite for function. We trained Gaussian process (GP) classification and regression models with expression and localization data from 218 ChR chimeras chosen from a 118,098-variant library designed by SCHEMA recombination of three parent ChRs. We use these GP models to identify ChRs that express and localize well and show that our models can elucidate sequence and structure elements important for these processes. We also used the predictive models to convert a naturally occurring ChR incapable of mammalian localization into one that localizes well. Public Library of Science 2017-10-23 /pmc/articles/PMC5695628/ /pubmed/29059183 http://dx.doi.org/10.1371/journal.pcbi.1005786 Text en © 2017 Bedbrook et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Bedbrook, Claire N.
Yang, Kevin K.
Rice, Austin J.
Gradinaru, Viviana
Arnold, Frances H.
Machine learning to design integral membrane channelrhodopsins for efficient eukaryotic expression and plasma membrane localization
title Machine learning to design integral membrane channelrhodopsins for efficient eukaryotic expression and plasma membrane localization
title_full Machine learning to design integral membrane channelrhodopsins for efficient eukaryotic expression and plasma membrane localization
title_fullStr Machine learning to design integral membrane channelrhodopsins for efficient eukaryotic expression and plasma membrane localization
title_full_unstemmed Machine learning to design integral membrane channelrhodopsins for efficient eukaryotic expression and plasma membrane localization
title_short Machine learning to design integral membrane channelrhodopsins for efficient eukaryotic expression and plasma membrane localization
title_sort machine learning to design integral membrane channelrhodopsins for efficient eukaryotic expression and plasma membrane localization
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5695628/
https://www.ncbi.nlm.nih.gov/pubmed/29059183
http://dx.doi.org/10.1371/journal.pcbi.1005786
work_keys_str_mv AT bedbrookclairen machinelearningtodesignintegralmembranechannelrhodopsinsforefficienteukaryoticexpressionandplasmamembranelocalization
AT yangkevink machinelearningtodesignintegralmembranechannelrhodopsinsforefficienteukaryoticexpressionandplasmamembranelocalization
AT riceaustinj machinelearningtodesignintegralmembranechannelrhodopsinsforefficienteukaryoticexpressionandplasmamembranelocalization
AT gradinaruviviana machinelearningtodesignintegralmembranechannelrhodopsinsforefficienteukaryoticexpressionandplasmamembranelocalization
AT arnoldfrancesh machinelearningtodesignintegralmembranechannelrhodopsinsforefficienteukaryoticexpressionandplasmamembranelocalization