Cargando…

AllesTM: predicting multiple structural features of transmembrane proteins

BACKGROUND: This study is motivated by the following three considerations: a) the physico-chemical properties of transmembrane (TM) proteins are distinctly different from those of globular proteins, necessitating the development of specialized structure prediction techniques, b) for many structural...

Descripción completa

Detalles Bibliográficos
Autores principales: Hönigschmid, Peter, Breimann, Stephan, Weigl, Martina, Frishman, Dmitrij
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7291640/
https://www.ncbi.nlm.nih.gov/pubmed/32532211
http://dx.doi.org/10.1186/s12859-020-03581-8
_version_ 1783545947101331456
author Hönigschmid, Peter
Breimann, Stephan
Weigl, Martina
Frishman, Dmitrij
author_facet Hönigschmid, Peter
Breimann, Stephan
Weigl, Martina
Frishman, Dmitrij
author_sort Hönigschmid, Peter
collection PubMed
description BACKGROUND: This study is motivated by the following three considerations: a) the physico-chemical properties of transmembrane (TM) proteins are distinctly different from those of globular proteins, necessitating the development of specialized structure prediction techniques, b) for many structural features no specialized predictors for TM proteins are available at all, and c) deep learning algorithms allow to automate the feature engineering process and thus facilitate the development of multi-target methods for predicting several protein properties at once. RESULTS: We present AllesTM, an integrated tool to predict almost all structural features of transmembrane proteins that can be extracted from atomic coordinate data. It blends several machine learning algorithms: random forests and gradient boosting machines, convolutional neural networks in their original form as well as those enhanced by dilated convolutions and residual connections, and, finally, long short-term memory architectures. AllesTM outperforms other available methods in predicting residue depth in the membrane, flexibility, topology, relative solvent accessibility in its bound state, while in torsion angles, secondary structure and monomer relative solvent accessibility prediction it lags only slightly behind the currently leading technique SPOT-1D. High accuracy on a multitude of prediction targets and easy installation make AllesTM a one-stop shop for many typical problems in the structural bioinformatics of transmembrane proteins. CONCLUSIONS: In addition to presenting a highly accurate prediction method and eliminating the need to install and maintain many different software tools, we also provide a comprehensive overview of the impact of different machine learning algorithms and parameter choices on the prediction performance. AllesTM is freely available at https://github.com/phngs/allestm.
format Online
Article
Text
id pubmed-7291640
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-72916402020-06-12 AllesTM: predicting multiple structural features of transmembrane proteins Hönigschmid, Peter Breimann, Stephan Weigl, Martina Frishman, Dmitrij BMC Bioinformatics Methodology Article BACKGROUND: This study is motivated by the following three considerations: a) the physico-chemical properties of transmembrane (TM) proteins are distinctly different from those of globular proteins, necessitating the development of specialized structure prediction techniques, b) for many structural features no specialized predictors for TM proteins are available at all, and c) deep learning algorithms allow to automate the feature engineering process and thus facilitate the development of multi-target methods for predicting several protein properties at once. RESULTS: We present AllesTM, an integrated tool to predict almost all structural features of transmembrane proteins that can be extracted from atomic coordinate data. It blends several machine learning algorithms: random forests and gradient boosting machines, convolutional neural networks in their original form as well as those enhanced by dilated convolutions and residual connections, and, finally, long short-term memory architectures. AllesTM outperforms other available methods in predicting residue depth in the membrane, flexibility, topology, relative solvent accessibility in its bound state, while in torsion angles, secondary structure and monomer relative solvent accessibility prediction it lags only slightly behind the currently leading technique SPOT-1D. High accuracy on a multitude of prediction targets and easy installation make AllesTM a one-stop shop for many typical problems in the structural bioinformatics of transmembrane proteins. CONCLUSIONS: In addition to presenting a highly accurate prediction method and eliminating the need to install and maintain many different software tools, we also provide a comprehensive overview of the impact of different machine learning algorithms and parameter choices on the prediction performance. AllesTM is freely available at https://github.com/phngs/allestm. BioMed Central 2020-06-12 /pmc/articles/PMC7291640/ /pubmed/32532211 http://dx.doi.org/10.1186/s12859-020-03581-8 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology Article
Hönigschmid, Peter
Breimann, Stephan
Weigl, Martina
Frishman, Dmitrij
AllesTM: predicting multiple structural features of transmembrane proteins
title AllesTM: predicting multiple structural features of transmembrane proteins
title_full AllesTM: predicting multiple structural features of transmembrane proteins
title_fullStr AllesTM: predicting multiple structural features of transmembrane proteins
title_full_unstemmed AllesTM: predicting multiple structural features of transmembrane proteins
title_short AllesTM: predicting multiple structural features of transmembrane proteins
title_sort allestm: predicting multiple structural features of transmembrane proteins
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7291640/
https://www.ncbi.nlm.nih.gov/pubmed/32532211
http://dx.doi.org/10.1186/s12859-020-03581-8
work_keys_str_mv AT honigschmidpeter allestmpredictingmultiplestructuralfeaturesoftransmembraneproteins
AT breimannstephan allestmpredictingmultiplestructuralfeaturesoftransmembraneproteins
AT weiglmartina allestmpredictingmultiplestructuralfeaturesoftransmembraneproteins
AT frishmandmitrij allestmpredictingmultiplestructuralfeaturesoftransmembraneproteins