Cargando…
AllesTM: predicting multiple structural features of transmembrane proteins
BACKGROUND: This study is motivated by the following three considerations: a) the physico-chemical properties of transmembrane (TM) proteins are distinctly different from those of globular proteins, necessitating the development of specialized structure prediction techniques, b) for many structural...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7291640/ https://www.ncbi.nlm.nih.gov/pubmed/32532211 http://dx.doi.org/10.1186/s12859-020-03581-8 |
_version_ | 1783545947101331456 |
---|---|
author | Hönigschmid, Peter Breimann, Stephan Weigl, Martina Frishman, Dmitrij |
author_facet | Hönigschmid, Peter Breimann, Stephan Weigl, Martina Frishman, Dmitrij |
author_sort | Hönigschmid, Peter |
collection | PubMed |
description | BACKGROUND: This study is motivated by the following three considerations: a) the physico-chemical properties of transmembrane (TM) proteins are distinctly different from those of globular proteins, necessitating the development of specialized structure prediction techniques, b) for many structural features no specialized predictors for TM proteins are available at all, and c) deep learning algorithms allow to automate the feature engineering process and thus facilitate the development of multi-target methods for predicting several protein properties at once. RESULTS: We present AllesTM, an integrated tool to predict almost all structural features of transmembrane proteins that can be extracted from atomic coordinate data. It blends several machine learning algorithms: random forests and gradient boosting machines, convolutional neural networks in their original form as well as those enhanced by dilated convolutions and residual connections, and, finally, long short-term memory architectures. AllesTM outperforms other available methods in predicting residue depth in the membrane, flexibility, topology, relative solvent accessibility in its bound state, while in torsion angles, secondary structure and monomer relative solvent accessibility prediction it lags only slightly behind the currently leading technique SPOT-1D. High accuracy on a multitude of prediction targets and easy installation make AllesTM a one-stop shop for many typical problems in the structural bioinformatics of transmembrane proteins. CONCLUSIONS: In addition to presenting a highly accurate prediction method and eliminating the need to install and maintain many different software tools, we also provide a comprehensive overview of the impact of different machine learning algorithms and parameter choices on the prediction performance. AllesTM is freely available at https://github.com/phngs/allestm. |
format | Online Article Text |
id | pubmed-7291640 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-72916402020-06-12 AllesTM: predicting multiple structural features of transmembrane proteins Hönigschmid, Peter Breimann, Stephan Weigl, Martina Frishman, Dmitrij BMC Bioinformatics Methodology Article BACKGROUND: This study is motivated by the following three considerations: a) the physico-chemical properties of transmembrane (TM) proteins are distinctly different from those of globular proteins, necessitating the development of specialized structure prediction techniques, b) for many structural features no specialized predictors for TM proteins are available at all, and c) deep learning algorithms allow to automate the feature engineering process and thus facilitate the development of multi-target methods for predicting several protein properties at once. RESULTS: We present AllesTM, an integrated tool to predict almost all structural features of transmembrane proteins that can be extracted from atomic coordinate data. It blends several machine learning algorithms: random forests and gradient boosting machines, convolutional neural networks in their original form as well as those enhanced by dilated convolutions and residual connections, and, finally, long short-term memory architectures. AllesTM outperforms other available methods in predicting residue depth in the membrane, flexibility, topology, relative solvent accessibility in its bound state, while in torsion angles, secondary structure and monomer relative solvent accessibility prediction it lags only slightly behind the currently leading technique SPOT-1D. High accuracy on a multitude of prediction targets and easy installation make AllesTM a one-stop shop for many typical problems in the structural bioinformatics of transmembrane proteins. CONCLUSIONS: In addition to presenting a highly accurate prediction method and eliminating the need to install and maintain many different software tools, we also provide a comprehensive overview of the impact of different machine learning algorithms and parameter choices on the prediction performance. AllesTM is freely available at https://github.com/phngs/allestm. BioMed Central 2020-06-12 /pmc/articles/PMC7291640/ /pubmed/32532211 http://dx.doi.org/10.1186/s12859-020-03581-8 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Methodology Article Hönigschmid, Peter Breimann, Stephan Weigl, Martina Frishman, Dmitrij AllesTM: predicting multiple structural features of transmembrane proteins |
title | AllesTM: predicting multiple structural features of transmembrane proteins |
title_full | AllesTM: predicting multiple structural features of transmembrane proteins |
title_fullStr | AllesTM: predicting multiple structural features of transmembrane proteins |
title_full_unstemmed | AllesTM: predicting multiple structural features of transmembrane proteins |
title_short | AllesTM: predicting multiple structural features of transmembrane proteins |
title_sort | allestm: predicting multiple structural features of transmembrane proteins |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7291640/ https://www.ncbi.nlm.nih.gov/pubmed/32532211 http://dx.doi.org/10.1186/s12859-020-03581-8 |
work_keys_str_mv | AT honigschmidpeter allestmpredictingmultiplestructuralfeaturesoftransmembraneproteins AT breimannstephan allestmpredictingmultiplestructuralfeaturesoftransmembraneproteins AT weiglmartina allestmpredictingmultiplestructuralfeaturesoftransmembraneproteins AT frishmandmitrij allestmpredictingmultiplestructuralfeaturesoftransmembraneproteins |