Cargando…

Going from where to why—interpretable prediction of protein subcellular localization

Motivation: Protein subcellular localization is pivotal in understanding a protein's function. Computational prediction of subcellular localization has become a viable alternative to experimental approaches. While current machine learning-based methods yield good prediction accuracy, most of th...

Descripción completa

Detalles Bibliográficos
Autores principales:	Briesemeister, Sebastian, Rahnenführer, Jörg, Kohlbacher, Oliver
Formato:	Texto
Lenguaje:	English
Publicado:	Oxford University Press 2010
Materias:	Original Papers
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2859129/ https://www.ncbi.nlm.nih.gov/pubmed/20299325 http://dx.doi.org/10.1093/bioinformatics/btq115

_version_	1782180482734096384
author	Briesemeister, Sebastian Rahnenführer, Jörg Kohlbacher, Oliver
author_facet	Briesemeister, Sebastian Rahnenführer, Jörg Kohlbacher, Oliver
author_sort	Briesemeister, Sebastian
collection	PubMed
description	Motivation: Protein subcellular localization is pivotal in understanding a protein's function. Computational prediction of subcellular localization has become a viable alternative to experimental approaches. While current machine learning-based methods yield good prediction accuracy, most of them suffer from two key problems: lack of interpretability and dealing with multiple locations. Results: We present YLoc, a novel method for predicting protein subcellular localization that addresses these issues. Due to its simple architecture, YLoc can identify the relevant features of a protein sequence contributing to its subcellular localization, e.g. localization signals or motifs relevant to protein sorting. We present several example applications where YLoc identifies the sequence features responsible for protein localization, and thus reveals not only to which location a protein is transported to, but also why it is transported there. YLoc also provides a confidence estimate for the prediction. Thus, the user can decide what level of error is acceptable for a prediction. Due to a probabilistic approach and the use of several thousands of dual-targeted proteins, YLoc is able to predict multiple locations per protein. YLoc was benchmarked using several independent datasets for protein subcellular localization and performs on par with other state-of-the-art predictors. Disregarding low-confidence predictions, YLoc can achieve prediction accuracies of over 90%. Moreover, we show that YLoc is able to reliably predict multiple locations and outperforms the best predictors in this area. Availability: www.multiloc.org/YLoc Contact: briese@informatik.uni-tuebingen.de Supplementary information: Supplementary data are available at Bioinformatics online.
format	Text
id	pubmed-2859129
institution	National Center for Biotechnology Information
language	English
publishDate	2010
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-28591292010-04-26 Going from where to why—interpretable prediction of protein subcellular localization Briesemeister, Sebastian Rahnenführer, Jörg Kohlbacher, Oliver Bioinformatics Original Papers Motivation: Protein subcellular localization is pivotal in understanding a protein's function. Computational prediction of subcellular localization has become a viable alternative to experimental approaches. While current machine learning-based methods yield good prediction accuracy, most of them suffer from two key problems: lack of interpretability and dealing with multiple locations. Results: We present YLoc, a novel method for predicting protein subcellular localization that addresses these issues. Due to its simple architecture, YLoc can identify the relevant features of a protein sequence contributing to its subcellular localization, e.g. localization signals or motifs relevant to protein sorting. We present several example applications where YLoc identifies the sequence features responsible for protein localization, and thus reveals not only to which location a protein is transported to, but also why it is transported there. YLoc also provides a confidence estimate for the prediction. Thus, the user can decide what level of error is acceptable for a prediction. Due to a probabilistic approach and the use of several thousands of dual-targeted proteins, YLoc is able to predict multiple locations per protein. YLoc was benchmarked using several independent datasets for protein subcellular localization and performs on par with other state-of-the-art predictors. Disregarding low-confidence predictions, YLoc can achieve prediction accuracies of over 90%. Moreover, we show that YLoc is able to reliably predict multiple locations and outperforms the best predictors in this area. Availability: www.multiloc.org/YLoc Contact: briese@informatik.uni-tuebingen.de Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2010-05-01 2010-03-17 /pmc/articles/PMC2859129/ /pubmed/20299325 http://dx.doi.org/10.1093/bioinformatics/btq115 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Original Papers Briesemeister, Sebastian Rahnenführer, Jörg Kohlbacher, Oliver Going from where to why—interpretable prediction of protein subcellular localization
title	Going from where to why—interpretable prediction of protein subcellular localization
title_full	Going from where to why—interpretable prediction of protein subcellular localization
title_fullStr	Going from where to why—interpretable prediction of protein subcellular localization
title_full_unstemmed	Going from where to why—interpretable prediction of protein subcellular localization
title_short	Going from where to why—interpretable prediction of protein subcellular localization
title_sort	going from where to why—interpretable prediction of protein subcellular localization
topic	Original Papers
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2859129/ https://www.ncbi.nlm.nih.gov/pubmed/20299325 http://dx.doi.org/10.1093/bioinformatics/btq115
work_keys_str_mv	AT briesemeistersebastian goingfromwheretowhyinterpretablepredictionofproteinsubcellularlocalization AT rahnenfuhrerjorg goingfromwheretowhyinterpretablepredictionofproteinsubcellularlocalization AT kohlbacheroliver goingfromwheretowhyinterpretablepredictionofproteinsubcellularlocalization

Going from where to why—interpretable prediction of protein subcellular localization

Ejemplares similares