Cargando…

Towards region-specific propagation of protein functions

MOTIVATION: Due to the nature of experimental annotation, most protein function prediction methods operate at the protein-level, where functions are assigned to full-length proteins based on overall similarities. However, most proteins function by interacting with other proteins or molecules, and ma...

Descripción completa

Detalles Bibliográficos
Autores principales: Koo, Da Chen Emily, Bonneau, Richard
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6513163/
https://www.ncbi.nlm.nih.gov/pubmed/30304483
http://dx.doi.org/10.1093/bioinformatics/bty834
_version_ 1783417732093444096
author Koo, Da Chen Emily
Bonneau, Richard
author_facet Koo, Da Chen Emily
Bonneau, Richard
author_sort Koo, Da Chen Emily
collection PubMed
description MOTIVATION: Due to the nature of experimental annotation, most protein function prediction methods operate at the protein-level, where functions are assigned to full-length proteins based on overall similarities. However, most proteins function by interacting with other proteins or molecules, and many functional associations should be limited to specific regions rather than the entire protein length. Most domain-centric function prediction methods depend on accurate domain family assignments to infer relationships between domains and functions, with regions that are unassigned to a known domain-family left out of functional evaluation. Given the abundance of residue-level annotations currently available, we present a function prediction methodology that automatically infers function labels of specific protein regions using protein-level annotations and multiple types of region-specific features. RESULTS: We apply this method to local features obtained from InterPro, UniProtKB and amino acid sequences and show that this method improves both the accuracy and region-specificity of protein function transfer and prediction. We compare region-level predictive performance of our method against that of a whole-protein baseline method using proteins with structurally verified binding sites and also compare protein-level temporal holdout predictive performances to expand the variety and specificity of GO terms we could evaluate. Our results can also serve as a starting point to categorize GO terms into region-specific and whole-protein terms and select prediction methods for different classes of GO terms. AVAILABILITY AND IMPLEMENTATION: The code and features are freely available at: https://github.com/ek1203/rsfp. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-6513163
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-65131632019-05-20 Towards region-specific propagation of protein functions Koo, Da Chen Emily Bonneau, Richard Bioinformatics Original Papers MOTIVATION: Due to the nature of experimental annotation, most protein function prediction methods operate at the protein-level, where functions are assigned to full-length proteins based on overall similarities. However, most proteins function by interacting with other proteins or molecules, and many functional associations should be limited to specific regions rather than the entire protein length. Most domain-centric function prediction methods depend on accurate domain family assignments to infer relationships between domains and functions, with regions that are unassigned to a known domain-family left out of functional evaluation. Given the abundance of residue-level annotations currently available, we present a function prediction methodology that automatically infers function labels of specific protein regions using protein-level annotations and multiple types of region-specific features. RESULTS: We apply this method to local features obtained from InterPro, UniProtKB and amino acid sequences and show that this method improves both the accuracy and region-specificity of protein function transfer and prediction. We compare region-level predictive performance of our method against that of a whole-protein baseline method using proteins with structurally verified binding sites and also compare protein-level temporal holdout predictive performances to expand the variety and specificity of GO terms we could evaluate. Our results can also serve as a starting point to categorize GO terms into region-specific and whole-protein terms and select prediction methods for different classes of GO terms. AVAILABILITY AND IMPLEMENTATION: The code and features are freely available at: https://github.com/ek1203/rsfp. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2019-05-15 2018-10-09 /pmc/articles/PMC6513163/ /pubmed/30304483 http://dx.doi.org/10.1093/bioinformatics/bty834 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Koo, Da Chen Emily
Bonneau, Richard
Towards region-specific propagation of protein functions
title Towards region-specific propagation of protein functions
title_full Towards region-specific propagation of protein functions
title_fullStr Towards region-specific propagation of protein functions
title_full_unstemmed Towards region-specific propagation of protein functions
title_short Towards region-specific propagation of protein functions
title_sort towards region-specific propagation of protein functions
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6513163/
https://www.ncbi.nlm.nih.gov/pubmed/30304483
http://dx.doi.org/10.1093/bioinformatics/bty834
work_keys_str_mv AT koodachenemily towardsregionspecificpropagationofproteinfunctions
AT bonneaurichard towardsregionspecificpropagationofproteinfunctions