Cargando…

Exploring Approaches for Detecting Protein Functional Similarity within an Orthology-based Framework

Protein functional similarity based on gene ontology (GO) annotations serves as a powerful tool when comparing proteins on a functional level in applications such as protein-protein interaction prediction, gene prioritization, and disease gene discovery. Functional similarity (FS) is usually quantif...

Descripción completa

Detalles Bibliográficos
Autores principales: Weichenberger, Christian X., Palermo, Antonia, Pramstaller, Peter P., Domingues, Francisco S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5428484/
https://www.ncbi.nlm.nih.gov/pubmed/28336965
http://dx.doi.org/10.1038/s41598-017-00465-5
_version_ 1783235831029301248
author Weichenberger, Christian X.
Palermo, Antonia
Pramstaller, Peter P.
Domingues, Francisco S.
author_facet Weichenberger, Christian X.
Palermo, Antonia
Pramstaller, Peter P.
Domingues, Francisco S.
author_sort Weichenberger, Christian X.
collection PubMed
description Protein functional similarity based on gene ontology (GO) annotations serves as a powerful tool when comparing proteins on a functional level in applications such as protein-protein interaction prediction, gene prioritization, and disease gene discovery. Functional similarity (FS) is usually quantified by combining the GO hierarchy with an annotation corpus that links genes and gene products to GO terms. One large group of algorithms involves calculation of GO term semantic similarity (SS) between all the terms annotating the two proteins, followed by a second step, described as “mixing strategy”, which involves combining the SS values to yield the final FS value. Due to the variability of protein annotation caused e.g. by annotation bias, this value cannot be reliably compared on an absolute scale. We therefore introduce a similarity z-score that takes into account the FS background distribution of each protein. For a selection of popular SS measures and mixing strategies we demonstrate moderate accuracy improvement when using z-scores in a benchmark that aims to separate orthologous cases from random gene pairs and discuss in this context the impact of annotation corpus choice. The approach has been implemented in Frela, a fast high-throughput public web server for protein FS calculation and interpretation.
format Online
Article
Text
id pubmed-5428484
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-54284842017-05-15 Exploring Approaches for Detecting Protein Functional Similarity within an Orthology-based Framework Weichenberger, Christian X. Palermo, Antonia Pramstaller, Peter P. Domingues, Francisco S. Sci Rep Article Protein functional similarity based on gene ontology (GO) annotations serves as a powerful tool when comparing proteins on a functional level in applications such as protein-protein interaction prediction, gene prioritization, and disease gene discovery. Functional similarity (FS) is usually quantified by combining the GO hierarchy with an annotation corpus that links genes and gene products to GO terms. One large group of algorithms involves calculation of GO term semantic similarity (SS) between all the terms annotating the two proteins, followed by a second step, described as “mixing strategy”, which involves combining the SS values to yield the final FS value. Due to the variability of protein annotation caused e.g. by annotation bias, this value cannot be reliably compared on an absolute scale. We therefore introduce a similarity z-score that takes into account the FS background distribution of each protein. For a selection of popular SS measures and mixing strategies we demonstrate moderate accuracy improvement when using z-scores in a benchmark that aims to separate orthologous cases from random gene pairs and discuss in this context the impact of annotation corpus choice. The approach has been implemented in Frela, a fast high-throughput public web server for protein FS calculation and interpretation. Nature Publishing Group UK 2017-03-23 /pmc/articles/PMC5428484/ /pubmed/28336965 http://dx.doi.org/10.1038/s41598-017-00465-5 Text en © The Author(s) 2017 This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Weichenberger, Christian X.
Palermo, Antonia
Pramstaller, Peter P.
Domingues, Francisco S.
Exploring Approaches for Detecting Protein Functional Similarity within an Orthology-based Framework
title Exploring Approaches for Detecting Protein Functional Similarity within an Orthology-based Framework
title_full Exploring Approaches for Detecting Protein Functional Similarity within an Orthology-based Framework
title_fullStr Exploring Approaches for Detecting Protein Functional Similarity within an Orthology-based Framework
title_full_unstemmed Exploring Approaches for Detecting Protein Functional Similarity within an Orthology-based Framework
title_short Exploring Approaches for Detecting Protein Functional Similarity within an Orthology-based Framework
title_sort exploring approaches for detecting protein functional similarity within an orthology-based framework
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5428484/
https://www.ncbi.nlm.nih.gov/pubmed/28336965
http://dx.doi.org/10.1038/s41598-017-00465-5
work_keys_str_mv AT weichenbergerchristianx exploringapproachesfordetectingproteinfunctionalsimilaritywithinanorthologybasedframework
AT palermoantonia exploringapproachesfordetectingproteinfunctionalsimilaritywithinanorthologybasedframework
AT pramstallerpeterp exploringapproachesfordetectingproteinfunctionalsimilaritywithinanorthologybasedframework
AT dominguesfranciscos exploringapproachesfordetectingproteinfunctionalsimilaritywithinanorthologybasedframework