Cargando…
Exploring Approaches for Detecting Protein Functional Similarity within an Orthology-based Framework
Protein functional similarity based on gene ontology (GO) annotations serves as a powerful tool when comparing proteins on a functional level in applications such as protein-protein interaction prediction, gene prioritization, and disease gene discovery. Functional similarity (FS) is usually quantif...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5428484/ https://www.ncbi.nlm.nih.gov/pubmed/28336965 http://dx.doi.org/10.1038/s41598-017-00465-5 |
_version_ | 1783235831029301248 |
---|---|
author | Weichenberger, Christian X. Palermo, Antonia Pramstaller, Peter P. Domingues, Francisco S. |
author_facet | Weichenberger, Christian X. Palermo, Antonia Pramstaller, Peter P. Domingues, Francisco S. |
author_sort | Weichenberger, Christian X. |
collection | PubMed |
description | Protein functional similarity based on gene ontology (GO) annotations serves as a powerful tool when comparing proteins on a functional level in applications such as protein-protein interaction prediction, gene prioritization, and disease gene discovery. Functional similarity (FS) is usually quantified by combining the GO hierarchy with an annotation corpus that links genes and gene products to GO terms. One large group of algorithms involves calculation of GO term semantic similarity (SS) between all the terms annotating the two proteins, followed by a second step, described as “mixing strategy”, which involves combining the SS values to yield the final FS value. Due to the variability of protein annotation caused e.g. by annotation bias, this value cannot be reliably compared on an absolute scale. We therefore introduce a similarity z-score that takes into account the FS background distribution of each protein. For a selection of popular SS measures and mixing strategies we demonstrate moderate accuracy improvement when using z-scores in a benchmark that aims to separate orthologous cases from random gene pairs and discuss in this context the impact of annotation corpus choice. The approach has been implemented in Frela, a fast high-throughput public web server for protein FS calculation and interpretation. |
format | Online Article Text |
id | pubmed-5428484 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-54284842017-05-15 Exploring Approaches for Detecting Protein Functional Similarity within an Orthology-based Framework Weichenberger, Christian X. Palermo, Antonia Pramstaller, Peter P. Domingues, Francisco S. Sci Rep Article Protein functional similarity based on gene ontology (GO) annotations serves as a powerful tool when comparing proteins on a functional level in applications such as protein-protein interaction prediction, gene prioritization, and disease gene discovery. Functional similarity (FS) is usually quantified by combining the GO hierarchy with an annotation corpus that links genes and gene products to GO terms. One large group of algorithms involves calculation of GO term semantic similarity (SS) between all the terms annotating the two proteins, followed by a second step, described as “mixing strategy”, which involves combining the SS values to yield the final FS value. Due to the variability of protein annotation caused e.g. by annotation bias, this value cannot be reliably compared on an absolute scale. We therefore introduce a similarity z-score that takes into account the FS background distribution of each protein. For a selection of popular SS measures and mixing strategies we demonstrate moderate accuracy improvement when using z-scores in a benchmark that aims to separate orthologous cases from random gene pairs and discuss in this context the impact of annotation corpus choice. The approach has been implemented in Frela, a fast high-throughput public web server for protein FS calculation and interpretation. Nature Publishing Group UK 2017-03-23 /pmc/articles/PMC5428484/ /pubmed/28336965 http://dx.doi.org/10.1038/s41598-017-00465-5 Text en © The Author(s) 2017 This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Weichenberger, Christian X. Palermo, Antonia Pramstaller, Peter P. Domingues, Francisco S. Exploring Approaches for Detecting Protein Functional Similarity within an Orthology-based Framework |
title | Exploring Approaches for Detecting Protein Functional Similarity within an Orthology-based Framework |
title_full | Exploring Approaches for Detecting Protein Functional Similarity within an Orthology-based Framework |
title_fullStr | Exploring Approaches for Detecting Protein Functional Similarity within an Orthology-based Framework |
title_full_unstemmed | Exploring Approaches for Detecting Protein Functional Similarity within an Orthology-based Framework |
title_short | Exploring Approaches for Detecting Protein Functional Similarity within an Orthology-based Framework |
title_sort | exploring approaches for detecting protein functional similarity within an orthology-based framework |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5428484/ https://www.ncbi.nlm.nih.gov/pubmed/28336965 http://dx.doi.org/10.1038/s41598-017-00465-5 |
work_keys_str_mv | AT weichenbergerchristianx exploringapproachesfordetectingproteinfunctionalsimilaritywithinanorthologybasedframework AT palermoantonia exploringapproachesfordetectingproteinfunctionalsimilaritywithinanorthologybasedframework AT pramstallerpeterp exploringapproachesfordetectingproteinfunctionalsimilaritywithinanorthologybasedframework AT dominguesfranciscos exploringapproachesfordetectingproteinfunctionalsimilaritywithinanorthologybasedframework |