Cargando…
Information Content-Based Gene Ontology Functional Similarity Measures: Which One to Use for a Given Biological Data Type?
The current increase in Gene Ontology (GO) annotations of proteins in the existing genome databases and their use in different analyses have fostered the improvement of several biomedical and biological applications. To integrate this functional data into different analyses, several protein function...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4256219/ https://www.ncbi.nlm.nih.gov/pubmed/25474538 http://dx.doi.org/10.1371/journal.pone.0113859 |
_version_ | 1782347551239831552 |
---|---|
author | Mazandu, Gaston K. Mulder, Nicola J. |
author_facet | Mazandu, Gaston K. Mulder, Nicola J. |
author_sort | Mazandu, Gaston K. |
collection | PubMed |
description | The current increase in Gene Ontology (GO) annotations of proteins in the existing genome databases and their use in different analyses have fostered the improvement of several biomedical and biological applications. To integrate this functional data into different analyses, several protein functional similarity measures based on GO term information content (IC) have been proposed and evaluated, especially in the context of annotation-based measures. In the case of topology-based measures, each approach was set with a specific functional similarity measure depending on its conception and applications for which it was designed. However, it is not clear whether a specific functional similarity measure associated with a given approach is the most appropriate, given a biological data set or an application, i.e., achieving the best performance compared to other functional similarity measures for the biological application under consideration. We show that, in general, a specific functional similarity measure often used with a given term IC or term semantic similarity approach is not always the best for different biological data and applications. We have conducted a performance evaluation of a number of different functional similarity measures using different types of biological data in order to infer the best functional similarity measure for each different term IC and semantic similarity approach. The comparisons of different protein functional similarity measures should help researchers choose the most appropriate measure for the biological application under consideration. |
format | Online Article Text |
id | pubmed-4256219 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-42562192014-12-11 Information Content-Based Gene Ontology Functional Similarity Measures: Which One to Use for a Given Biological Data Type? Mazandu, Gaston K. Mulder, Nicola J. PLoS One Research Article The current increase in Gene Ontology (GO) annotations of proteins in the existing genome databases and their use in different analyses have fostered the improvement of several biomedical and biological applications. To integrate this functional data into different analyses, several protein functional similarity measures based on GO term information content (IC) have been proposed and evaluated, especially in the context of annotation-based measures. In the case of topology-based measures, each approach was set with a specific functional similarity measure depending on its conception and applications for which it was designed. However, it is not clear whether a specific functional similarity measure associated with a given approach is the most appropriate, given a biological data set or an application, i.e., achieving the best performance compared to other functional similarity measures for the biological application under consideration. We show that, in general, a specific functional similarity measure often used with a given term IC or term semantic similarity approach is not always the best for different biological data and applications. We have conducted a performance evaluation of a number of different functional similarity measures using different types of biological data in order to infer the best functional similarity measure for each different term IC and semantic similarity approach. The comparisons of different protein functional similarity measures should help researchers choose the most appropriate measure for the biological application under consideration. Public Library of Science 2014-12-04 /pmc/articles/PMC4256219/ /pubmed/25474538 http://dx.doi.org/10.1371/journal.pone.0113859 Text en © 2014 Mazandu, Mulder http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Mazandu, Gaston K. Mulder, Nicola J. Information Content-Based Gene Ontology Functional Similarity Measures: Which One to Use for a Given Biological Data Type? |
title | Information Content-Based Gene Ontology Functional Similarity Measures: Which One to Use for a Given Biological Data Type? |
title_full | Information Content-Based Gene Ontology Functional Similarity Measures: Which One to Use for a Given Biological Data Type? |
title_fullStr | Information Content-Based Gene Ontology Functional Similarity Measures: Which One to Use for a Given Biological Data Type? |
title_full_unstemmed | Information Content-Based Gene Ontology Functional Similarity Measures: Which One to Use for a Given Biological Data Type? |
title_short | Information Content-Based Gene Ontology Functional Similarity Measures: Which One to Use for a Given Biological Data Type? |
title_sort | information content-based gene ontology functional similarity measures: which one to use for a given biological data type? |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4256219/ https://www.ncbi.nlm.nih.gov/pubmed/25474538 http://dx.doi.org/10.1371/journal.pone.0113859 |
work_keys_str_mv | AT mazandugastonk informationcontentbasedgeneontologyfunctionalsimilaritymeasureswhichonetouseforagivenbiologicaldatatype AT muldernicolaj informationcontentbasedgeneontologyfunctionalsimilaritymeasureswhichonetouseforagivenbiologicaldatatype |