Cargando…
Semantic Particularity Measure for Functional Characterization of Gene Sets Using Gene Ontology
BACKGROUND: Genetic and genomic data analyses are outputting large sets of genes. Functional comparison of these gene sets is a key part of the analysis, as it identifies their shared functions, and the functions that distinguish each set. The Gene Ontology (GO) initiative provides a unified referen...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3904913/ https://www.ncbi.nlm.nih.gov/pubmed/24489737 http://dx.doi.org/10.1371/journal.pone.0086525 |
_version_ | 1782301257324560384 |
---|---|
author | Bettembourg, Charles Diot, Christian Dameron, Olivier |
author_facet | Bettembourg, Charles Diot, Christian Dameron, Olivier |
author_sort | Bettembourg, Charles |
collection | PubMed |
description | BACKGROUND: Genetic and genomic data analyses are outputting large sets of genes. Functional comparison of these gene sets is a key part of the analysis, as it identifies their shared functions, and the functions that distinguish each set. The Gene Ontology (GO) initiative provides a unified reference for analyzing the genes molecular functions, biological processes and cellular components. Numerous semantic similarity measures have been developed to systematically quantify the weight of the GO terms shared by two genes. We studied how gene set comparisons can be improved by considering gene set particularity in addition to gene set similarity. RESULTS: We propose a new approach to compute gene set particularities based on the information conveyed by GO terms. A GO term informativeness can be computed using either its information content based on the term frequency in a corpus, or a function of the term's distance to the root. We defined the semantic particularity of a set of GO terms Sg1 compared to another set of GO terms Sg2. We combined our particularity measure with a similarity measure to compare gene sets. We demonstrated that the combination of semantic similarity and semantic particularity measures was able to identify genes with particular functions from among similar genes. This differentiation was not recognized using only a semantic similarity measure. CONCLUSION: Semantic particularity should be used in conjunction with semantic similarity to perform functional analysis of GO-annotated gene sets. The principle is generalizable to other ontologies. |
format | Online Article Text |
id | pubmed-3904913 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-39049132014-01-31 Semantic Particularity Measure for Functional Characterization of Gene Sets Using Gene Ontology Bettembourg, Charles Diot, Christian Dameron, Olivier PLoS One Research Article BACKGROUND: Genetic and genomic data analyses are outputting large sets of genes. Functional comparison of these gene sets is a key part of the analysis, as it identifies their shared functions, and the functions that distinguish each set. The Gene Ontology (GO) initiative provides a unified reference for analyzing the genes molecular functions, biological processes and cellular components. Numerous semantic similarity measures have been developed to systematically quantify the weight of the GO terms shared by two genes. We studied how gene set comparisons can be improved by considering gene set particularity in addition to gene set similarity. RESULTS: We propose a new approach to compute gene set particularities based on the information conveyed by GO terms. A GO term informativeness can be computed using either its information content based on the term frequency in a corpus, or a function of the term's distance to the root. We defined the semantic particularity of a set of GO terms Sg1 compared to another set of GO terms Sg2. We combined our particularity measure with a similarity measure to compare gene sets. We demonstrated that the combination of semantic similarity and semantic particularity measures was able to identify genes with particular functions from among similar genes. This differentiation was not recognized using only a semantic similarity measure. CONCLUSION: Semantic particularity should be used in conjunction with semantic similarity to perform functional analysis of GO-annotated gene sets. The principle is generalizable to other ontologies. Public Library of Science 2014-01-28 /pmc/articles/PMC3904913/ /pubmed/24489737 http://dx.doi.org/10.1371/journal.pone.0086525 Text en © 2014 Bettembourg et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Bettembourg, Charles Diot, Christian Dameron, Olivier Semantic Particularity Measure for Functional Characterization of Gene Sets Using Gene Ontology |
title | Semantic Particularity Measure for Functional Characterization of Gene Sets Using Gene Ontology |
title_full | Semantic Particularity Measure for Functional Characterization of Gene Sets Using Gene Ontology |
title_fullStr | Semantic Particularity Measure for Functional Characterization of Gene Sets Using Gene Ontology |
title_full_unstemmed | Semantic Particularity Measure for Functional Characterization of Gene Sets Using Gene Ontology |
title_short | Semantic Particularity Measure for Functional Characterization of Gene Sets Using Gene Ontology |
title_sort | semantic particularity measure for functional characterization of gene sets using gene ontology |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3904913/ https://www.ncbi.nlm.nih.gov/pubmed/24489737 http://dx.doi.org/10.1371/journal.pone.0086525 |
work_keys_str_mv | AT bettembourgcharles semanticparticularitymeasureforfunctionalcharacterizationofgenesetsusinggeneontology AT diotchristian semanticparticularitymeasureforfunctionalcharacterizationofgenesetsusinggeneontology AT dameronolivier semanticparticularitymeasureforfunctionalcharacterizationofgenesetsusinggeneontology |