Cargando…

Measuring the Evolution of Ontology Complexity: The Gene Ontology Case Study

Ontologies support automatic sharing, combination and analysis of life sciences data. They undergo regular curation and enrichment. We studied the impact of an ontology evolution on its structural complexity. As a case study we used the sixty monthly releases between January 2008 and December 2012 o...

Descripción completa

Detalles Bibliográficos
Autores principales: Dameron, Olivier, Bettembourg, Charles, Le Meur, Nolwenn
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3795689/
https://www.ncbi.nlm.nih.gov/pubmed/24146805
http://dx.doi.org/10.1371/journal.pone.0075993
_version_ 1782287414282158080
author Dameron, Olivier
Bettembourg, Charles
Le Meur, Nolwenn
author_facet Dameron, Olivier
Bettembourg, Charles
Le Meur, Nolwenn
author_sort Dameron, Olivier
collection PubMed
description Ontologies support automatic sharing, combination and analysis of life sciences data. They undergo regular curation and enrichment. We studied the impact of an ontology evolution on its structural complexity. As a case study we used the sixty monthly releases between January 2008 and December 2012 of the Gene Ontology and its three independent branches, i.e. biological processes (BP), cellular components (CC) and molecular functions (MF). For each case, we measured complexity by computing metrics related to the size, the nodes connectivity and the hierarchical structure. The number of classes and relations increased monotonously for each branch, with different growth rates. BP and CC had similar connectivity, superior to that of MF. Connectivity increased monotonously for BP, decreased for CC and remained stable for MF, with a marked increase for the three branches in November and December 2012. Hierarchy-related measures showed that CC and MF had similar proportions of leaves, average depths and average heights. BP had a lower proportion of leaves, and a higher average depth and average height. For BP and MF, the late 2012 increase of connectivity resulted in an increase of the average depth and average height and a decrease of the proportion of leaves, indicating that a major enrichment effort of the intermediate-level hierarchy occurred. The variation of the number of classes and relations in an ontology does not provide enough information about the evolution of its complexity. However, connectivity and hierarchy-related metrics revealed different patterns of values as well as of evolution for the three branches of the Gene Ontology. CC was similar to BP in terms of connectivity, and similar to MF in terms of hierarchy. Overall, BP complexity increased, CC was refined with the addition of leaves providing a finer level of annotations but decreasing slightly its complexity, and MF complexity remained stable.
format Online
Article
Text
id pubmed-3795689
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-37956892013-10-21 Measuring the Evolution of Ontology Complexity: The Gene Ontology Case Study Dameron, Olivier Bettembourg, Charles Le Meur, Nolwenn PLoS One Research Article Ontologies support automatic sharing, combination and analysis of life sciences data. They undergo regular curation and enrichment. We studied the impact of an ontology evolution on its structural complexity. As a case study we used the sixty monthly releases between January 2008 and December 2012 of the Gene Ontology and its three independent branches, i.e. biological processes (BP), cellular components (CC) and molecular functions (MF). For each case, we measured complexity by computing metrics related to the size, the nodes connectivity and the hierarchical structure. The number of classes and relations increased monotonously for each branch, with different growth rates. BP and CC had similar connectivity, superior to that of MF. Connectivity increased monotonously for BP, decreased for CC and remained stable for MF, with a marked increase for the three branches in November and December 2012. Hierarchy-related measures showed that CC and MF had similar proportions of leaves, average depths and average heights. BP had a lower proportion of leaves, and a higher average depth and average height. For BP and MF, the late 2012 increase of connectivity resulted in an increase of the average depth and average height and a decrease of the proportion of leaves, indicating that a major enrichment effort of the intermediate-level hierarchy occurred. The variation of the number of classes and relations in an ontology does not provide enough information about the evolution of its complexity. However, connectivity and hierarchy-related metrics revealed different patterns of values as well as of evolution for the three branches of the Gene Ontology. CC was similar to BP in terms of connectivity, and similar to MF in terms of hierarchy. Overall, BP complexity increased, CC was refined with the addition of leaves providing a finer level of annotations but decreasing slightly its complexity, and MF complexity remained stable. Public Library of Science 2013-10-11 /pmc/articles/PMC3795689/ /pubmed/24146805 http://dx.doi.org/10.1371/journal.pone.0075993 Text en © 2013 Dameron et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Dameron, Olivier
Bettembourg, Charles
Le Meur, Nolwenn
Measuring the Evolution of Ontology Complexity: The Gene Ontology Case Study
title Measuring the Evolution of Ontology Complexity: The Gene Ontology Case Study
title_full Measuring the Evolution of Ontology Complexity: The Gene Ontology Case Study
title_fullStr Measuring the Evolution of Ontology Complexity: The Gene Ontology Case Study
title_full_unstemmed Measuring the Evolution of Ontology Complexity: The Gene Ontology Case Study
title_short Measuring the Evolution of Ontology Complexity: The Gene Ontology Case Study
title_sort measuring the evolution of ontology complexity: the gene ontology case study
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3795689/
https://www.ncbi.nlm.nih.gov/pubmed/24146805
http://dx.doi.org/10.1371/journal.pone.0075993
work_keys_str_mv AT dameronolivier measuringtheevolutionofontologycomplexitythegeneontologycasestudy
AT bettembourgcharles measuringtheevolutionofontologycomplexitythegeneontologycasestudy
AT lemeurnolwenn measuringtheevolutionofontologycomplexitythegeneontologycasestudy