Cargando…

New concepts for building vocabulary for cell image ontologies

BACKGROUND: There are significant challenges associated with the building of ontologies for cell biology experiments including the large numbers of terms and their synonyms. These challenges make it difficult to simultaneously query data from multiple experiments or ontologies. If vocabulary terms w...

Descripción completa

Detalles Bibliográficos
Autores principales: Plant, Anne L, Elliott, John T, Bhat, Talapady N
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3293096/
https://www.ncbi.nlm.nih.gov/pubmed/22188658
http://dx.doi.org/10.1186/1471-2105-12-487
_version_ 1782225371572207616
author Plant, Anne L
Elliott, John T
Bhat, Talapady N
author_facet Plant, Anne L
Elliott, John T
Bhat, Talapady N
author_sort Plant, Anne L
collection PubMed
description BACKGROUND: There are significant challenges associated with the building of ontologies for cell biology experiments including the large numbers of terms and their synonyms. These challenges make it difficult to simultaneously query data from multiple experiments or ontologies. If vocabulary terms were consistently used and reused across and within ontologies, queries would be possible through shared terms. One approach to achieving this is to strictly control the terms used in ontologies in the form of a pre-defined schema, but this approach limits the individual researcher's ability to create new terms when needed to describe new experiments. RESULTS: Here, we propose the use of a limited number of highly reusable common root terms, and rules for an experimentalist to locally expand terms by adding more specific terms under more general root terms to form specific new vocabulary hierarchies that can be used to build ontologies. We illustrate the application of the method to build vocabularies and a prototype database for cell images that uses a visual data-tree of terms to facilitate sophisticated queries based on a experimental parameters. We demonstrate how the terminology might be extended by adding new vocabulary terms into the hierarchy of terms in an evolving process. In this approach, image data and metadata are handled separately, so we also describe a robust file-naming scheme to unambiguously identify image and other files associated with each metadata value. The prototype database http://sbd.nist.gov/ consists of more than 2000 images of cells and benchmark materials, and 163 metadata terms that describe experimental details, including many details about cell culture and handling. Image files of interest can be retrieved, and their data can be compared, by choosing one or more relevant metadata values as search terms. Metadata values for any dataset can be compared with corresponding values of another dataset through logical operations. CONCLUSIONS: Organizing metadata for cell imaging experiments under a framework of rules that include highly reused root terms will facilitate the addition of new terms into a vocabulary hierarchy and encourage the reuse of terms. These vocabulary hierarchies can be converted into XML schema or RDF graphs for displaying and querying, but this is not necessary for using it to annotate cell images. Vocabulary data trees from multiple experiments or laboratories can be aligned at the root terms to facilitate query development. This approach of developing vocabularies is compatible with the major advances in database technology and could be used for building the Semantic Web.
format Online
Article
Text
id pubmed-3293096
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32930962012-03-05 New concepts for building vocabulary for cell image ontologies Plant, Anne L Elliott, John T Bhat, Talapady N BMC Bioinformatics Research Article BACKGROUND: There are significant challenges associated with the building of ontologies for cell biology experiments including the large numbers of terms and their synonyms. These challenges make it difficult to simultaneously query data from multiple experiments or ontologies. If vocabulary terms were consistently used and reused across and within ontologies, queries would be possible through shared terms. One approach to achieving this is to strictly control the terms used in ontologies in the form of a pre-defined schema, but this approach limits the individual researcher's ability to create new terms when needed to describe new experiments. RESULTS: Here, we propose the use of a limited number of highly reusable common root terms, and rules for an experimentalist to locally expand terms by adding more specific terms under more general root terms to form specific new vocabulary hierarchies that can be used to build ontologies. We illustrate the application of the method to build vocabularies and a prototype database for cell images that uses a visual data-tree of terms to facilitate sophisticated queries based on a experimental parameters. We demonstrate how the terminology might be extended by adding new vocabulary terms into the hierarchy of terms in an evolving process. In this approach, image data and metadata are handled separately, so we also describe a robust file-naming scheme to unambiguously identify image and other files associated with each metadata value. The prototype database http://sbd.nist.gov/ consists of more than 2000 images of cells and benchmark materials, and 163 metadata terms that describe experimental details, including many details about cell culture and handling. Image files of interest can be retrieved, and their data can be compared, by choosing one or more relevant metadata values as search terms. Metadata values for any dataset can be compared with corresponding values of another dataset through logical operations. CONCLUSIONS: Organizing metadata for cell imaging experiments under a framework of rules that include highly reused root terms will facilitate the addition of new terms into a vocabulary hierarchy and encourage the reuse of terms. These vocabulary hierarchies can be converted into XML schema or RDF graphs for displaying and querying, but this is not necessary for using it to annotate cell images. Vocabulary data trees from multiple experiments or laboratories can be aligned at the root terms to facilitate query development. This approach of developing vocabularies is compatible with the major advances in database technology and could be used for building the Semantic Web. BioMed Central 2011-12-21 /pmc/articles/PMC3293096/ /pubmed/22188658 http://dx.doi.org/10.1186/1471-2105-12-487 Text en Copyright ©2011 Plant et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Plant, Anne L
Elliott, John T
Bhat, Talapady N
New concepts for building vocabulary for cell image ontologies
title New concepts for building vocabulary for cell image ontologies
title_full New concepts for building vocabulary for cell image ontologies
title_fullStr New concepts for building vocabulary for cell image ontologies
title_full_unstemmed New concepts for building vocabulary for cell image ontologies
title_short New concepts for building vocabulary for cell image ontologies
title_sort new concepts for building vocabulary for cell image ontologies
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3293096/
https://www.ncbi.nlm.nih.gov/pubmed/22188658
http://dx.doi.org/10.1186/1471-2105-12-487
work_keys_str_mv AT plantannel newconceptsforbuildingvocabularyforcellimageontologies
AT elliottjohnt newconceptsforbuildingvocabularyforcellimageontologies
AT bhattalapadyn newconceptsforbuildingvocabularyforcellimageontologies