Cargando…

Utilizing Descriptive Statements from the Biodiversity Heritage Library to Expand the Hymenoptera Anatomy Ontology

Hymenoptera, the insect order that includes sawflies, bees, wasps, and ants, exhibits an incredible diversity of phenotypes, with over 145,000 species described in a corpus of textual knowledge since Carolus Linnaeus. In the absence of specialized training, often spanning decades, however, these art...

Descripción completa

Detalles Bibliográficos
Autores principales: Seltmann, Katja C., Pénzes, Zsolt, Yoder, Matthew J., Bertone, Matthew A., Deans, Andrew R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3575469/
https://www.ncbi.nlm.nih.gov/pubmed/23441153
http://dx.doi.org/10.1371/journal.pone.0055674
_version_ 1782259732249051136
author Seltmann, Katja C.
Pénzes, Zsolt
Yoder, Matthew J.
Bertone, Matthew A.
Deans, Andrew R.
author_facet Seltmann, Katja C.
Pénzes, Zsolt
Yoder, Matthew J.
Bertone, Matthew A.
Deans, Andrew R.
author_sort Seltmann, Katja C.
collection PubMed
description Hymenoptera, the insect order that includes sawflies, bees, wasps, and ants, exhibits an incredible diversity of phenotypes, with over 145,000 species described in a corpus of textual knowledge since Carolus Linnaeus. In the absence of specialized training, often spanning decades, however, these articles can be challenging to decipher. Much of the vocabulary is domain-specific (e.g., Hymenoptera biology), historically without a comprehensive glossary, and contains much homonymous and synonymous terminology. The Hymenoptera Anatomy Ontology was developed to surmount this challenge and to aid future communication related to hymenopteran anatomy, as well as provide support for domain experts so they may actively benefit from the anatomy ontology development. As part of HAO development, an active learning, dictionary-based, natural language recognition tool was implemented to facilitate Hymenoptera anatomy term discovery in literature. We present this tool, referred to as the ‘Proofer’, as part of an iterative approach to growing phenotype-relevant ontologies, regardless of domain. The process of ontology development results in a critical mass of terms that is applied as a filter to the source collection of articles in order to reveal term occurrence and biases in natural language species descriptions. Our results indicate that taxonomists use domain-specific terminology that follows taxonomic specialization, particularly at superfamily and family level groupings and that the developed Proofer tool is effective for term discovery, facilitating ontology construction.
format Online
Article
Text
id pubmed-3575469
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-35754692013-02-25 Utilizing Descriptive Statements from the Biodiversity Heritage Library to Expand the Hymenoptera Anatomy Ontology Seltmann, Katja C. Pénzes, Zsolt Yoder, Matthew J. Bertone, Matthew A. Deans, Andrew R. PLoS One Research Article Hymenoptera, the insect order that includes sawflies, bees, wasps, and ants, exhibits an incredible diversity of phenotypes, with over 145,000 species described in a corpus of textual knowledge since Carolus Linnaeus. In the absence of specialized training, often spanning decades, however, these articles can be challenging to decipher. Much of the vocabulary is domain-specific (e.g., Hymenoptera biology), historically without a comprehensive glossary, and contains much homonymous and synonymous terminology. The Hymenoptera Anatomy Ontology was developed to surmount this challenge and to aid future communication related to hymenopteran anatomy, as well as provide support for domain experts so they may actively benefit from the anatomy ontology development. As part of HAO development, an active learning, dictionary-based, natural language recognition tool was implemented to facilitate Hymenoptera anatomy term discovery in literature. We present this tool, referred to as the ‘Proofer’, as part of an iterative approach to growing phenotype-relevant ontologies, regardless of domain. The process of ontology development results in a critical mass of terms that is applied as a filter to the source collection of articles in order to reveal term occurrence and biases in natural language species descriptions. Our results indicate that taxonomists use domain-specific terminology that follows taxonomic specialization, particularly at superfamily and family level groupings and that the developed Proofer tool is effective for term discovery, facilitating ontology construction. Public Library of Science 2013-02-18 /pmc/articles/PMC3575469/ /pubmed/23441153 http://dx.doi.org/10.1371/journal.pone.0055674 Text en © 2013 Seltmann et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Seltmann, Katja C.
Pénzes, Zsolt
Yoder, Matthew J.
Bertone, Matthew A.
Deans, Andrew R.
Utilizing Descriptive Statements from the Biodiversity Heritage Library to Expand the Hymenoptera Anatomy Ontology
title Utilizing Descriptive Statements from the Biodiversity Heritage Library to Expand the Hymenoptera Anatomy Ontology
title_full Utilizing Descriptive Statements from the Biodiversity Heritage Library to Expand the Hymenoptera Anatomy Ontology
title_fullStr Utilizing Descriptive Statements from the Biodiversity Heritage Library to Expand the Hymenoptera Anatomy Ontology
title_full_unstemmed Utilizing Descriptive Statements from the Biodiversity Heritage Library to Expand the Hymenoptera Anatomy Ontology
title_short Utilizing Descriptive Statements from the Biodiversity Heritage Library to Expand the Hymenoptera Anatomy Ontology
title_sort utilizing descriptive statements from the biodiversity heritage library to expand the hymenoptera anatomy ontology
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3575469/
https://www.ncbi.nlm.nih.gov/pubmed/23441153
http://dx.doi.org/10.1371/journal.pone.0055674
work_keys_str_mv AT seltmannkatjac utilizingdescriptivestatementsfromthebiodiversityheritagelibrarytoexpandthehymenopteraanatomyontology
AT penzeszsolt utilizingdescriptivestatementsfromthebiodiversityheritagelibrarytoexpandthehymenopteraanatomyontology
AT yodermatthewj utilizingdescriptivestatementsfromthebiodiversityheritagelibrarytoexpandthehymenopteraanatomyontology
AT bertonematthewa utilizingdescriptivestatementsfromthebiodiversityheritagelibrarytoexpandthehymenopteraanatomyontology
AT deansandrewr utilizingdescriptivestatementsfromthebiodiversityheritagelibrarytoexpandthehymenopteraanatomyontology