Cargando…

Annotation of phenotypes using ontologies: a gold standard for the training and evaluation of natural language processing systems

Natural language descriptions of organismal phenotypes, a principal object of study in biology, are abundant in the biological literature. Expressing these phenotypes as logical statements using ontologies would enable large-scale analysis on phenotypic information from diverse systems. However, con...

Descripción completa

Detalles Bibliográficos
Autores principales: Dahdul, Wasila, Manda, Prashanti, Cui, Hong, Balhoff, James P, Dececchi, T Alexander, Ibrahim, Nizar, Lapp, Hilmar, Vision, Todd, Mabee, Paula M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6301375/
https://www.ncbi.nlm.nih.gov/pubmed/30576485
http://dx.doi.org/10.1093/database/bay110
_version_ 1783381821073915904
author Dahdul, Wasila
Manda, Prashanti
Cui, Hong
Balhoff, James P
Dececchi, T Alexander
Ibrahim, Nizar
Lapp, Hilmar
Vision, Todd
Mabee, Paula M
author_facet Dahdul, Wasila
Manda, Prashanti
Cui, Hong
Balhoff, James P
Dececchi, T Alexander
Ibrahim, Nizar
Lapp, Hilmar
Vision, Todd
Mabee, Paula M
author_sort Dahdul, Wasila
collection PubMed
description Natural language descriptions of organismal phenotypes, a principal object of study in biology, are abundant in the biological literature. Expressing these phenotypes as logical statements using ontologies would enable large-scale analysis on phenotypic information from diverse systems. However, considerable human effort is required to make these phenotype descriptions amenable to machine reasoning. Natural language processing tools have been developed to facilitate this task, and the training and evaluation of these tools depend on the availability of high quality, manually annotated gold standard data sets. We describe the development of an expert-curated gold standard data set of annotated phenotypes for evolutionary biology. The gold standard was developed for the curation of complex comparative phenotypes for the Phenoscape project. It was created by consensus among three curators and consists of entity–quality expressions of varying complexity. We use the gold standard to evaluate annotations created by human curators and those generated by the Semantic CharaParser tool. Using four annotation accuracy metrics that can account for any level of relationship between terms from two phenotype annotations, we found that machine–human consistency, or similarity, was significantly lower than inter-curator (human–human) consistency. Surprisingly, allowing curatorsaccess to external information did not significantly increase the similarity of their annotations to the gold standard or have a significant effect on inter-curator consistency. We found that the similarity of machine annotations to the gold standard increased after new relevant ontology terms had been added. Evaluation by the original authors of the character descriptions indicated that the gold standard annotations came closer to representing their intended meaning than did either the curator or machine annotations. These findings point toward ways to better design software to augment human curators and the use of the gold standard corpus will allow training and assessment of new tools to improve phenotype annotation accuracy at scale.
format Online
Article
Text
id pubmed-6301375
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-63013752018-12-27 Annotation of phenotypes using ontologies: a gold standard for the training and evaluation of natural language processing systems Dahdul, Wasila Manda, Prashanti Cui, Hong Balhoff, James P Dececchi, T Alexander Ibrahim, Nizar Lapp, Hilmar Vision, Todd Mabee, Paula M Database (Oxford) Original Article Natural language descriptions of organismal phenotypes, a principal object of study in biology, are abundant in the biological literature. Expressing these phenotypes as logical statements using ontologies would enable large-scale analysis on phenotypic information from diverse systems. However, considerable human effort is required to make these phenotype descriptions amenable to machine reasoning. Natural language processing tools have been developed to facilitate this task, and the training and evaluation of these tools depend on the availability of high quality, manually annotated gold standard data sets. We describe the development of an expert-curated gold standard data set of annotated phenotypes for evolutionary biology. The gold standard was developed for the curation of complex comparative phenotypes for the Phenoscape project. It was created by consensus among three curators and consists of entity–quality expressions of varying complexity. We use the gold standard to evaluate annotations created by human curators and those generated by the Semantic CharaParser tool. Using four annotation accuracy metrics that can account for any level of relationship between terms from two phenotype annotations, we found that machine–human consistency, or similarity, was significantly lower than inter-curator (human–human) consistency. Surprisingly, allowing curatorsaccess to external information did not significantly increase the similarity of their annotations to the gold standard or have a significant effect on inter-curator consistency. We found that the similarity of machine annotations to the gold standard increased after new relevant ontology terms had been added. Evaluation by the original authors of the character descriptions indicated that the gold standard annotations came closer to representing their intended meaning than did either the curator or machine annotations. These findings point toward ways to better design software to augment human curators and the use of the gold standard corpus will allow training and assessment of new tools to improve phenotype annotation accuracy at scale. Oxford University Press 2018-12-14 /pmc/articles/PMC6301375/ /pubmed/30576485 http://dx.doi.org/10.1093/database/bay110 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Dahdul, Wasila
Manda, Prashanti
Cui, Hong
Balhoff, James P
Dececchi, T Alexander
Ibrahim, Nizar
Lapp, Hilmar
Vision, Todd
Mabee, Paula M
Annotation of phenotypes using ontologies: a gold standard for the training and evaluation of natural language processing systems
title Annotation of phenotypes using ontologies: a gold standard for the training and evaluation of natural language processing systems
title_full Annotation of phenotypes using ontologies: a gold standard for the training and evaluation of natural language processing systems
title_fullStr Annotation of phenotypes using ontologies: a gold standard for the training and evaluation of natural language processing systems
title_full_unstemmed Annotation of phenotypes using ontologies: a gold standard for the training and evaluation of natural language processing systems
title_short Annotation of phenotypes using ontologies: a gold standard for the training and evaluation of natural language processing systems
title_sort annotation of phenotypes using ontologies: a gold standard for the training and evaluation of natural language processing systems
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6301375/
https://www.ncbi.nlm.nih.gov/pubmed/30576485
http://dx.doi.org/10.1093/database/bay110
work_keys_str_mv AT dahdulwasila annotationofphenotypesusingontologiesagoldstandardforthetrainingandevaluationofnaturallanguageprocessingsystems
AT mandaprashanti annotationofphenotypesusingontologiesagoldstandardforthetrainingandevaluationofnaturallanguageprocessingsystems
AT cuihong annotationofphenotypesusingontologiesagoldstandardforthetrainingandevaluationofnaturallanguageprocessingsystems
AT balhoffjamesp annotationofphenotypesusingontologiesagoldstandardforthetrainingandevaluationofnaturallanguageprocessingsystems
AT dececchitalexander annotationofphenotypesusingontologiesagoldstandardforthetrainingandevaluationofnaturallanguageprocessingsystems
AT ibrahimnizar annotationofphenotypesusingontologiesagoldstandardforthetrainingandevaluationofnaturallanguageprocessingsystems
AT lapphilmar annotationofphenotypesusingontologiesagoldstandardforthetrainingandevaluationofnaturallanguageprocessingsystems
AT visiontodd annotationofphenotypesusingontologiesagoldstandardforthetrainingandevaluationofnaturallanguageprocessingsystems
AT mabeepaulam annotationofphenotypesusingontologiesagoldstandardforthetrainingandevaluationofnaturallanguageprocessingsystems