Cargando…

Semantic annotation of morphological descriptions: an overall strategy

BACKGROUND: Large volumes of morphological descriptions of whole organisms have been created as print or electronic text in a human-readable format. Converting the descriptions into computer- readable formats gives a new life to the valuable knowledge on biodiversity. Research in this area started 2...

Descripción completa

Detalles Bibliográficos
Autor principal: Cui, Hong
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2887808/
https://www.ncbi.nlm.nih.gov/pubmed/20500882
http://dx.doi.org/10.1186/1471-2105-11-278
_version_ 1782182588010463232
author Cui, Hong
author_facet Cui, Hong
author_sort Cui, Hong
collection PubMed
description BACKGROUND: Large volumes of morphological descriptions of whole organisms have been created as print or electronic text in a human-readable format. Converting the descriptions into computer- readable formats gives a new life to the valuable knowledge on biodiversity. Research in this area started 20 years ago, yet not sufficient progress has been made to produce an automated system that requires only minimal human intervention but works on descriptions of various plant and animal groups. This paper attempts to examine the hindering factors by identifying the mismatches between existing research and the characteristics of morphological descriptions. RESULTS: This paper reviews the techniques that have been used for automated annotation, reports exploratory results on characteristics of morphological descriptions as a genre, and identifies challenges facing automated annotation systems. Based on these criteria, the paper proposes an overall strategy for converting descriptions of various taxon groups with the least human effort. CONCLUSIONS: A combined unsupervised and supervised machine learning strategy is needed to construct domain ontologies and lexicons and to ultimately achieve automated semantic annotation of morphological descriptions. Further, we suggest that each effort in creating a new description or annotating an individual description collection should be shared and contribute to the "biodiversity information commons" for the Semantic Web. This cannot be done without a sound strategy and a close partnership between and among information scientists and biologists.
format Text
id pubmed-2887808
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28878082010-06-19 Semantic annotation of morphological descriptions: an overall strategy Cui, Hong BMC Bioinformatics Research article BACKGROUND: Large volumes of morphological descriptions of whole organisms have been created as print or electronic text in a human-readable format. Converting the descriptions into computer- readable formats gives a new life to the valuable knowledge on biodiversity. Research in this area started 20 years ago, yet not sufficient progress has been made to produce an automated system that requires only minimal human intervention but works on descriptions of various plant and animal groups. This paper attempts to examine the hindering factors by identifying the mismatches between existing research and the characteristics of morphological descriptions. RESULTS: This paper reviews the techniques that have been used for automated annotation, reports exploratory results on characteristics of morphological descriptions as a genre, and identifies challenges facing automated annotation systems. Based on these criteria, the paper proposes an overall strategy for converting descriptions of various taxon groups with the least human effort. CONCLUSIONS: A combined unsupervised and supervised machine learning strategy is needed to construct domain ontologies and lexicons and to ultimately achieve automated semantic annotation of morphological descriptions. Further, we suggest that each effort in creating a new description or annotating an individual description collection should be shared and contribute to the "biodiversity information commons" for the Semantic Web. This cannot be done without a sound strategy and a close partnership between and among information scientists and biologists. BioMed Central 2010-05-25 /pmc/articles/PMC2887808/ /pubmed/20500882 http://dx.doi.org/10.1186/1471-2105-11-278 Text en Copyright ©2010 Cui; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research article
Cui, Hong
Semantic annotation of morphological descriptions: an overall strategy
title Semantic annotation of morphological descriptions: an overall strategy
title_full Semantic annotation of morphological descriptions: an overall strategy
title_fullStr Semantic annotation of morphological descriptions: an overall strategy
title_full_unstemmed Semantic annotation of morphological descriptions: an overall strategy
title_short Semantic annotation of morphological descriptions: an overall strategy
title_sort semantic annotation of morphological descriptions: an overall strategy
topic Research article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2887808/
https://www.ncbi.nlm.nih.gov/pubmed/20500882
http://dx.doi.org/10.1186/1471-2105-11-278
work_keys_str_mv AT cuihong semanticannotationofmorphologicaldescriptionsanoverallstrategy