Cargando…

Semi-automatic conversion of BioProp semantic annotation to PASBio annotation

BACKGROUND: Semantic role labeling (SRL) is an important text analysis technique. In SRL, sentences are represented by one or more predicate-argument structures (PAS). Each PAS is composed of a predicate (verb) and several arguments (noun phrases, adverbial phrases, etc.) with different semantic rol...

Descripción completa

Detalles Bibliográficos
Autores principales: Tsai, Richard Tzong-Han, Dai, Hong-Jie, Huang, Chi-Hsin, Hsu, Wen-Lian
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2638158/
https://www.ncbi.nlm.nih.gov/pubmed/19091017
http://dx.doi.org/10.1186/1471-2105-9-S12-S18
_version_ 1782164398226276352
author Tsai, Richard Tzong-Han
Dai, Hong-Jie
Huang, Chi-Hsin
Hsu, Wen-Lian
author_facet Tsai, Richard Tzong-Han
Dai, Hong-Jie
Huang, Chi-Hsin
Hsu, Wen-Lian
author_sort Tsai, Richard Tzong-Han
collection PubMed
description BACKGROUND: Semantic role labeling (SRL) is an important text analysis technique. In SRL, sentences are represented by one or more predicate-argument structures (PAS). Each PAS is composed of a predicate (verb) and several arguments (noun phrases, adverbial phrases, etc.) with different semantic roles, including main arguments (agent or patient) as well as adjunct arguments (time, manner, or location). PropBank is the most widely used PAS corpus and annotation format in the newswire domain. In the biomedical field, however, more detailed and restrictive PAS annotation formats such as PASBio are popular. Unfortunately, due to the lack of an annotated PASBio corpus, no publicly available machine-learning (ML) based SRL systems based on PASBio have been developed. In previous work, we constructed a biomedical corpus based on the PropBank standard called BioProp, on which we developed an ML-based SRL system, BIOSMILE. In this paper, we aim to build a system to convert BIOSMILE's BioProp annotation output to PASBio annotation. Our system consists of BIOSMILE in combination with a BioProp-PASBio rule-based converter, and an additional semi-automatic rule generator. RESULTS: Our first experiment evaluated our rule-based converter's performance independently from BIOSMILE performance. The converter achieved an F-score of 85.29%. The second experiment evaluated combined system (BIOSMILE + rule-based converter). The system achieved an F-score of 69.08% for PASBio's 29 verbs. CONCLUSION: Our approach allows PAS conversion between BioProp and PASBio annotation using BIOSMILE alongside our newly developed semi-automatic rule generator and rule-based converter. Our system can match the performance of other state-of-the-art domain-specific ML-based SRL systems and can be easily customized for PASBio application development.
format Text
id pubmed-2638158
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26381582009-02-24 Semi-automatic conversion of BioProp semantic annotation to PASBio annotation Tsai, Richard Tzong-Han Dai, Hong-Jie Huang, Chi-Hsin Hsu, Wen-Lian BMC Bioinformatics Research BACKGROUND: Semantic role labeling (SRL) is an important text analysis technique. In SRL, sentences are represented by one or more predicate-argument structures (PAS). Each PAS is composed of a predicate (verb) and several arguments (noun phrases, adverbial phrases, etc.) with different semantic roles, including main arguments (agent or patient) as well as adjunct arguments (time, manner, or location). PropBank is the most widely used PAS corpus and annotation format in the newswire domain. In the biomedical field, however, more detailed and restrictive PAS annotation formats such as PASBio are popular. Unfortunately, due to the lack of an annotated PASBio corpus, no publicly available machine-learning (ML) based SRL systems based on PASBio have been developed. In previous work, we constructed a biomedical corpus based on the PropBank standard called BioProp, on which we developed an ML-based SRL system, BIOSMILE. In this paper, we aim to build a system to convert BIOSMILE's BioProp annotation output to PASBio annotation. Our system consists of BIOSMILE in combination with a BioProp-PASBio rule-based converter, and an additional semi-automatic rule generator. RESULTS: Our first experiment evaluated our rule-based converter's performance independently from BIOSMILE performance. The converter achieved an F-score of 85.29%. The second experiment evaluated combined system (BIOSMILE + rule-based converter). The system achieved an F-score of 69.08% for PASBio's 29 verbs. CONCLUSION: Our approach allows PAS conversion between BioProp and PASBio annotation using BIOSMILE alongside our newly developed semi-automatic rule generator and rule-based converter. Our system can match the performance of other state-of-the-art domain-specific ML-based SRL systems and can be easily customized for PASBio application development. BioMed Central 2008-12-12 /pmc/articles/PMC2638158/ /pubmed/19091017 http://dx.doi.org/10.1186/1471-2105-9-S12-S18 Text en Copyright © 2008 Tsai et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Tsai, Richard Tzong-Han
Dai, Hong-Jie
Huang, Chi-Hsin
Hsu, Wen-Lian
Semi-automatic conversion of BioProp semantic annotation to PASBio annotation
title Semi-automatic conversion of BioProp semantic annotation to PASBio annotation
title_full Semi-automatic conversion of BioProp semantic annotation to PASBio annotation
title_fullStr Semi-automatic conversion of BioProp semantic annotation to PASBio annotation
title_full_unstemmed Semi-automatic conversion of BioProp semantic annotation to PASBio annotation
title_short Semi-automatic conversion of BioProp semantic annotation to PASBio annotation
title_sort semi-automatic conversion of bioprop semantic annotation to pasbio annotation
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2638158/
https://www.ncbi.nlm.nih.gov/pubmed/19091017
http://dx.doi.org/10.1186/1471-2105-9-S12-S18
work_keys_str_mv AT tsairichardtzonghan semiautomaticconversionofbiopropsemanticannotationtopasbioannotation
AT daihongjie semiautomaticconversionofbiopropsemanticannotationtopasbioannotation
AT huangchihsin semiautomaticconversionofbiopropsemanticannotationtopasbioannotation
AT hsuwenlian semiautomaticconversionofbiopropsemanticannotationtopasbioannotation