Cargando…

Using Typed Dependencies to Study and Recognise Conceptualisation Zones in Biomedical Literature

In the biomedical domain, authors publish their experiments and findings using a quasi-standard coarse-grained discourse structure, which starts with an introduction that sets up the motivation, continues with a description of the materials and methods, and concludes with results and discussions. Ov...

Descripción completa

Detalles Bibliográficos
Autor principal: Groza, Tudor
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3832541/
https://www.ncbi.nlm.nih.gov/pubmed/24260252
http://dx.doi.org/10.1371/journal.pone.0079570
_version_ 1782291702365552640
author Groza, Tudor
author_facet Groza, Tudor
author_sort Groza, Tudor
collection PubMed
description In the biomedical domain, authors publish their experiments and findings using a quasi-standard coarse-grained discourse structure, which starts with an introduction that sets up the motivation, continues with a description of the materials and methods, and concludes with results and discussions. Over the course of the years, there has been a fair amount of research done in the area of scientific discourse analysis, with a focus on performing automatic recognition of scientific artefacts/conceptualisation zones from the raw content of scientific publications. Most of the existing approaches use Machine Learning techniques to perform classification based on features that rely on the shallow structure of the sentence tokens, or sentences as a whole, in addition to corpus-driven statistics. In this article, we investigate the role carried by the deep (dependency) structure of the sentences in describing their rhetorical nature. Using association rule mining techniques, we study the presence of dependency structure patterns in the context of a given rhetorical type, the use of these patterns in exploring differences in structure between the rhetorical types, and their ability to discriminate between the different rhetorical types. Our final goal is to provide a series of insights that can be used to complement existing classification approaches. Experimental results show that, in particular in the context of a fine-grained multi-class classification context, the association rules emerged from the dependency structure are not able to produce uniform classification results. However, they can be used to derive discriminative pair-wise classification mechanisms, in particular for some of the most ambiguous types.
format Online
Article
Text
id pubmed-3832541
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-38325412013-11-20 Using Typed Dependencies to Study and Recognise Conceptualisation Zones in Biomedical Literature Groza, Tudor PLoS One Research Article In the biomedical domain, authors publish their experiments and findings using a quasi-standard coarse-grained discourse structure, which starts with an introduction that sets up the motivation, continues with a description of the materials and methods, and concludes with results and discussions. Over the course of the years, there has been a fair amount of research done in the area of scientific discourse analysis, with a focus on performing automatic recognition of scientific artefacts/conceptualisation zones from the raw content of scientific publications. Most of the existing approaches use Machine Learning techniques to perform classification based on features that rely on the shallow structure of the sentence tokens, or sentences as a whole, in addition to corpus-driven statistics. In this article, we investigate the role carried by the deep (dependency) structure of the sentences in describing their rhetorical nature. Using association rule mining techniques, we study the presence of dependency structure patterns in the context of a given rhetorical type, the use of these patterns in exploring differences in structure between the rhetorical types, and their ability to discriminate between the different rhetorical types. Our final goal is to provide a series of insights that can be used to complement existing classification approaches. Experimental results show that, in particular in the context of a fine-grained multi-class classification context, the association rules emerged from the dependency structure are not able to produce uniform classification results. However, they can be used to derive discriminative pair-wise classification mechanisms, in particular for some of the most ambiguous types. Public Library of Science 2013-11-18 /pmc/articles/PMC3832541/ /pubmed/24260252 http://dx.doi.org/10.1371/journal.pone.0079570 Text en © 2013 Tudor Groza http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Groza, Tudor
Using Typed Dependencies to Study and Recognise Conceptualisation Zones in Biomedical Literature
title Using Typed Dependencies to Study and Recognise Conceptualisation Zones in Biomedical Literature
title_full Using Typed Dependencies to Study and Recognise Conceptualisation Zones in Biomedical Literature
title_fullStr Using Typed Dependencies to Study and Recognise Conceptualisation Zones in Biomedical Literature
title_full_unstemmed Using Typed Dependencies to Study and Recognise Conceptualisation Zones in Biomedical Literature
title_short Using Typed Dependencies to Study and Recognise Conceptualisation Zones in Biomedical Literature
title_sort using typed dependencies to study and recognise conceptualisation zones in biomedical literature
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3832541/
https://www.ncbi.nlm.nih.gov/pubmed/24260252
http://dx.doi.org/10.1371/journal.pone.0079570
work_keys_str_mv AT grozatudor usingtypeddependenciestostudyandrecogniseconceptualisationzonesinbiomedicalliterature