Cargando…

Beyond Captions: Linking Figures with Abstract Sentences in Biomedical Articles

Although figures in scientific articles have high information content and concisely communicate many key research findings, they are currently under utilized by literature search and retrieval systems. Many systems ignore figures, and those that do not typically only consider caption text. This stud...

Descripción completa

Detalles Bibliográficos
Autores principales: Bockhorst, Joseph P., Conroy, John M., Agarwal, Shashank, O’Leary, Dianne P., Yu, Hong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3399876/
https://www.ncbi.nlm.nih.gov/pubmed/22815711
http://dx.doi.org/10.1371/journal.pone.0039618
_version_ 1782238442289102848
author Bockhorst, Joseph P.
Conroy, John M.
Agarwal, Shashank
O’Leary, Dianne P.
Yu, Hong
author_facet Bockhorst, Joseph P.
Conroy, John M.
Agarwal, Shashank
O’Leary, Dianne P.
Yu, Hong
author_sort Bockhorst, Joseph P.
collection PubMed
description Although figures in scientific articles have high information content and concisely communicate many key research findings, they are currently under utilized by literature search and retrieval systems. Many systems ignore figures, and those that do not typically only consider caption text. This study describes and evaluates a fully automated approach for associating figures in the body of a biomedical article with sentences in its abstract. We use supervised methods to learn probabilistic language models, hidden Markov models, and conditional random fields for predicting associations between abstract sentences and figures. Three kinds of evidence are used: text in abstract sentences and figures, relative positions of sentences and figures, and the patterns of sentence/figure associations across an article. Each information source is shown to have predictive value, and models that use all kinds of evidence are more accurate than models that do not. Our most accurate method has an [Image: see text]-score of 69% on a cross-validation experiment, is competitive with the accuracy of human experts, has significantly better predictive accuracy than state-of-the-art methods and enables users to access figures associated with an abstract sentence with an average of 1.82 fewer mouse clicks. A user evaluation shows that human users find our system beneficial. The system is available at http://FigureItOut.askHERMES.org.
format Online
Article
Text
id pubmed-3399876
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-33998762012-07-19 Beyond Captions: Linking Figures with Abstract Sentences in Biomedical Articles Bockhorst, Joseph P. Conroy, John M. Agarwal, Shashank O’Leary, Dianne P. Yu, Hong PLoS One Research Article Although figures in scientific articles have high information content and concisely communicate many key research findings, they are currently under utilized by literature search and retrieval systems. Many systems ignore figures, and those that do not typically only consider caption text. This study describes and evaluates a fully automated approach for associating figures in the body of a biomedical article with sentences in its abstract. We use supervised methods to learn probabilistic language models, hidden Markov models, and conditional random fields for predicting associations between abstract sentences and figures. Three kinds of evidence are used: text in abstract sentences and figures, relative positions of sentences and figures, and the patterns of sentence/figure associations across an article. Each information source is shown to have predictive value, and models that use all kinds of evidence are more accurate than models that do not. Our most accurate method has an [Image: see text]-score of 69% on a cross-validation experiment, is competitive with the accuracy of human experts, has significantly better predictive accuracy than state-of-the-art methods and enables users to access figures associated with an abstract sentence with an average of 1.82 fewer mouse clicks. A user evaluation shows that human users find our system beneficial. The system is available at http://FigureItOut.askHERMES.org. Public Library of Science 2012-07-18 /pmc/articles/PMC3399876/ /pubmed/22815711 http://dx.doi.org/10.1371/journal.pone.0039618 Text en Bockhorst et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Bockhorst, Joseph P.
Conroy, John M.
Agarwal, Shashank
O’Leary, Dianne P.
Yu, Hong
Beyond Captions: Linking Figures with Abstract Sentences in Biomedical Articles
title Beyond Captions: Linking Figures with Abstract Sentences in Biomedical Articles
title_full Beyond Captions: Linking Figures with Abstract Sentences in Biomedical Articles
title_fullStr Beyond Captions: Linking Figures with Abstract Sentences in Biomedical Articles
title_full_unstemmed Beyond Captions: Linking Figures with Abstract Sentences in Biomedical Articles
title_short Beyond Captions: Linking Figures with Abstract Sentences in Biomedical Articles
title_sort beyond captions: linking figures with abstract sentences in biomedical articles
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3399876/
https://www.ncbi.nlm.nih.gov/pubmed/22815711
http://dx.doi.org/10.1371/journal.pone.0039618
work_keys_str_mv AT bockhorstjosephp beyondcaptionslinkingfigureswithabstractsentencesinbiomedicalarticles
AT conroyjohnm beyondcaptionslinkingfigureswithabstractsentencesinbiomedicalarticles
AT agarwalshashank beyondcaptionslinkingfigureswithabstractsentencesinbiomedicalarticles
AT olearydiannep beyondcaptionslinkingfigureswithabstractsentencesinbiomedicalarticles
AT yuhong beyondcaptionslinkingfigureswithabstractsentencesinbiomedicalarticles