Cargando…

A System for Identifying Named Entities in Biomedical Text: how Results From two Evaluations Reflect on Both the System and the Evaluations

We present a maximum entropy-based system for identifying named entities (NEs) in biomedical abstracts and present its performance in the only two biomedical named entity recognition (NER) comparative evaluations that have been held to date, namely BioCreative and Coling BioNLP. Our system obtained...

Descripción completa

Detalles Bibliográficos
Autores principales: Dingare, Shipra, Nissim, Malvina, Finkel, Jenny, Manning, Christopher, Grover, Claire
Formato: Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2448599/
https://www.ncbi.nlm.nih.gov/pubmed/18629295
http://dx.doi.org/10.1002/cfg.457
_version_ 1782157178601209856
author Dingare, Shipra
Nissim, Malvina
Finkel, Jenny
Manning, Christopher
Grover, Claire
author_facet Dingare, Shipra
Nissim, Malvina
Finkel, Jenny
Manning, Christopher
Grover, Claire
author_sort Dingare, Shipra
collection PubMed
description We present a maximum entropy-based system for identifying named entities (NEs) in biomedical abstracts and present its performance in the only two biomedical named entity recognition (NER) comparative evaluations that have been held to date, namely BioCreative and Coling BioNLP. Our system obtained an exact match F-score of 83.2% in the BioCreative evaluation and 70.1% in the BioNLP evaluation. We discuss our system in detail, including its rich use of local features, attention to correct boundary identification, innovative use of external knowledge resources, including parsing and web searches, and rapid adaptation to new NE sets. We also discuss in depth problems with data annotation in the evaluations which caused the final performance to be lower than optimal.
format Text
id pubmed-2448599
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-24485992008-07-14 A System for Identifying Named Entities in Biomedical Text: how Results From two Evaluations Reflect on Both the System and the Evaluations Dingare, Shipra Nissim, Malvina Finkel, Jenny Manning, Christopher Grover, Claire Comp Funct Genomics Research Article We present a maximum entropy-based system for identifying named entities (NEs) in biomedical abstracts and present its performance in the only two biomedical named entity recognition (NER) comparative evaluations that have been held to date, namely BioCreative and Coling BioNLP. Our system obtained an exact match F-score of 83.2% in the BioCreative evaluation and 70.1% in the BioNLP evaluation. We discuss our system in detail, including its rich use of local features, attention to correct boundary identification, innovative use of external knowledge resources, including parsing and web searches, and rapid adaptation to new NE sets. We also discuss in depth problems with data annotation in the evaluations which caused the final performance to be lower than optimal. Hindawi Publishing Corporation 2005 /pmc/articles/PMC2448599/ /pubmed/18629295 http://dx.doi.org/10.1002/cfg.457 Text en Copyright © 2005 Hindawi Publishing Corporation. http://creativecommons.org/licenses/by/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Dingare, Shipra
Nissim, Malvina
Finkel, Jenny
Manning, Christopher
Grover, Claire
A System for Identifying Named Entities in Biomedical Text: how Results From two Evaluations Reflect on Both the System and the Evaluations
title A System for Identifying Named Entities in Biomedical Text: how Results From two Evaluations Reflect on Both the System and the Evaluations
title_full A System for Identifying Named Entities in Biomedical Text: how Results From two Evaluations Reflect on Both the System and the Evaluations
title_fullStr A System for Identifying Named Entities in Biomedical Text: how Results From two Evaluations Reflect on Both the System and the Evaluations
title_full_unstemmed A System for Identifying Named Entities in Biomedical Text: how Results From two Evaluations Reflect on Both the System and the Evaluations
title_short A System for Identifying Named Entities in Biomedical Text: how Results From two Evaluations Reflect on Both the System and the Evaluations
title_sort system for identifying named entities in biomedical text: how results from two evaluations reflect on both the system and the evaluations
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2448599/
https://www.ncbi.nlm.nih.gov/pubmed/18629295
http://dx.doi.org/10.1002/cfg.457
work_keys_str_mv AT dingareshipra asystemforidentifyingnamedentitiesinbiomedicaltexthowresultsfromtwoevaluationsreflectonboththesystemandtheevaluations
AT nissimmalvina asystemforidentifyingnamedentitiesinbiomedicaltexthowresultsfromtwoevaluationsreflectonboththesystemandtheevaluations
AT finkeljenny asystemforidentifyingnamedentitiesinbiomedicaltexthowresultsfromtwoevaluationsreflectonboththesystemandtheevaluations
AT manningchristopher asystemforidentifyingnamedentitiesinbiomedicaltexthowresultsfromtwoevaluationsreflectonboththesystemandtheevaluations
AT groverclaire asystemforidentifyingnamedentitiesinbiomedicaltexthowresultsfromtwoevaluationsreflectonboththesystemandtheevaluations
AT dingareshipra systemforidentifyingnamedentitiesinbiomedicaltexthowresultsfromtwoevaluationsreflectonboththesystemandtheevaluations
AT nissimmalvina systemforidentifyingnamedentitiesinbiomedicaltexthowresultsfromtwoevaluationsreflectonboththesystemandtheevaluations
AT finkeljenny systemforidentifyingnamedentitiesinbiomedicaltexthowresultsfromtwoevaluationsreflectonboththesystemandtheevaluations
AT manningchristopher systemforidentifyingnamedentitiesinbiomedicaltexthowresultsfromtwoevaluationsreflectonboththesystemandtheevaluations
AT groverclaire systemforidentifyingnamedentitiesinbiomedicaltexthowresultsfromtwoevaluationsreflectonboththesystemandtheevaluations