Cargando…

A Relation Extraction Framework for Biomedical Text Using Hybrid Feature Set

The information extraction from unstructured text segments is a complex task. Although manual information extraction often produces the best results, it is harder to manage biomedical data extraction manually because of the exponential increase in data size. Thus, there is a need for automatic tools...

Descripción completa

Detalles Bibliográficos
Autores principales:	Muzaffar, Abdul Wahab, Azam, Farooque, Qamar, Usman
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Hindawi Publishing Corporation 2015
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4546954/ https://www.ncbi.nlm.nih.gov/pubmed/26347797 http://dx.doi.org/10.1155/2015/910423

_version_	1782387005787734016
author	Muzaffar, Abdul Wahab Azam, Farooque Qamar, Usman
author_facet	Muzaffar, Abdul Wahab Azam, Farooque Qamar, Usman
author_sort	Muzaffar, Abdul Wahab
collection	PubMed
description	The information extraction from unstructured text segments is a complex task. Although manual information extraction often produces the best results, it is harder to manage biomedical data extraction manually because of the exponential increase in data size. Thus, there is a need for automatic tools and techniques for information extraction in biomedical text mining. Relation extraction is a significant area under biomedical information extraction that has gained much importance in the last two decades. A lot of work has been done on biomedical relation extraction focusing on rule-based and machine learning techniques. In the last decade, the focus has changed to hybrid approaches showing better results. This research presents a hybrid feature set for classification of relations between biomedical entities. The main contribution of this research is done in the semantic feature set where verb phrases are ranked using Unified Medical Language System (UMLS) and a ranking algorithm. Support Vector Machine and Naïve Bayes, the two effective machine learning techniques, are used to classify these relations. Our approach has been validated on the standard biomedical text corpus obtained from MEDLINE 2001. Conclusively, it can be articulated that our framework outperforms all state-of-the-art approaches used for relation extraction on the same corpus.
format	Online Article Text
id	pubmed-4546954
institution	National Center for Biotechnology Information
language	English
publishDate	2015
publisher	Hindawi Publishing Corporation
record_format	MEDLINE/PubMed
spelling	pubmed-45469542015-09-07 A Relation Extraction Framework for Biomedical Text Using Hybrid Feature Set Muzaffar, Abdul Wahab Azam, Farooque Qamar, Usman Comput Math Methods Med Research Article The information extraction from unstructured text segments is a complex task. Although manual information extraction often produces the best results, it is harder to manage biomedical data extraction manually because of the exponential increase in data size. Thus, there is a need for automatic tools and techniques for information extraction in biomedical text mining. Relation extraction is a significant area under biomedical information extraction that has gained much importance in the last two decades. A lot of work has been done on biomedical relation extraction focusing on rule-based and machine learning techniques. In the last decade, the focus has changed to hybrid approaches showing better results. This research presents a hybrid feature set for classification of relations between biomedical entities. The main contribution of this research is done in the semantic feature set where verb phrases are ranked using Unified Medical Language System (UMLS) and a ranking algorithm. Support Vector Machine and Naïve Bayes, the two effective machine learning techniques, are used to classify these relations. Our approach has been validated on the standard biomedical text corpus obtained from MEDLINE 2001. Conclusively, it can be articulated that our framework outperforms all state-of-the-art approaches used for relation extraction on the same corpus. Hindawi Publishing Corporation 2015 2015-08-10 /pmc/articles/PMC4546954/ /pubmed/26347797 http://dx.doi.org/10.1155/2015/910423 Text en Copyright © 2015 Abdul Wahab Muzaffar et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Article Muzaffar, Abdul Wahab Azam, Farooque Qamar, Usman A Relation Extraction Framework for Biomedical Text Using Hybrid Feature Set
title	A Relation Extraction Framework for Biomedical Text Using Hybrid Feature Set
title_full	A Relation Extraction Framework for Biomedical Text Using Hybrid Feature Set
title_fullStr	A Relation Extraction Framework for Biomedical Text Using Hybrid Feature Set
title_full_unstemmed	A Relation Extraction Framework for Biomedical Text Using Hybrid Feature Set
title_short	A Relation Extraction Framework for Biomedical Text Using Hybrid Feature Set
title_sort	relation extraction framework for biomedical text using hybrid feature set
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4546954/ https://www.ncbi.nlm.nih.gov/pubmed/26347797 http://dx.doi.org/10.1155/2015/910423
work_keys_str_mv	AT muzaffarabdulwahab arelationextractionframeworkforbiomedicaltextusinghybridfeatureset AT azamfarooque arelationextractionframeworkforbiomedicaltextusinghybridfeatureset AT qamarusman arelationextractionframeworkforbiomedicaltextusinghybridfeatureset AT muzaffarabdulwahab relationextractionframeworkforbiomedicaltextusinghybridfeatureset AT azamfarooque relationextractionframeworkforbiomedicaltextusinghybridfeatureset AT qamarusman relationextractionframeworkforbiomedicaltextusinghybridfeatureset

A Relation Extraction Framework for Biomedical Text Using Hybrid Feature Set

Ejemplares similares