Cargando…

A coherent graph-based semantic clustering and summarization approach for biomedical literature and a new summarization evaluation method

BACKGROUND: A huge amount of biomedical textual information has been produced and collected in MEDLINE for decades. In order to easily utilize biomedical information in the free text, document clustering and text summarization together are used as a solution for text information overload problem. In...

Descripción completa

Detalles Bibliográficos
Autores principales: Yoo, Illhoi, Hu, Xiaohua, Song, Il-Yeol
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2217662/
https://www.ncbi.nlm.nih.gov/pubmed/18047705
http://dx.doi.org/10.1186/1471-2105-8-S9-S4
_version_ 1782149295379578880
author Yoo, Illhoi
Hu, Xiaohua
Song, Il-Yeol
author_facet Yoo, Illhoi
Hu, Xiaohua
Song, Il-Yeol
author_sort Yoo, Illhoi
collection PubMed
description BACKGROUND: A huge amount of biomedical textual information has been produced and collected in MEDLINE for decades. In order to easily utilize biomedical information in the free text, document clustering and text summarization together are used as a solution for text information overload problem. In this paper, we introduce a coherent graph-based semantic clustering and summarization approach for biomedical literature. RESULTS: Our extensive experimental results show the approach shows 45% cluster quality improvement and 72% clustering reliability improvement, in terms of misclassification index, over Bisecting K-means as a leading document clustering approach. In addition, our approach provides concise but rich text summary in key concepts and sentences. CONCLUSION: Our coherent biomedical literature clustering and summarization approach that takes advantage of ontology-enriched graphical representations significantly improves the quality of document clusters and understandability of documents through summaries.
format Text
id pubmed-2217662
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-22176622008-01-31 A coherent graph-based semantic clustering and summarization approach for biomedical literature and a new summarization evaluation method Yoo, Illhoi Hu, Xiaohua Song, Il-Yeol BMC Bioinformatics Proceedings BACKGROUND: A huge amount of biomedical textual information has been produced and collected in MEDLINE for decades. In order to easily utilize biomedical information in the free text, document clustering and text summarization together are used as a solution for text information overload problem. In this paper, we introduce a coherent graph-based semantic clustering and summarization approach for biomedical literature. RESULTS: Our extensive experimental results show the approach shows 45% cluster quality improvement and 72% clustering reliability improvement, in terms of misclassification index, over Bisecting K-means as a leading document clustering approach. In addition, our approach provides concise but rich text summary in key concepts and sentences. CONCLUSION: Our coherent biomedical literature clustering and summarization approach that takes advantage of ontology-enriched graphical representations significantly improves the quality of document clusters and understandability of documents through summaries. BioMed Central 2007-11-27 /pmc/articles/PMC2217662/ /pubmed/18047705 http://dx.doi.org/10.1186/1471-2105-8-S9-S4 Text en Copyright © 2007 Yoo et al; licensee BioMed Central Ltd.
spellingShingle Proceedings
Yoo, Illhoi
Hu, Xiaohua
Song, Il-Yeol
A coherent graph-based semantic clustering and summarization approach for biomedical literature and a new summarization evaluation method
title A coherent graph-based semantic clustering and summarization approach for biomedical literature and a new summarization evaluation method
title_full A coherent graph-based semantic clustering and summarization approach for biomedical literature and a new summarization evaluation method
title_fullStr A coherent graph-based semantic clustering and summarization approach for biomedical literature and a new summarization evaluation method
title_full_unstemmed A coherent graph-based semantic clustering and summarization approach for biomedical literature and a new summarization evaluation method
title_short A coherent graph-based semantic clustering and summarization approach for biomedical literature and a new summarization evaluation method
title_sort coherent graph-based semantic clustering and summarization approach for biomedical literature and a new summarization evaluation method
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2217662/
https://www.ncbi.nlm.nih.gov/pubmed/18047705
http://dx.doi.org/10.1186/1471-2105-8-S9-S4
work_keys_str_mv AT yooillhoi acoherentgraphbasedsemanticclusteringandsummarizationapproachforbiomedicalliteratureandanewsummarizationevaluationmethod
AT huxiaohua acoherentgraphbasedsemanticclusteringandsummarizationapproachforbiomedicalliteratureandanewsummarizationevaluationmethod
AT songilyeol acoherentgraphbasedsemanticclusteringandsummarizationapproachforbiomedicalliteratureandanewsummarizationevaluationmethod
AT yooillhoi coherentgraphbasedsemanticclusteringandsummarizationapproachforbiomedicalliteratureandanewsummarizationevaluationmethod
AT huxiaohua coherentgraphbasedsemanticclusteringandsummarizationapproachforbiomedicalliteratureandanewsummarizationevaluationmethod
AT songilyeol coherentgraphbasedsemanticclusteringandsummarizationapproachforbiomedicalliteratureandanewsummarizationevaluationmethod