Cargando…

HealthMap: Global Infectious Disease Monitoring through Automated Classification and Visualization of Internet Media Reports

OBJECTIVE: Unstructured electronic information sources, such as news reports, are proving to be valuable inputs for public health surveillance. However, staying abreast of current disease outbreaks requires scouring a continually growing number of disparate news sources and alert services, resulting...

Descripción completa

Detalles Bibliográficos
Autores principales: Freifeld, Clark C., Mandl, Kenneth D., Reis, Ben Y., Brownstein, John S.
Formato: Texto
Lenguaje:English
Publicado: American Medical Informatics Association 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2274789/
https://www.ncbi.nlm.nih.gov/pubmed/18096908
http://dx.doi.org/10.1197/jamia.M2544
_version_ 1782151801459441664
author Freifeld, Clark C.
Mandl, Kenneth D.
Reis, Ben Y.
Brownstein, John S.
author_facet Freifeld, Clark C.
Mandl, Kenneth D.
Reis, Ben Y.
Brownstein, John S.
author_sort Freifeld, Clark C.
collection PubMed
description OBJECTIVE: Unstructured electronic information sources, such as news reports, are proving to be valuable inputs for public health surveillance. However, staying abreast of current disease outbreaks requires scouring a continually growing number of disparate news sources and alert services, resulting in information overload. Our objective is to address this challenge through the HealthMap.org Web application, an automated system for querying, filtering, integrating and visualizing unstructured reports on disease outbreaks. DESIGN: This report describes the design principles, software architecture and implementation of HealthMap and discusses key challenges and future plans. MEASUREMENTS: We describe the process by which HealthMap collects and integrates outbreak data from a variety of sources, including news media (e.g., Google News), expert-curated accounts (e.g., ProMED Mail), and validated official alerts. Through the use of text processing algorithms, the system classifies alerts by location and disease and then overlays them on an interactive geographic map. We measure the accuracy of the classification algorithms based on the level of human curation necessary to correct misclassifications, and examine geographic coverage. RESULTS: As part of the evaluation of the system, we analyzed 778 reports with HealthMap, representing 87 disease categories and 89 countries. The automated classifier performed with 84% accuracy, demonstrating significant usefulness in managing the large volume of information processed by the system. Accuracy for ProMED alerts is 91% compared to Google News reports at 81%, as ProMED messages follow a more regular structure. CONCLUSION: HealthMap is a useful free and open resource employing text-processing algorithms to identify important disease outbreak information through a user-friendly interface.
format Text
id pubmed-2274789
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher American Medical Informatics Association
record_format MEDLINE/PubMed
spelling pubmed-22747892008-09-01 HealthMap: Global Infectious Disease Monitoring through Automated Classification and Visualization of Internet Media Reports Freifeld, Clark C. Mandl, Kenneth D. Reis, Ben Y. Brownstein, John S. J Am Med Inform Assoc Model Formulation OBJECTIVE: Unstructured electronic information sources, such as news reports, are proving to be valuable inputs for public health surveillance. However, staying abreast of current disease outbreaks requires scouring a continually growing number of disparate news sources and alert services, resulting in information overload. Our objective is to address this challenge through the HealthMap.org Web application, an automated system for querying, filtering, integrating and visualizing unstructured reports on disease outbreaks. DESIGN: This report describes the design principles, software architecture and implementation of HealthMap and discusses key challenges and future plans. MEASUREMENTS: We describe the process by which HealthMap collects and integrates outbreak data from a variety of sources, including news media (e.g., Google News), expert-curated accounts (e.g., ProMED Mail), and validated official alerts. Through the use of text processing algorithms, the system classifies alerts by location and disease and then overlays them on an interactive geographic map. We measure the accuracy of the classification algorithms based on the level of human curation necessary to correct misclassifications, and examine geographic coverage. RESULTS: As part of the evaluation of the system, we analyzed 778 reports with HealthMap, representing 87 disease categories and 89 countries. The automated classifier performed with 84% accuracy, demonstrating significant usefulness in managing the large volume of information processed by the system. Accuracy for ProMED alerts is 91% compared to Google News reports at 81%, as ProMED messages follow a more regular structure. CONCLUSION: HealthMap is a useful free and open resource employing text-processing algorithms to identify important disease outbreak information through a user-friendly interface. American Medical Informatics Association 2008 /pmc/articles/PMC2274789/ /pubmed/18096908 http://dx.doi.org/10.1197/jamia.M2544 Text en Copyright © 2008, American Medical Informatics Association
spellingShingle Model Formulation
Freifeld, Clark C.
Mandl, Kenneth D.
Reis, Ben Y.
Brownstein, John S.
HealthMap: Global Infectious Disease Monitoring through Automated Classification and Visualization of Internet Media Reports
title HealthMap: Global Infectious Disease Monitoring through Automated Classification and Visualization of Internet Media Reports
title_full HealthMap: Global Infectious Disease Monitoring through Automated Classification and Visualization of Internet Media Reports
title_fullStr HealthMap: Global Infectious Disease Monitoring through Automated Classification and Visualization of Internet Media Reports
title_full_unstemmed HealthMap: Global Infectious Disease Monitoring through Automated Classification and Visualization of Internet Media Reports
title_short HealthMap: Global Infectious Disease Monitoring through Automated Classification and Visualization of Internet Media Reports
title_sort healthmap: global infectious disease monitoring through automated classification and visualization of internet media reports
topic Model Formulation
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2274789/
https://www.ncbi.nlm.nih.gov/pubmed/18096908
http://dx.doi.org/10.1197/jamia.M2544
work_keys_str_mv AT freifeldclarkc healthmapglobalinfectiousdiseasemonitoringthroughautomatedclassificationandvisualizationofinternetmediareports
AT mandlkennethd healthmapglobalinfectiousdiseasemonitoringthroughautomatedclassificationandvisualizationofinternetmediareports
AT reisbeny healthmapglobalinfectiousdiseasemonitoringthroughautomatedclassificationandvisualizationofinternetmediareports
AT brownsteinjohns healthmapglobalinfectiousdiseasemonitoringthroughautomatedclassificationandvisualizationofinternetmediareports