Cargando…

A Fuzzy-Match Search Engine for Physician Directories

BACKGROUND: A search engine to find physicians’ information is a basic but crucial function of a health care provider’s website. Inefficient search engines, which return no results or incorrect results, can lead to patient frustration and potential customer loss. A search engine that can handle miss...

Descripción completa

Detalles Bibliográficos
Autores principales: Rastegar-Mojarad, Majid, Kadolph, Christopher, Ye, Zhan, Wall, Daniel, Murali, Narayana, Lin, Simon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Gunther Eysenbach 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4288075/
https://www.ncbi.nlm.nih.gov/pubmed/25601050
http://dx.doi.org/10.2196/medinform.3463
_version_ 1782351903967936512
author Rastegar-Mojarad, Majid
Kadolph, Christopher
Ye, Zhan
Wall, Daniel
Murali, Narayana
Lin, Simon
author_facet Rastegar-Mojarad, Majid
Kadolph, Christopher
Ye, Zhan
Wall, Daniel
Murali, Narayana
Lin, Simon
author_sort Rastegar-Mojarad, Majid
collection PubMed
description BACKGROUND: A search engine to find physicians’ information is a basic but crucial function of a health care provider’s website. Inefficient search engines, which return no results or incorrect results, can lead to patient frustration and potential customer loss. A search engine that can handle misspellings and spelling variations of names is needed, as the United States (US) has culturally, racially, and ethnically diverse names. OBJECTIVE: The Marshfield Clinic website provides a search engine for users to search for physicians’ names. The current search engine provides an auto-completion function, but it requires an exact match. We observed that 26% of all searches yielded no results. The goal was to design a fuzzy-match algorithm to aid users in finding physicians easier and faster. METHODS: Instead of an exact match search, we used a fuzzy algorithm to find similar matches for searched terms. In the algorithm, we solved three types of search engine failures: “Typographic”, “Phonetic spelling variation”, and “Nickname”. To solve these mismatches, we used a customized Levenshtein distance calculation that incorporated Soundex coding and a lookup table of nicknames derived from US census data. RESULTS: Using the “Challenge Data Set of Marshfield Physician Names,” we evaluated the accuracy of fuzzy-match engine–top ten (90%) and compared it with exact match (0%), Soundex (24%), Levenshtein distance (59%), and fuzzy-match engine–top one (71%). CONCLUSIONS: We designed, created a reference implementation, and evaluated a fuzzy-match search engine for physician directories. The open-source code is available at the codeplex website and a reference implementation is available for demonstration at the datamarsh website.
format Online
Article
Text
id pubmed-4288075
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Gunther Eysenbach
record_format MEDLINE/PubMed
spelling pubmed-42880752015-01-15 A Fuzzy-Match Search Engine for Physician Directories Rastegar-Mojarad, Majid Kadolph, Christopher Ye, Zhan Wall, Daniel Murali, Narayana Lin, Simon JMIR Med Inform Original Paper BACKGROUND: A search engine to find physicians’ information is a basic but crucial function of a health care provider’s website. Inefficient search engines, which return no results or incorrect results, can lead to patient frustration and potential customer loss. A search engine that can handle misspellings and spelling variations of names is needed, as the United States (US) has culturally, racially, and ethnically diverse names. OBJECTIVE: The Marshfield Clinic website provides a search engine for users to search for physicians’ names. The current search engine provides an auto-completion function, but it requires an exact match. We observed that 26% of all searches yielded no results. The goal was to design a fuzzy-match algorithm to aid users in finding physicians easier and faster. METHODS: Instead of an exact match search, we used a fuzzy algorithm to find similar matches for searched terms. In the algorithm, we solved three types of search engine failures: “Typographic”, “Phonetic spelling variation”, and “Nickname”. To solve these mismatches, we used a customized Levenshtein distance calculation that incorporated Soundex coding and a lookup table of nicknames derived from US census data. RESULTS: Using the “Challenge Data Set of Marshfield Physician Names,” we evaluated the accuracy of fuzzy-match engine–top ten (90%) and compared it with exact match (0%), Soundex (24%), Levenshtein distance (59%), and fuzzy-match engine–top one (71%). CONCLUSIONS: We designed, created a reference implementation, and evaluated a fuzzy-match search engine for physician directories. The open-source code is available at the codeplex website and a reference implementation is available for demonstration at the datamarsh website. Gunther Eysenbach 2014-11-04 /pmc/articles/PMC4288075/ /pubmed/25601050 http://dx.doi.org/10.2196/medinform.3463 Text en ©Majid Rastegar-Mojarad, Christopher Kadolph, Zhan Ye, Daniel Wall, Narayana Murali, Simon Lin. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 04.11.2014. http://creativecommons.org/licenses/by/2.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on http://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Rastegar-Mojarad, Majid
Kadolph, Christopher
Ye, Zhan
Wall, Daniel
Murali, Narayana
Lin, Simon
A Fuzzy-Match Search Engine for Physician Directories
title A Fuzzy-Match Search Engine for Physician Directories
title_full A Fuzzy-Match Search Engine for Physician Directories
title_fullStr A Fuzzy-Match Search Engine for Physician Directories
title_full_unstemmed A Fuzzy-Match Search Engine for Physician Directories
title_short A Fuzzy-Match Search Engine for Physician Directories
title_sort fuzzy-match search engine for physician directories
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4288075/
https://www.ncbi.nlm.nih.gov/pubmed/25601050
http://dx.doi.org/10.2196/medinform.3463
work_keys_str_mv AT rastegarmojaradmajid afuzzymatchsearchengineforphysiciandirectories
AT kadolphchristopher afuzzymatchsearchengineforphysiciandirectories
AT yezhan afuzzymatchsearchengineforphysiciandirectories
AT walldaniel afuzzymatchsearchengineforphysiciandirectories
AT muralinarayana afuzzymatchsearchengineforphysiciandirectories
AT linsimon afuzzymatchsearchengineforphysiciandirectories
AT rastegarmojaradmajid fuzzymatchsearchengineforphysiciandirectories
AT kadolphchristopher fuzzymatchsearchengineforphysiciandirectories
AT yezhan fuzzymatchsearchengineforphysiciandirectories
AT walldaniel fuzzymatchsearchengineforphysiciandirectories
AT muralinarayana fuzzymatchsearchengineforphysiciandirectories
AT linsimon fuzzymatchsearchengineforphysiciandirectories