Cargando…
A Fuzzy-Match Search Engine for Physician Directories
BACKGROUND: A search engine to find physicians’ information is a basic but crucial function of a health care provider’s website. Inefficient search engines, which return no results or incorrect results, can lead to patient frustration and potential customer loss. A search engine that can handle miss...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Gunther Eysenbach
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4288075/ https://www.ncbi.nlm.nih.gov/pubmed/25601050 http://dx.doi.org/10.2196/medinform.3463 |
_version_ | 1782351903967936512 |
---|---|
author | Rastegar-Mojarad, Majid Kadolph, Christopher Ye, Zhan Wall, Daniel Murali, Narayana Lin, Simon |
author_facet | Rastegar-Mojarad, Majid Kadolph, Christopher Ye, Zhan Wall, Daniel Murali, Narayana Lin, Simon |
author_sort | Rastegar-Mojarad, Majid |
collection | PubMed |
description | BACKGROUND: A search engine to find physicians’ information is a basic but crucial function of a health care provider’s website. Inefficient search engines, which return no results or incorrect results, can lead to patient frustration and potential customer loss. A search engine that can handle misspellings and spelling variations of names is needed, as the United States (US) has culturally, racially, and ethnically diverse names. OBJECTIVE: The Marshfield Clinic website provides a search engine for users to search for physicians’ names. The current search engine provides an auto-completion function, but it requires an exact match. We observed that 26% of all searches yielded no results. The goal was to design a fuzzy-match algorithm to aid users in finding physicians easier and faster. METHODS: Instead of an exact match search, we used a fuzzy algorithm to find similar matches for searched terms. In the algorithm, we solved three types of search engine failures: “Typographic”, “Phonetic spelling variation”, and “Nickname”. To solve these mismatches, we used a customized Levenshtein distance calculation that incorporated Soundex coding and a lookup table of nicknames derived from US census data. RESULTS: Using the “Challenge Data Set of Marshfield Physician Names,” we evaluated the accuracy of fuzzy-match engine–top ten (90%) and compared it with exact match (0%), Soundex (24%), Levenshtein distance (59%), and fuzzy-match engine–top one (71%). CONCLUSIONS: We designed, created a reference implementation, and evaluated a fuzzy-match search engine for physician directories. The open-source code is available at the codeplex website and a reference implementation is available for demonstration at the datamarsh website. |
format | Online Article Text |
id | pubmed-4288075 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Gunther Eysenbach |
record_format | MEDLINE/PubMed |
spelling | pubmed-42880752015-01-15 A Fuzzy-Match Search Engine for Physician Directories Rastegar-Mojarad, Majid Kadolph, Christopher Ye, Zhan Wall, Daniel Murali, Narayana Lin, Simon JMIR Med Inform Original Paper BACKGROUND: A search engine to find physicians’ information is a basic but crucial function of a health care provider’s website. Inefficient search engines, which return no results or incorrect results, can lead to patient frustration and potential customer loss. A search engine that can handle misspellings and spelling variations of names is needed, as the United States (US) has culturally, racially, and ethnically diverse names. OBJECTIVE: The Marshfield Clinic website provides a search engine for users to search for physicians’ names. The current search engine provides an auto-completion function, but it requires an exact match. We observed that 26% of all searches yielded no results. The goal was to design a fuzzy-match algorithm to aid users in finding physicians easier and faster. METHODS: Instead of an exact match search, we used a fuzzy algorithm to find similar matches for searched terms. In the algorithm, we solved three types of search engine failures: “Typographic”, “Phonetic spelling variation”, and “Nickname”. To solve these mismatches, we used a customized Levenshtein distance calculation that incorporated Soundex coding and a lookup table of nicknames derived from US census data. RESULTS: Using the “Challenge Data Set of Marshfield Physician Names,” we evaluated the accuracy of fuzzy-match engine–top ten (90%) and compared it with exact match (0%), Soundex (24%), Levenshtein distance (59%), and fuzzy-match engine–top one (71%). CONCLUSIONS: We designed, created a reference implementation, and evaluated a fuzzy-match search engine for physician directories. The open-source code is available at the codeplex website and a reference implementation is available for demonstration at the datamarsh website. Gunther Eysenbach 2014-11-04 /pmc/articles/PMC4288075/ /pubmed/25601050 http://dx.doi.org/10.2196/medinform.3463 Text en ©Majid Rastegar-Mojarad, Christopher Kadolph, Zhan Ye, Daniel Wall, Narayana Murali, Simon Lin. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 04.11.2014. http://creativecommons.org/licenses/by/2.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on http://medinform.jmir.org/, as well as this copyright and license information must be included. |
spellingShingle | Original Paper Rastegar-Mojarad, Majid Kadolph, Christopher Ye, Zhan Wall, Daniel Murali, Narayana Lin, Simon A Fuzzy-Match Search Engine for Physician Directories |
title | A Fuzzy-Match Search Engine for Physician Directories |
title_full | A Fuzzy-Match Search Engine for Physician Directories |
title_fullStr | A Fuzzy-Match Search Engine for Physician Directories |
title_full_unstemmed | A Fuzzy-Match Search Engine for Physician Directories |
title_short | A Fuzzy-Match Search Engine for Physician Directories |
title_sort | fuzzy-match search engine for physician directories |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4288075/ https://www.ncbi.nlm.nih.gov/pubmed/25601050 http://dx.doi.org/10.2196/medinform.3463 |
work_keys_str_mv | AT rastegarmojaradmajid afuzzymatchsearchengineforphysiciandirectories AT kadolphchristopher afuzzymatchsearchengineforphysiciandirectories AT yezhan afuzzymatchsearchengineforphysiciandirectories AT walldaniel afuzzymatchsearchengineforphysiciandirectories AT muralinarayana afuzzymatchsearchengineforphysiciandirectories AT linsimon afuzzymatchsearchengineforphysiciandirectories AT rastegarmojaradmajid fuzzymatchsearchengineforphysiciandirectories AT kadolphchristopher fuzzymatchsearchengineforphysiciandirectories AT yezhan fuzzymatchsearchengineforphysiciandirectories AT walldaniel fuzzymatchsearchengineforphysiciandirectories AT muralinarayana fuzzymatchsearchengineforphysiciandirectories AT linsimon fuzzymatchsearchengineforphysiciandirectories |