Cargando…

Polar labeling: silver standard algorithm for training disease classifiers

MOTIVATION: Expert-labeled data are essential to train phenotyping algorithms for cohort identification. However expert labeling is time and labor intensive, and the costs remain prohibitive for scaling phenotyping to wider use-cases. RESULTS: We present an approach referred to as polar labeling (PL...

Descripción completa

Detalles Bibliográficos
Autores principales:	Wagholikar, Kavishwar B, Estiri, Hossein, Murphy, Marykate, Murphy, Shawn N
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Oxford University Press 2020
Materias:	Original Papers
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7214041/ https://www.ncbi.nlm.nih.gov/pubmed/32049335 http://dx.doi.org/10.1093/bioinformatics/btaa088

_version_	1783531900725362688
author	Wagholikar, Kavishwar B Estiri, Hossein Murphy, Marykate Murphy, Shawn N
author_facet	Wagholikar, Kavishwar B Estiri, Hossein Murphy, Marykate Murphy, Shawn N
author_sort	Wagholikar, Kavishwar B
collection	PubMed
description	MOTIVATION: Expert-labeled data are essential to train phenotyping algorithms for cohort identification. However expert labeling is time and labor intensive, and the costs remain prohibitive for scaling phenotyping to wider use-cases. RESULTS: We present an approach referred to as polar labeling (PL), to create silver standard for training machine learning (ML) for disease classification. We test the hypothesis that ML models trained on the silver standard created by applying PL on unlabeled patient records, are comparable in performance to the ML models trained on gold standard, created by clinical experts through manual review of patient records. We perform experimental validation using health records of 38 023 patients spanning six diseases. Our results demonstrate the superior performance of the proposed approach. AVAILABILITY AND IMPLEMENTATION: We provide a Python implementation of the algorithm and the Python code developed for this study on Github. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format	Online Article Text
id	pubmed-7214041
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-72140412020-05-15 Polar labeling: silver standard algorithm for training disease classifiers Wagholikar, Kavishwar B Estiri, Hossein Murphy, Marykate Murphy, Shawn N Bioinformatics Original Papers MOTIVATION: Expert-labeled data are essential to train phenotyping algorithms for cohort identification. However expert labeling is time and labor intensive, and the costs remain prohibitive for scaling phenotyping to wider use-cases. RESULTS: We present an approach referred to as polar labeling (PL), to create silver standard for training machine learning (ML) for disease classification. We test the hypothesis that ML models trained on the silver standard created by applying PL on unlabeled patient records, are comparable in performance to the ML models trained on gold standard, created by clinical experts through manual review of patient records. We perform experimental validation using health records of 38 023 patients spanning six diseases. Our results demonstrate the superior performance of the proposed approach. AVAILABILITY AND IMPLEMENTATION: We provide a Python implementation of the algorithm and the Python code developed for this study on Github. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2020-05-15 2020-02-12 /pmc/articles/PMC7214041/ /pubmed/32049335 http://dx.doi.org/10.1093/bioinformatics/btaa088 Text en © The Author(s) 2020. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Original Papers Wagholikar, Kavishwar B Estiri, Hossein Murphy, Marykate Murphy, Shawn N Polar labeling: silver standard algorithm for training disease classifiers
title	Polar labeling: silver standard algorithm for training disease classifiers
title_full	Polar labeling: silver standard algorithm for training disease classifiers
title_fullStr	Polar labeling: silver standard algorithm for training disease classifiers
title_full_unstemmed	Polar labeling: silver standard algorithm for training disease classifiers
title_short	Polar labeling: silver standard algorithm for training disease classifiers
title_sort	polar labeling: silver standard algorithm for training disease classifiers
topic	Original Papers
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7214041/ https://www.ncbi.nlm.nih.gov/pubmed/32049335 http://dx.doi.org/10.1093/bioinformatics/btaa088
work_keys_str_mv	AT wagholikarkavishwarb polarlabelingsilverstandardalgorithmfortrainingdiseaseclassifiers AT estirihossein polarlabelingsilverstandardalgorithmfortrainingdiseaseclassifiers AT murphymarykate polarlabelingsilverstandardalgorithmfortrainingdiseaseclassifiers AT murphyshawnn polarlabelingsilverstandardalgorithmfortrainingdiseaseclassifiers

Polar labeling: silver standard algorithm for training disease classifiers

Ejemplares similares