Cargando…
Methodological Issues in Predicting Pediatric Epilepsy Surgery Candidates Through Natural Language Processing and Machine Learning
Objective: We describe the development and evaluation of a system that uses machine learning and natural language processing techniques to identify potential candidates for surgical intervention for drug-resistant pediatric epilepsy. The data are comprised of free-text clinical notes extracted from...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Libertas Academica
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4876984/ https://www.ncbi.nlm.nih.gov/pubmed/27257386 http://dx.doi.org/10.4137/BII.S38308 |
_version_ | 1782433311239438336 |
---|---|
author | Cohen, Kevin Bretonnel Glass, Benjamin Greiner, Hansel M. Holland-Bouley, Katherine Standridge, Shannon Arya, Ravindra Faist, Robert Morita, Diego Mangano, Francesco Connolly, Brian Glauser, Tracy Pestian, John |
author_facet | Cohen, Kevin Bretonnel Glass, Benjamin Greiner, Hansel M. Holland-Bouley, Katherine Standridge, Shannon Arya, Ravindra Faist, Robert Morita, Diego Mangano, Francesco Connolly, Brian Glauser, Tracy Pestian, John |
author_sort | Cohen, Kevin Bretonnel |
collection | PubMed |
description | Objective: We describe the development and evaluation of a system that uses machine learning and natural language processing techniques to identify potential candidates for surgical intervention for drug-resistant pediatric epilepsy. The data are comprised of free-text clinical notes extracted from the electronic health record (EHR). Both known clinical outcomes from the EHR and manual chart annotations provide gold standards for the patient’s status. The following hypotheses are then tested: 1) machine learning methods can identify epilepsy surgery candidates as well as physicians do and 2) machine learning methods can identify candidates earlier than physicians do. These hypotheses are tested by systematically evaluating the effects of the data source, amount of training data, class balance, classification algorithm, and feature set on classifier performance. The results support both hypotheses, with F-measures ranging from 0.71 to 0.82. The feature set, classification algorithm, amount of training data, class balance, and gold standard all significantly affected classification performance. It was further observed that classification performance was better than the highest agreement between two annotators, even at one year before documented surgery referral. The results demonstrate that such machine learning methods can contribute to predicting pediatric epilepsy surgery candidates and reducing lag time to surgery referral. |
format | Online Article Text |
id | pubmed-4876984 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Libertas Academica |
record_format | MEDLINE/PubMed |
spelling | pubmed-48769842016-06-02 Methodological Issues in Predicting Pediatric Epilepsy Surgery Candidates Through Natural Language Processing and Machine Learning Cohen, Kevin Bretonnel Glass, Benjamin Greiner, Hansel M. Holland-Bouley, Katherine Standridge, Shannon Arya, Ravindra Faist, Robert Morita, Diego Mangano, Francesco Connolly, Brian Glauser, Tracy Pestian, John Biomed Inform Insights Original Research Objective: We describe the development and evaluation of a system that uses machine learning and natural language processing techniques to identify potential candidates for surgical intervention for drug-resistant pediatric epilepsy. The data are comprised of free-text clinical notes extracted from the electronic health record (EHR). Both known clinical outcomes from the EHR and manual chart annotations provide gold standards for the patient’s status. The following hypotheses are then tested: 1) machine learning methods can identify epilepsy surgery candidates as well as physicians do and 2) machine learning methods can identify candidates earlier than physicians do. These hypotheses are tested by systematically evaluating the effects of the data source, amount of training data, class balance, classification algorithm, and feature set on classifier performance. The results support both hypotheses, with F-measures ranging from 0.71 to 0.82. The feature set, classification algorithm, amount of training data, class balance, and gold standard all significantly affected classification performance. It was further observed that classification performance was better than the highest agreement between two annotators, even at one year before documented surgery referral. The results demonstrate that such machine learning methods can contribute to predicting pediatric epilepsy surgery candidates and reducing lag time to surgery referral. Libertas Academica 2016-05-22 /pmc/articles/PMC4876984/ /pubmed/27257386 http://dx.doi.org/10.4137/BII.S38308 Text en © 2016 the author(s), publisher and licensee Libertas Academica Ltd. This is an open-access article distributed under the terms of the Creative Commons CC-BY-NC 3.0 License. |
spellingShingle | Original Research Cohen, Kevin Bretonnel Glass, Benjamin Greiner, Hansel M. Holland-Bouley, Katherine Standridge, Shannon Arya, Ravindra Faist, Robert Morita, Diego Mangano, Francesco Connolly, Brian Glauser, Tracy Pestian, John Methodological Issues in Predicting Pediatric Epilepsy Surgery Candidates Through Natural Language Processing and Machine Learning |
title | Methodological Issues in Predicting Pediatric Epilepsy Surgery Candidates Through Natural Language Processing and Machine Learning |
title_full | Methodological Issues in Predicting Pediatric Epilepsy Surgery Candidates Through Natural Language Processing and Machine Learning |
title_fullStr | Methodological Issues in Predicting Pediatric Epilepsy Surgery Candidates Through Natural Language Processing and Machine Learning |
title_full_unstemmed | Methodological Issues in Predicting Pediatric Epilepsy Surgery Candidates Through Natural Language Processing and Machine Learning |
title_short | Methodological Issues in Predicting Pediatric Epilepsy Surgery Candidates Through Natural Language Processing and Machine Learning |
title_sort | methodological issues in predicting pediatric epilepsy surgery candidates through natural language processing and machine learning |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4876984/ https://www.ncbi.nlm.nih.gov/pubmed/27257386 http://dx.doi.org/10.4137/BII.S38308 |
work_keys_str_mv | AT cohenkevinbretonnel methodologicalissuesinpredictingpediatricepilepsysurgerycandidatesthroughnaturallanguageprocessingandmachinelearning AT glassbenjamin methodologicalissuesinpredictingpediatricepilepsysurgerycandidatesthroughnaturallanguageprocessingandmachinelearning AT greinerhanselm methodologicalissuesinpredictingpediatricepilepsysurgerycandidatesthroughnaturallanguageprocessingandmachinelearning AT hollandbouleykatherine methodologicalissuesinpredictingpediatricepilepsysurgerycandidatesthroughnaturallanguageprocessingandmachinelearning AT standridgeshannon methodologicalissuesinpredictingpediatricepilepsysurgerycandidatesthroughnaturallanguageprocessingandmachinelearning AT aryaravindra methodologicalissuesinpredictingpediatricepilepsysurgerycandidatesthroughnaturallanguageprocessingandmachinelearning AT faistrobert methodologicalissuesinpredictingpediatricepilepsysurgerycandidatesthroughnaturallanguageprocessingandmachinelearning AT moritadiego methodologicalissuesinpredictingpediatricepilepsysurgerycandidatesthroughnaturallanguageprocessingandmachinelearning AT manganofrancesco methodologicalissuesinpredictingpediatricepilepsysurgerycandidatesthroughnaturallanguageprocessingandmachinelearning AT connollybrian methodologicalissuesinpredictingpediatricepilepsysurgerycandidatesthroughnaturallanguageprocessingandmachinelearning AT glausertracy methodologicalissuesinpredictingpediatricepilepsysurgerycandidatesthroughnaturallanguageprocessingandmachinelearning AT pestianjohn methodologicalissuesinpredictingpediatricepilepsysurgerycandidatesthroughnaturallanguageprocessingandmachinelearning |