Cargando…

Web-Based Newborn Screening System for Metabolic Diseases: Machine Learning Versus Clinicians

BACKGROUND: A hospital information system (HIS) that integrates screening data and interpretation of the data is routinely requested by hospitals and parents. However, the accuracy of disease classification may be low because of the disease characteristics and the analytes used for classification. O...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Wei-Hsin, Hsieh, Sheau-Ling, Hsu, Kai-Ping, Chen, Han-Ping, Su, Xing-Yu, Tseng, Yi-Ju, Chien, Yin-Hsiu, Hwu, Wuh-Liang, Lai, Feipei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications Inc. 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3668606/
https://www.ncbi.nlm.nih.gov/pubmed/23702487
http://dx.doi.org/10.2196/jmir.2495
_version_ 1782271642570850304
author Chen, Wei-Hsin
Hsieh, Sheau-Ling
Hsu, Kai-Ping
Chen, Han-Ping
Su, Xing-Yu
Tseng, Yi-Ju
Chien, Yin-Hsiu
Hwu, Wuh-Liang
Lai, Feipei
author_facet Chen, Wei-Hsin
Hsieh, Sheau-Ling
Hsu, Kai-Ping
Chen, Han-Ping
Su, Xing-Yu
Tseng, Yi-Ju
Chien, Yin-Hsiu
Hwu, Wuh-Liang
Lai, Feipei
author_sort Chen, Wei-Hsin
collection PubMed
description BACKGROUND: A hospital information system (HIS) that integrates screening data and interpretation of the data is routinely requested by hospitals and parents. However, the accuracy of disease classification may be low because of the disease characteristics and the analytes used for classification. OBJECTIVE: The objective of this study is to describe a system that enhanced the neonatal screening system of the Newborn Screening Center at the National Taiwan University Hospital. The system was designed and deployed according to a service-oriented architecture (SOA) framework under the Web services .NET environment. The system consists of sample collection, testing, diagnosis, evaluation, treatment, and follow-up services among collaborating hospitals. To improve the accuracy of newborn screening, machine learning and optimal feature selection mechanisms were investigated for screening newborns for inborn errors of metabolism. METHODS: The framework of the Newborn Screening Hospital Information System (NSHIS) used the embedded Health Level Seven (HL7) standards for data exchanges among heterogeneous platforms integrated by Web services in the C# language. In this study, machine learning classification was used to predict phenylketonuria (PKU), hypermethioninemia, and 3-methylcrotonyl-CoA-carboxylase (3-MCC) deficiency. The classification methods used 347,312 newborn dried blood samples collected at the Center between 2006 and 2011. Of these, 220 newborns had values over the diagnostic cutoffs (positive cases) and 1557 had values that were over the screening cutoffs but did not meet the diagnostic cutoffs (suspected cases). The original 35 analytes and the manifested features were ranked based on F score, then combinations of the top 20 ranked features were selected as input features to support vector machine (SVM) classifiers to obtain optimal feature sets. These feature sets were tested using 5-fold cross-validation and optimal models were generated. The datasets collected in year 2011 were used as predicting cases. RESULTS: The feature selection strategies were implemented and the optimal markers for PKU, hypermethioninemia, and 3-MCC deficiency were obtained. The results of the machine learning approach were compared with the cutoff scheme. The number of the false positive cases were reduced from 21 to 2 for PKU, from 30 to 10 for hypermethioninemia, and 209 to 46 for 3-MCC deficiency. CONCLUSIONS: This SOA Web service–based newborn screening system can accelerate screening procedures effectively and efficiently. An SVM learning methodology for PKU, hypermethioninemia, and 3-MCC deficiency metabolic diseases classification, including optimal feature selection strategies, is presented. By adopting the results of this study, the number of suspected cases could be reduced dramatically.
format Online
Article
Text
id pubmed-3668606
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher JMIR Publications Inc.
record_format MEDLINE/PubMed
spelling pubmed-36686062013-06-03 Web-Based Newborn Screening System for Metabolic Diseases: Machine Learning Versus Clinicians Chen, Wei-Hsin Hsieh, Sheau-Ling Hsu, Kai-Ping Chen, Han-Ping Su, Xing-Yu Tseng, Yi-Ju Chien, Yin-Hsiu Hwu, Wuh-Liang Lai, Feipei J Med Internet Res Original Paper BACKGROUND: A hospital information system (HIS) that integrates screening data and interpretation of the data is routinely requested by hospitals and parents. However, the accuracy of disease classification may be low because of the disease characteristics and the analytes used for classification. OBJECTIVE: The objective of this study is to describe a system that enhanced the neonatal screening system of the Newborn Screening Center at the National Taiwan University Hospital. The system was designed and deployed according to a service-oriented architecture (SOA) framework under the Web services .NET environment. The system consists of sample collection, testing, diagnosis, evaluation, treatment, and follow-up services among collaborating hospitals. To improve the accuracy of newborn screening, machine learning and optimal feature selection mechanisms were investigated for screening newborns for inborn errors of metabolism. METHODS: The framework of the Newborn Screening Hospital Information System (NSHIS) used the embedded Health Level Seven (HL7) standards for data exchanges among heterogeneous platforms integrated by Web services in the C# language. In this study, machine learning classification was used to predict phenylketonuria (PKU), hypermethioninemia, and 3-methylcrotonyl-CoA-carboxylase (3-MCC) deficiency. The classification methods used 347,312 newborn dried blood samples collected at the Center between 2006 and 2011. Of these, 220 newborns had values over the diagnostic cutoffs (positive cases) and 1557 had values that were over the screening cutoffs but did not meet the diagnostic cutoffs (suspected cases). The original 35 analytes and the manifested features were ranked based on F score, then combinations of the top 20 ranked features were selected as input features to support vector machine (SVM) classifiers to obtain optimal feature sets. These feature sets were tested using 5-fold cross-validation and optimal models were generated. The datasets collected in year 2011 were used as predicting cases. RESULTS: The feature selection strategies were implemented and the optimal markers for PKU, hypermethioninemia, and 3-MCC deficiency were obtained. The results of the machine learning approach were compared with the cutoff scheme. The number of the false positive cases were reduced from 21 to 2 for PKU, from 30 to 10 for hypermethioninemia, and 209 to 46 for 3-MCC deficiency. CONCLUSIONS: This SOA Web service–based newborn screening system can accelerate screening procedures effectively and efficiently. An SVM learning methodology for PKU, hypermethioninemia, and 3-MCC deficiency metabolic diseases classification, including optimal feature selection strategies, is presented. By adopting the results of this study, the number of suspected cases could be reduced dramatically. JMIR Publications Inc. 2013-05-23 /pmc/articles/PMC3668606/ /pubmed/23702487 http://dx.doi.org/10.2196/jmir.2495 Text en ©Wei-Hsin Chen, Sheau-Ling Hsieh, Kai-Ping Hsu, Han-Ping Chen, Xing-Yu Su, Yi-Ju Tseng, Yin-Hsiu Chien, Wuh-Liang Hwu, Feipei Lai. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 23.05.2013. http://creativecommons.org/licenses/by/2.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Chen, Wei-Hsin
Hsieh, Sheau-Ling
Hsu, Kai-Ping
Chen, Han-Ping
Su, Xing-Yu
Tseng, Yi-Ju
Chien, Yin-Hsiu
Hwu, Wuh-Liang
Lai, Feipei
Web-Based Newborn Screening System for Metabolic Diseases: Machine Learning Versus Clinicians
title Web-Based Newborn Screening System for Metabolic Diseases: Machine Learning Versus Clinicians
title_full Web-Based Newborn Screening System for Metabolic Diseases: Machine Learning Versus Clinicians
title_fullStr Web-Based Newborn Screening System for Metabolic Diseases: Machine Learning Versus Clinicians
title_full_unstemmed Web-Based Newborn Screening System for Metabolic Diseases: Machine Learning Versus Clinicians
title_short Web-Based Newborn Screening System for Metabolic Diseases: Machine Learning Versus Clinicians
title_sort web-based newborn screening system for metabolic diseases: machine learning versus clinicians
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3668606/
https://www.ncbi.nlm.nih.gov/pubmed/23702487
http://dx.doi.org/10.2196/jmir.2495
work_keys_str_mv AT chenweihsin webbasednewbornscreeningsystemformetabolicdiseasesmachinelearningversusclinicians
AT hsiehsheauling webbasednewbornscreeningsystemformetabolicdiseasesmachinelearningversusclinicians
AT hsukaiping webbasednewbornscreeningsystemformetabolicdiseasesmachinelearningversusclinicians
AT chenhanping webbasednewbornscreeningsystemformetabolicdiseasesmachinelearningversusclinicians
AT suxingyu webbasednewbornscreeningsystemformetabolicdiseasesmachinelearningversusclinicians
AT tsengyiju webbasednewbornscreeningsystemformetabolicdiseasesmachinelearningversusclinicians
AT chienyinhsiu webbasednewbornscreeningsystemformetabolicdiseasesmachinelearningversusclinicians
AT hwuwuhliang webbasednewbornscreeningsystemformetabolicdiseasesmachinelearningversusclinicians
AT laifeipei webbasednewbornscreeningsystemformetabolicdiseasesmachinelearningversusclinicians