Cargando…
Web-Based Newborn Screening System for Metabolic Diseases: Machine Learning Versus Clinicians
BACKGROUND: A hospital information system (HIS) that integrates screening data and interpretation of the data is routinely requested by hospitals and parents. However, the accuracy of disease classification may be low because of the disease characteristics and the analytes used for classification. O...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications Inc.
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3668606/ https://www.ncbi.nlm.nih.gov/pubmed/23702487 http://dx.doi.org/10.2196/jmir.2495 |
_version_ | 1782271642570850304 |
---|---|
author | Chen, Wei-Hsin Hsieh, Sheau-Ling Hsu, Kai-Ping Chen, Han-Ping Su, Xing-Yu Tseng, Yi-Ju Chien, Yin-Hsiu Hwu, Wuh-Liang Lai, Feipei |
author_facet | Chen, Wei-Hsin Hsieh, Sheau-Ling Hsu, Kai-Ping Chen, Han-Ping Su, Xing-Yu Tseng, Yi-Ju Chien, Yin-Hsiu Hwu, Wuh-Liang Lai, Feipei |
author_sort | Chen, Wei-Hsin |
collection | PubMed |
description | BACKGROUND: A hospital information system (HIS) that integrates screening data and interpretation of the data is routinely requested by hospitals and parents. However, the accuracy of disease classification may be low because of the disease characteristics and the analytes used for classification. OBJECTIVE: The objective of this study is to describe a system that enhanced the neonatal screening system of the Newborn Screening Center at the National Taiwan University Hospital. The system was designed and deployed according to a service-oriented architecture (SOA) framework under the Web services .NET environment. The system consists of sample collection, testing, diagnosis, evaluation, treatment, and follow-up services among collaborating hospitals. To improve the accuracy of newborn screening, machine learning and optimal feature selection mechanisms were investigated for screening newborns for inborn errors of metabolism. METHODS: The framework of the Newborn Screening Hospital Information System (NSHIS) used the embedded Health Level Seven (HL7) standards for data exchanges among heterogeneous platforms integrated by Web services in the C# language. In this study, machine learning classification was used to predict phenylketonuria (PKU), hypermethioninemia, and 3-methylcrotonyl-CoA-carboxylase (3-MCC) deficiency. The classification methods used 347,312 newborn dried blood samples collected at the Center between 2006 and 2011. Of these, 220 newborns had values over the diagnostic cutoffs (positive cases) and 1557 had values that were over the screening cutoffs but did not meet the diagnostic cutoffs (suspected cases). The original 35 analytes and the manifested features were ranked based on F score, then combinations of the top 20 ranked features were selected as input features to support vector machine (SVM) classifiers to obtain optimal feature sets. These feature sets were tested using 5-fold cross-validation and optimal models were generated. The datasets collected in year 2011 were used as predicting cases. RESULTS: The feature selection strategies were implemented and the optimal markers for PKU, hypermethioninemia, and 3-MCC deficiency were obtained. The results of the machine learning approach were compared with the cutoff scheme. The number of the false positive cases were reduced from 21 to 2 for PKU, from 30 to 10 for hypermethioninemia, and 209 to 46 for 3-MCC deficiency. CONCLUSIONS: This SOA Web service–based newborn screening system can accelerate screening procedures effectively and efficiently. An SVM learning methodology for PKU, hypermethioninemia, and 3-MCC deficiency metabolic diseases classification, including optimal feature selection strategies, is presented. By adopting the results of this study, the number of suspected cases could be reduced dramatically. |
format | Online Article Text |
id | pubmed-3668606 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | JMIR Publications Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-36686062013-06-03 Web-Based Newborn Screening System for Metabolic Diseases: Machine Learning Versus Clinicians Chen, Wei-Hsin Hsieh, Sheau-Ling Hsu, Kai-Ping Chen, Han-Ping Su, Xing-Yu Tseng, Yi-Ju Chien, Yin-Hsiu Hwu, Wuh-Liang Lai, Feipei J Med Internet Res Original Paper BACKGROUND: A hospital information system (HIS) that integrates screening data and interpretation of the data is routinely requested by hospitals and parents. However, the accuracy of disease classification may be low because of the disease characteristics and the analytes used for classification. OBJECTIVE: The objective of this study is to describe a system that enhanced the neonatal screening system of the Newborn Screening Center at the National Taiwan University Hospital. The system was designed and deployed according to a service-oriented architecture (SOA) framework under the Web services .NET environment. The system consists of sample collection, testing, diagnosis, evaluation, treatment, and follow-up services among collaborating hospitals. To improve the accuracy of newborn screening, machine learning and optimal feature selection mechanisms were investigated for screening newborns for inborn errors of metabolism. METHODS: The framework of the Newborn Screening Hospital Information System (NSHIS) used the embedded Health Level Seven (HL7) standards for data exchanges among heterogeneous platforms integrated by Web services in the C# language. In this study, machine learning classification was used to predict phenylketonuria (PKU), hypermethioninemia, and 3-methylcrotonyl-CoA-carboxylase (3-MCC) deficiency. The classification methods used 347,312 newborn dried blood samples collected at the Center between 2006 and 2011. Of these, 220 newborns had values over the diagnostic cutoffs (positive cases) and 1557 had values that were over the screening cutoffs but did not meet the diagnostic cutoffs (suspected cases). The original 35 analytes and the manifested features were ranked based on F score, then combinations of the top 20 ranked features were selected as input features to support vector machine (SVM) classifiers to obtain optimal feature sets. These feature sets were tested using 5-fold cross-validation and optimal models were generated. The datasets collected in year 2011 were used as predicting cases. RESULTS: The feature selection strategies were implemented and the optimal markers for PKU, hypermethioninemia, and 3-MCC deficiency were obtained. The results of the machine learning approach were compared with the cutoff scheme. The number of the false positive cases were reduced from 21 to 2 for PKU, from 30 to 10 for hypermethioninemia, and 209 to 46 for 3-MCC deficiency. CONCLUSIONS: This SOA Web service–based newborn screening system can accelerate screening procedures effectively and efficiently. An SVM learning methodology for PKU, hypermethioninemia, and 3-MCC deficiency metabolic diseases classification, including optimal feature selection strategies, is presented. By adopting the results of this study, the number of suspected cases could be reduced dramatically. JMIR Publications Inc. 2013-05-23 /pmc/articles/PMC3668606/ /pubmed/23702487 http://dx.doi.org/10.2196/jmir.2495 Text en ©Wei-Hsin Chen, Sheau-Ling Hsieh, Kai-Ping Hsu, Han-Ping Chen, Xing-Yu Su, Yi-Ju Tseng, Yin-Hsiu Chien, Wuh-Liang Hwu, Feipei Lai. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 23.05.2013. http://creativecommons.org/licenses/by/2.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included. |
spellingShingle | Original Paper Chen, Wei-Hsin Hsieh, Sheau-Ling Hsu, Kai-Ping Chen, Han-Ping Su, Xing-Yu Tseng, Yi-Ju Chien, Yin-Hsiu Hwu, Wuh-Liang Lai, Feipei Web-Based Newborn Screening System for Metabolic Diseases: Machine Learning Versus Clinicians |
title | Web-Based Newborn Screening System for Metabolic Diseases: Machine Learning Versus Clinicians |
title_full | Web-Based Newborn Screening System for Metabolic Diseases: Machine Learning Versus Clinicians |
title_fullStr | Web-Based Newborn Screening System for Metabolic Diseases: Machine Learning Versus Clinicians |
title_full_unstemmed | Web-Based Newborn Screening System for Metabolic Diseases: Machine Learning Versus Clinicians |
title_short | Web-Based Newborn Screening System for Metabolic Diseases: Machine Learning Versus Clinicians |
title_sort | web-based newborn screening system for metabolic diseases: machine learning versus clinicians |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3668606/ https://www.ncbi.nlm.nih.gov/pubmed/23702487 http://dx.doi.org/10.2196/jmir.2495 |
work_keys_str_mv | AT chenweihsin webbasednewbornscreeningsystemformetabolicdiseasesmachinelearningversusclinicians AT hsiehsheauling webbasednewbornscreeningsystemformetabolicdiseasesmachinelearningversusclinicians AT hsukaiping webbasednewbornscreeningsystemformetabolicdiseasesmachinelearningversusclinicians AT chenhanping webbasednewbornscreeningsystemformetabolicdiseasesmachinelearningversusclinicians AT suxingyu webbasednewbornscreeningsystemformetabolicdiseasesmachinelearningversusclinicians AT tsengyiju webbasednewbornscreeningsystemformetabolicdiseasesmachinelearningversusclinicians AT chienyinhsiu webbasednewbornscreeningsystemformetabolicdiseasesmachinelearningversusclinicians AT hwuwuhliang webbasednewbornscreeningsystemformetabolicdiseasesmachinelearningversusclinicians AT laifeipei webbasednewbornscreeningsystemformetabolicdiseasesmachinelearningversusclinicians |