Cargando…

Physical Features and Vital Signs Predict Serum Albumin and Globulin Concentrations Using Machine Learning

OBJECTIVE: Serum protein concentrations are diagnostically and prognostically valuable in cancer and other diseases, but their measurement via blood test is uncomfortable, inconvenient, and costly. This study investigates the possibility of predicting albumin, globulin, and albumin-globulin ratio fr...

Descripción completa

Detalles Bibliográficos
Autores principales: Wei, Jing, Xiang, Jie, Yasin, Yousef, Barszczyk, Andrew, Wah, Deanne Tak On, Yu, Meifen, Huang, Wendy Wenyu, Feng, Zhong-Ping, Lee, Kang, Luo, Hong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: West Asia Organization for Cancer Prevention 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8190348/
https://www.ncbi.nlm.nih.gov/pubmed/33639645
http://dx.doi.org/10.31557/APJCP.2021.22.2.333
_version_ 1783705667695017984
author Wei, Jing
Xiang, Jie
Yasin, Yousef
Barszczyk, Andrew
Wah, Deanne Tak On
Yu, Meifen
Huang, Wendy Wenyu
Feng, Zhong-Ping
Lee, Kang
Luo, Hong
author_facet Wei, Jing
Xiang, Jie
Yasin, Yousef
Barszczyk, Andrew
Wah, Deanne Tak On
Yu, Meifen
Huang, Wendy Wenyu
Feng, Zhong-Ping
Lee, Kang
Luo, Hong
author_sort Wei, Jing
collection PubMed
description OBJECTIVE: Serum protein concentrations are diagnostically and prognostically valuable in cancer and other diseases, but their measurement via blood test is uncomfortable, inconvenient, and costly. This study investigates the possibility of predicting albumin, globulin, and albumin-globulin ratio from easily accessible physical characteristics (height, weight, Body Mass Index, age, gender) and vital signs (systolic blood pressure, diastolic blood pressure, mean arterial pressure, pulse pressure, pulse) using advanced machine learning techniques. METHODS: We obtained albumin concentration, globulin concentration, albumin-globulin ratio and predictor information (physical characteristics, vital signs) from physical exam records of 46,951 healthy adult participants in Hangzhou, China. We trained a computational model to predict each serum protein concentration from the predictors and then evaluated the predictive accuracy of each model on an independent portion of the dataset that was not used in model training. We also determined the relative importance of each feature within the model. RESULTS: Prediction accuracies were r=0.540 (95% CI: 0.539-0.540; Pearson r) for albumin, r=0.250 (95% CI: 0.249-0.251) for globulin, and r=0.373 (95% CI: 0.372-0.374) for albumin-globulin ratio. The most important predictive features were age (100% ± 0.0%; mean ± 95% CI of normalized importance), gender (34.4% ± 0.7%), pulse (25.6% ± 1.3%) and Body Mass Index (24.4% ± 2.3%) for albumin, pulse (83.7% ± 3.8%) for globulin, and age (99.2% ± 1.0%), gender (59.2% ± 1.7%), Body Mass Index (46.1% ± 4.2%) and height (40.0% ± 3.8%) for albumin-globulin ratio. CONCLUSIONS: Our models predicted serum protein concentrations with appreciable accuracy showing the promise of this approach. Such models could serve to augment existing tools for identifying “at-risk” individuals for follow-up with a blood test.
format Online
Article
Text
id pubmed-8190348
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher West Asia Organization for Cancer Prevention
record_format MEDLINE/PubMed
spelling pubmed-81903482021-06-11 Physical Features and Vital Signs Predict Serum Albumin and Globulin Concentrations Using Machine Learning Wei, Jing Xiang, Jie Yasin, Yousef Barszczyk, Andrew Wah, Deanne Tak On Yu, Meifen Huang, Wendy Wenyu Feng, Zhong-Ping Lee, Kang Luo, Hong Asian Pac J Cancer Prev Research Article OBJECTIVE: Serum protein concentrations are diagnostically and prognostically valuable in cancer and other diseases, but their measurement via blood test is uncomfortable, inconvenient, and costly. This study investigates the possibility of predicting albumin, globulin, and albumin-globulin ratio from easily accessible physical characteristics (height, weight, Body Mass Index, age, gender) and vital signs (systolic blood pressure, diastolic blood pressure, mean arterial pressure, pulse pressure, pulse) using advanced machine learning techniques. METHODS: We obtained albumin concentration, globulin concentration, albumin-globulin ratio and predictor information (physical characteristics, vital signs) from physical exam records of 46,951 healthy adult participants in Hangzhou, China. We trained a computational model to predict each serum protein concentration from the predictors and then evaluated the predictive accuracy of each model on an independent portion of the dataset that was not used in model training. We also determined the relative importance of each feature within the model. RESULTS: Prediction accuracies were r=0.540 (95% CI: 0.539-0.540; Pearson r) for albumin, r=0.250 (95% CI: 0.249-0.251) for globulin, and r=0.373 (95% CI: 0.372-0.374) for albumin-globulin ratio. The most important predictive features were age (100% ± 0.0%; mean ± 95% CI of normalized importance), gender (34.4% ± 0.7%), pulse (25.6% ± 1.3%) and Body Mass Index (24.4% ± 2.3%) for albumin, pulse (83.7% ± 3.8%) for globulin, and age (99.2% ± 1.0%), gender (59.2% ± 1.7%), Body Mass Index (46.1% ± 4.2%) and height (40.0% ± 3.8%) for albumin-globulin ratio. CONCLUSIONS: Our models predicted serum protein concentrations with appreciable accuracy showing the promise of this approach. Such models could serve to augment existing tools for identifying “at-risk” individuals for follow-up with a blood test. West Asia Organization for Cancer Prevention 2021-02 /pmc/articles/PMC8190348/ /pubmed/33639645 http://dx.doi.org/10.31557/APJCP.2021.22.2.333 Text en https://creativecommons.org/licenses/by/3.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License, (http://creativecommons.org/licenses/by/3.0/ (https://creativecommons.org/licenses/by/3.0/) ) which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Wei, Jing
Xiang, Jie
Yasin, Yousef
Barszczyk, Andrew
Wah, Deanne Tak On
Yu, Meifen
Huang, Wendy Wenyu
Feng, Zhong-Ping
Lee, Kang
Luo, Hong
Physical Features and Vital Signs Predict Serum Albumin and Globulin Concentrations Using Machine Learning
title Physical Features and Vital Signs Predict Serum Albumin and Globulin Concentrations Using Machine Learning
title_full Physical Features and Vital Signs Predict Serum Albumin and Globulin Concentrations Using Machine Learning
title_fullStr Physical Features and Vital Signs Predict Serum Albumin and Globulin Concentrations Using Machine Learning
title_full_unstemmed Physical Features and Vital Signs Predict Serum Albumin and Globulin Concentrations Using Machine Learning
title_short Physical Features and Vital Signs Predict Serum Albumin and Globulin Concentrations Using Machine Learning
title_sort physical features and vital signs predict serum albumin and globulin concentrations using machine learning
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8190348/
https://www.ncbi.nlm.nih.gov/pubmed/33639645
http://dx.doi.org/10.31557/APJCP.2021.22.2.333
work_keys_str_mv AT weijing physicalfeaturesandvitalsignspredictserumalbuminandglobulinconcentrationsusingmachinelearning
AT xiangjie physicalfeaturesandvitalsignspredictserumalbuminandglobulinconcentrationsusingmachinelearning
AT yasinyousef physicalfeaturesandvitalsignspredictserumalbuminandglobulinconcentrationsusingmachinelearning
AT barszczykandrew physicalfeaturesandvitalsignspredictserumalbuminandglobulinconcentrationsusingmachinelearning
AT wahdeannetakon physicalfeaturesandvitalsignspredictserumalbuminandglobulinconcentrationsusingmachinelearning
AT yumeifen physicalfeaturesandvitalsignspredictserumalbuminandglobulinconcentrationsusingmachinelearning
AT huangwendywenyu physicalfeaturesandvitalsignspredictserumalbuminandglobulinconcentrationsusingmachinelearning
AT fengzhongping physicalfeaturesandvitalsignspredictserumalbuminandglobulinconcentrationsusingmachinelearning
AT leekang physicalfeaturesandvitalsignspredictserumalbuminandglobulinconcentrationsusingmachinelearning
AT luohong physicalfeaturesandvitalsignspredictserumalbuminandglobulinconcentrationsusingmachinelearning