Cargando…

Data mining and machine learning approaches for prediction modelling of schistosomiasis disease vectors: Epidemic disease prediction modelling

This research presents viable solutions for prediction modelling of schistosomiasis disease based on vector density. Novel training models proposed in this work aim to address various aspects of interest in the artificial intelligence applications domain. Topics discussed include data imputation, se...

Descripción completa

Detalles Bibliográficos
Autores principales: Fusco, Terence, Bi, Yaxin, Wang, Haiying, Browne, Fiona
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Berlin Heidelberg 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7224118/
https://www.ncbi.nlm.nih.gov/pubmed/33727985
http://dx.doi.org/10.1007/s13042-019-01029-x
_version_ 1783533842338938880
author Fusco, Terence
Bi, Yaxin
Wang, Haiying
Browne, Fiona
author_facet Fusco, Terence
Bi, Yaxin
Wang, Haiying
Browne, Fiona
author_sort Fusco, Terence
collection PubMed
description This research presents viable solutions for prediction modelling of schistosomiasis disease based on vector density. Novel training models proposed in this work aim to address various aspects of interest in the artificial intelligence applications domain. Topics discussed include data imputation, semi-supervised labelling and synthetic instance simulation when using sparse training data. Innovative semi-supervised ensemble learning paradigms are proposed focusing on labelling threshold selection and stringency of classification confidence levels. A regression-correlation combination (RCC) data imputation method is also introduced for handling of partially complete training data. Results presented in this work show data imputation precision improvement over benchmark value replacement using proposed RCC on 70% of test cases. Proposed novel incremental transductive models such as ITSVM have provided interesting findings based on threshold constraints outperforming standard SVM application on 21% of test cases and can be applied with alternative environment-based epidemic disease domains. The proposed incremental transductive ensemble approach model enables the combination of complimentary algorithms to provide labelling for unlabelled vector density instances. Liberal (LTA) and strict training approaches provided varied results with LTA outperforming Stacking ensemble on 29.1% of test cases. Proposed novel synthetic minority over-sampling technique (SMOTE) equilibrium approach has yielded subtle classification performance increases which can be further interrogated to assess classification performance and efficiency relationships with synthetic instance generation.
format Online
Article
Text
id pubmed-7224118
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Springer Berlin Heidelberg
record_format MEDLINE/PubMed
spelling pubmed-72241182020-05-15 Data mining and machine learning approaches for prediction modelling of schistosomiasis disease vectors: Epidemic disease prediction modelling Fusco, Terence Bi, Yaxin Wang, Haiying Browne, Fiona Int. J. Mach. Learn. & Cyber. Original Article This research presents viable solutions for prediction modelling of schistosomiasis disease based on vector density. Novel training models proposed in this work aim to address various aspects of interest in the artificial intelligence applications domain. Topics discussed include data imputation, semi-supervised labelling and synthetic instance simulation when using sparse training data. Innovative semi-supervised ensemble learning paradigms are proposed focusing on labelling threshold selection and stringency of classification confidence levels. A regression-correlation combination (RCC) data imputation method is also introduced for handling of partially complete training data. Results presented in this work show data imputation precision improvement over benchmark value replacement using proposed RCC on 70% of test cases. Proposed novel incremental transductive models such as ITSVM have provided interesting findings based on threshold constraints outperforming standard SVM application on 21% of test cases and can be applied with alternative environment-based epidemic disease domains. The proposed incremental transductive ensemble approach model enables the combination of complimentary algorithms to provide labelling for unlabelled vector density instances. Liberal (LTA) and strict training approaches provided varied results with LTA outperforming Stacking ensemble on 29.1% of test cases. Proposed novel synthetic minority over-sampling technique (SMOTE) equilibrium approach has yielded subtle classification performance increases which can be further interrogated to assess classification performance and efficiency relationships with synthetic instance generation. Springer Berlin Heidelberg 2019-11-18 2020 /pmc/articles/PMC7224118/ /pubmed/33727985 http://dx.doi.org/10.1007/s13042-019-01029-x Text en © Springer-Verlag GmbH Germany, part of Springer Nature 2019 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Original Article
Fusco, Terence
Bi, Yaxin
Wang, Haiying
Browne, Fiona
Data mining and machine learning approaches for prediction modelling of schistosomiasis disease vectors: Epidemic disease prediction modelling
title Data mining and machine learning approaches for prediction modelling of schistosomiasis disease vectors: Epidemic disease prediction modelling
title_full Data mining and machine learning approaches for prediction modelling of schistosomiasis disease vectors: Epidemic disease prediction modelling
title_fullStr Data mining and machine learning approaches for prediction modelling of schistosomiasis disease vectors: Epidemic disease prediction modelling
title_full_unstemmed Data mining and machine learning approaches for prediction modelling of schistosomiasis disease vectors: Epidemic disease prediction modelling
title_short Data mining and machine learning approaches for prediction modelling of schistosomiasis disease vectors: Epidemic disease prediction modelling
title_sort data mining and machine learning approaches for prediction modelling of schistosomiasis disease vectors: epidemic disease prediction modelling
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7224118/
https://www.ncbi.nlm.nih.gov/pubmed/33727985
http://dx.doi.org/10.1007/s13042-019-01029-x
work_keys_str_mv AT fuscoterence dataminingandmachinelearningapproachesforpredictionmodellingofschistosomiasisdiseasevectorsepidemicdiseasepredictionmodelling
AT biyaxin dataminingandmachinelearningapproachesforpredictionmodellingofschistosomiasisdiseasevectorsepidemicdiseasepredictionmodelling
AT wanghaiying dataminingandmachinelearningapproachesforpredictionmodellingofschistosomiasisdiseasevectorsepidemicdiseasepredictionmodelling
AT brownefiona dataminingandmachinelearningapproachesforpredictionmodellingofschistosomiasisdiseasevectorsepidemicdiseasepredictionmodelling