Cargando…

Using transfer learning and dimensionality reduction techniques to improve generalisability of machine-learning predictions of mosquito ages from mid-infrared spectra

BACKGROUND: Old mosquitoes are more likely to transmit malaria than young ones. Therefore, accurate prediction of mosquito population age can drastically improve the evaluation of mosquito-targeted interventions. However, standard methods for age-grading mosquitoes are laborious and costly. We have...

Descripción completa

Detalles Bibliográficos
Autores principales: Mwanga, Emmanuel P., Siria, Doreen J., Mitton, Joshua, Mshani, Issa H., González-Jiménez, Mario, Selvaraj, Prashanth, Wynne, Klaas, Baldini, Francesco, Okumu, Fredros O., Babayan, Simon A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9830685/
https://www.ncbi.nlm.nih.gov/pubmed/36624386
http://dx.doi.org/10.1186/s12859-022-05128-5
_version_ 1784867716334616576
author Mwanga, Emmanuel P.
Siria, Doreen J.
Mitton, Joshua
Mshani, Issa H.
González-Jiménez, Mario
Selvaraj, Prashanth
Wynne, Klaas
Baldini, Francesco
Okumu, Fredros O.
Babayan, Simon A.
author_facet Mwanga, Emmanuel P.
Siria, Doreen J.
Mitton, Joshua
Mshani, Issa H.
González-Jiménez, Mario
Selvaraj, Prashanth
Wynne, Klaas
Baldini, Francesco
Okumu, Fredros O.
Babayan, Simon A.
author_sort Mwanga, Emmanuel P.
collection PubMed
description BACKGROUND: Old mosquitoes are more likely to transmit malaria than young ones. Therefore, accurate prediction of mosquito population age can drastically improve the evaluation of mosquito-targeted interventions. However, standard methods for age-grading mosquitoes are laborious and costly. We have shown that Mid-infrared spectroscopy (MIRS) can be used to detect age-specific patterns in mosquito cuticles and thus can be used to train age-grading machine learning models. However, these models tend to transfer poorly across populations. Here, we investigate whether applying dimensionality reduction and transfer learning to MIRS data can improve the transferability of MIRS-based predictions for mosquito ages. METHODS: We reared adults of the malaria vector Anopheles arabiensis in two insectaries. The heads and thoraces of female mosquitoes were scanned using an attenuated total reflection-Fourier transform infrared spectrometer, which were grouped into two different age classes. The dimensionality of the spectra data was reduced using unsupervised principal component analysis or t-distributed stochastic neighbour embedding, and then used to train deep learning and standard machine learning classifiers. Transfer learning was also evaluated to improve transferability of the models when predicting mosquito age classes from new populations. RESULTS: Model accuracies for predicting the age of mosquitoes from the same population as the training samples reached 99% for deep learning and 92% for standard machine learning. However, these models did not generalise to a different population, achieving only 46% and 48% accuracy for deep learning and standard machine learning, respectively. Dimensionality reduction did not improve model generalizability but reduced computational time. Transfer learning by updating pre-trained models with 2% of mosquitoes from the alternate population improved performance to ~ 98% accuracy for predicting mosquito age classes in the alternative population. CONCLUSION: Combining dimensionality reduction and transfer learning can reduce computational costs and improve the transferability of both deep learning and standard machine learning models for predicting the age of mosquitoes. Future studies should investigate the optimal quantities and diversity of training data necessary for transfer learning and the implications for broader generalisability to unseen datasets.
format Online
Article
Text
id pubmed-9830685
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-98306852023-01-11 Using transfer learning and dimensionality reduction techniques to improve generalisability of machine-learning predictions of mosquito ages from mid-infrared spectra Mwanga, Emmanuel P. Siria, Doreen J. Mitton, Joshua Mshani, Issa H. González-Jiménez, Mario Selvaraj, Prashanth Wynne, Klaas Baldini, Francesco Okumu, Fredros O. Babayan, Simon A. BMC Bioinformatics Research BACKGROUND: Old mosquitoes are more likely to transmit malaria than young ones. Therefore, accurate prediction of mosquito population age can drastically improve the evaluation of mosquito-targeted interventions. However, standard methods for age-grading mosquitoes are laborious and costly. We have shown that Mid-infrared spectroscopy (MIRS) can be used to detect age-specific patterns in mosquito cuticles and thus can be used to train age-grading machine learning models. However, these models tend to transfer poorly across populations. Here, we investigate whether applying dimensionality reduction and transfer learning to MIRS data can improve the transferability of MIRS-based predictions for mosquito ages. METHODS: We reared adults of the malaria vector Anopheles arabiensis in two insectaries. The heads and thoraces of female mosquitoes were scanned using an attenuated total reflection-Fourier transform infrared spectrometer, which were grouped into two different age classes. The dimensionality of the spectra data was reduced using unsupervised principal component analysis or t-distributed stochastic neighbour embedding, and then used to train deep learning and standard machine learning classifiers. Transfer learning was also evaluated to improve transferability of the models when predicting mosquito age classes from new populations. RESULTS: Model accuracies for predicting the age of mosquitoes from the same population as the training samples reached 99% for deep learning and 92% for standard machine learning. However, these models did not generalise to a different population, achieving only 46% and 48% accuracy for deep learning and standard machine learning, respectively. Dimensionality reduction did not improve model generalizability but reduced computational time. Transfer learning by updating pre-trained models with 2% of mosquitoes from the alternate population improved performance to ~ 98% accuracy for predicting mosquito age classes in the alternative population. CONCLUSION: Combining dimensionality reduction and transfer learning can reduce computational costs and improve the transferability of both deep learning and standard machine learning models for predicting the age of mosquitoes. Future studies should investigate the optimal quantities and diversity of training data necessary for transfer learning and the implications for broader generalisability to unseen datasets. BioMed Central 2023-01-09 /pmc/articles/PMC9830685/ /pubmed/36624386 http://dx.doi.org/10.1186/s12859-022-05128-5 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Mwanga, Emmanuel P.
Siria, Doreen J.
Mitton, Joshua
Mshani, Issa H.
González-Jiménez, Mario
Selvaraj, Prashanth
Wynne, Klaas
Baldini, Francesco
Okumu, Fredros O.
Babayan, Simon A.
Using transfer learning and dimensionality reduction techniques to improve generalisability of machine-learning predictions of mosquito ages from mid-infrared spectra
title Using transfer learning and dimensionality reduction techniques to improve generalisability of machine-learning predictions of mosquito ages from mid-infrared spectra
title_full Using transfer learning and dimensionality reduction techniques to improve generalisability of machine-learning predictions of mosquito ages from mid-infrared spectra
title_fullStr Using transfer learning and dimensionality reduction techniques to improve generalisability of machine-learning predictions of mosquito ages from mid-infrared spectra
title_full_unstemmed Using transfer learning and dimensionality reduction techniques to improve generalisability of machine-learning predictions of mosquito ages from mid-infrared spectra
title_short Using transfer learning and dimensionality reduction techniques to improve generalisability of machine-learning predictions of mosquito ages from mid-infrared spectra
title_sort using transfer learning and dimensionality reduction techniques to improve generalisability of machine-learning predictions of mosquito ages from mid-infrared spectra
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9830685/
https://www.ncbi.nlm.nih.gov/pubmed/36624386
http://dx.doi.org/10.1186/s12859-022-05128-5
work_keys_str_mv AT mwangaemmanuelp usingtransferlearninganddimensionalityreductiontechniquestoimprovegeneralisabilityofmachinelearningpredictionsofmosquitoagesfrommidinfraredspectra
AT siriadoreenj usingtransferlearninganddimensionalityreductiontechniquestoimprovegeneralisabilityofmachinelearningpredictionsofmosquitoagesfrommidinfraredspectra
AT mittonjoshua usingtransferlearninganddimensionalityreductiontechniquestoimprovegeneralisabilityofmachinelearningpredictionsofmosquitoagesfrommidinfraredspectra
AT mshaniissah usingtransferlearninganddimensionalityreductiontechniquestoimprovegeneralisabilityofmachinelearningpredictionsofmosquitoagesfrommidinfraredspectra
AT gonzalezjimenezmario usingtransferlearninganddimensionalityreductiontechniquestoimprovegeneralisabilityofmachinelearningpredictionsofmosquitoagesfrommidinfraredspectra
AT selvarajprashanth usingtransferlearninganddimensionalityreductiontechniquestoimprovegeneralisabilityofmachinelearningpredictionsofmosquitoagesfrommidinfraredspectra
AT wynneklaas usingtransferlearninganddimensionalityreductiontechniquestoimprovegeneralisabilityofmachinelearningpredictionsofmosquitoagesfrommidinfraredspectra
AT baldinifrancesco usingtransferlearninganddimensionalityreductiontechniquestoimprovegeneralisabilityofmachinelearningpredictionsofmosquitoagesfrommidinfraredspectra
AT okumufredroso usingtransferlearninganddimensionalityreductiontechniquestoimprovegeneralisabilityofmachinelearningpredictionsofmosquitoagesfrommidinfraredspectra
AT babayansimona usingtransferlearninganddimensionalityreductiontechniquestoimprovegeneralisabilityofmachinelearningpredictionsofmosquitoagesfrommidinfraredspectra