Cargando…

An Effective Integrated Machine Learning Framework for Identifying Severity of Tomato Yellow Leaf Curl Virus and Their Experimental Validation

Tomato yellow leaf curl virus (TYLCV) dispersed across different countries, specifically to subtropical regions, associated with more severe symptoms. Since TYLCV was first isolated in 1931, it has been a menace to tomato industrial production worldwide over the past century. Three groups were newly...

Descripción completa

Detalles Bibliográficos
Autores principales: Bupi, Nattanong, Sangaraju, Vinoth Kumar, Phan, Le Thi, Lal, Aamir, Vo, Thuy Thi Bich, Ho, Phuong Thi, Qureshi, Muhammad Amir, Tabassum, Marjia, Lee, Sukchan, Manavalan, Balachandran
Formato: Online Artículo Texto
Lenguaje:English
Publicado: AAAS 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10013792/
https://www.ncbi.nlm.nih.gov/pubmed/36930763
http://dx.doi.org/10.34133/research.0016
_version_ 1784906853080104960
author Bupi, Nattanong
Sangaraju, Vinoth Kumar
Phan, Le Thi
Lal, Aamir
Vo, Thuy Thi Bich
Ho, Phuong Thi
Qureshi, Muhammad Amir
Tabassum, Marjia
Lee, Sukchan
Manavalan, Balachandran
author_facet Bupi, Nattanong
Sangaraju, Vinoth Kumar
Phan, Le Thi
Lal, Aamir
Vo, Thuy Thi Bich
Ho, Phuong Thi
Qureshi, Muhammad Amir
Tabassum, Marjia
Lee, Sukchan
Manavalan, Balachandran
author_sort Bupi, Nattanong
collection PubMed
description Tomato yellow leaf curl virus (TYLCV) dispersed across different countries, specifically to subtropical regions, associated with more severe symptoms. Since TYLCV was first isolated in 1931, it has been a menace to tomato industrial production worldwide over the past century. Three groups were newly isolated from TYLCV-resistant tomatoes in 2022; however, their functions are unknown. The development of machine learning (ML)-based models using characterized sequences and evaluating blind predictions is one of the major challenges in interdisciplinary research. The purpose of this study was to develop an integrated computational framework for the accurate identification of symptoms (mild or severe) based on TYLCV sequences (isolated in Korea). For the development of the framework, we first extracted 11 different feature encodings and hybrid features from the training data and then explored 8 different classifiers and developed their respective prediction models by using randomized 10-fold cross-validation. Subsequently, we carried out a systematic evaluation of these 96 developed models and selected the top 90 models, whose predicted class labels were combined and considered as reduced features. On the basis of these features, a multilayer perceptron was applied and developed the final prediction model (IML-TYLCVs). We conducted blind prediction on 3 groups using IML-TYLCVs, and the results indicated that 2 groups were severe and 1 group was mild. Furthermore, we confirmed the prediction with virus-challenging experiments of tomato plant phenotypes using infectious clones from 3 groups. Plant virologists and plant breeding professionals can access the user-friendly online IML-TYLCVs web server at https://balalab-skku.org/IML-TYLCVs, which can guide them in developing new protection strategies for newly emerging viruses.
format Online
Article
Text
id pubmed-10013792
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher AAAS
record_format MEDLINE/PubMed
spelling pubmed-100137922023-03-15 An Effective Integrated Machine Learning Framework for Identifying Severity of Tomato Yellow Leaf Curl Virus and Their Experimental Validation Bupi, Nattanong Sangaraju, Vinoth Kumar Phan, Le Thi Lal, Aamir Vo, Thuy Thi Bich Ho, Phuong Thi Qureshi, Muhammad Amir Tabassum, Marjia Lee, Sukchan Manavalan, Balachandran Research (Wash D C) Research Article Tomato yellow leaf curl virus (TYLCV) dispersed across different countries, specifically to subtropical regions, associated with more severe symptoms. Since TYLCV was first isolated in 1931, it has been a menace to tomato industrial production worldwide over the past century. Three groups were newly isolated from TYLCV-resistant tomatoes in 2022; however, their functions are unknown. The development of machine learning (ML)-based models using characterized sequences and evaluating blind predictions is one of the major challenges in interdisciplinary research. The purpose of this study was to develop an integrated computational framework for the accurate identification of symptoms (mild or severe) based on TYLCV sequences (isolated in Korea). For the development of the framework, we first extracted 11 different feature encodings and hybrid features from the training data and then explored 8 different classifiers and developed their respective prediction models by using randomized 10-fold cross-validation. Subsequently, we carried out a systematic evaluation of these 96 developed models and selected the top 90 models, whose predicted class labels were combined and considered as reduced features. On the basis of these features, a multilayer perceptron was applied and developed the final prediction model (IML-TYLCVs). We conducted blind prediction on 3 groups using IML-TYLCVs, and the results indicated that 2 groups were severe and 1 group was mild. Furthermore, we confirmed the prediction with virus-challenging experiments of tomato plant phenotypes using infectious clones from 3 groups. Plant virologists and plant breeding professionals can access the user-friendly online IML-TYLCVs web server at https://balalab-skku.org/IML-TYLCVs, which can guide them in developing new protection strategies for newly emerging viruses. AAAS 2023-01-10 2023 /pmc/articles/PMC10013792/ /pubmed/36930763 http://dx.doi.org/10.34133/research.0016 Text en Copyright © 2023 Nattanong Bupi et al. https://creativecommons.org/licenses/by/4.0/Exclusive licensee Science and Technology Review Publishing House. No claim to original U.S. Government Works. Distributed under a Creative Commons Attribution License 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Research Article
Bupi, Nattanong
Sangaraju, Vinoth Kumar
Phan, Le Thi
Lal, Aamir
Vo, Thuy Thi Bich
Ho, Phuong Thi
Qureshi, Muhammad Amir
Tabassum, Marjia
Lee, Sukchan
Manavalan, Balachandran
An Effective Integrated Machine Learning Framework for Identifying Severity of Tomato Yellow Leaf Curl Virus and Their Experimental Validation
title An Effective Integrated Machine Learning Framework for Identifying Severity of Tomato Yellow Leaf Curl Virus and Their Experimental Validation
title_full An Effective Integrated Machine Learning Framework for Identifying Severity of Tomato Yellow Leaf Curl Virus and Their Experimental Validation
title_fullStr An Effective Integrated Machine Learning Framework for Identifying Severity of Tomato Yellow Leaf Curl Virus and Their Experimental Validation
title_full_unstemmed An Effective Integrated Machine Learning Framework for Identifying Severity of Tomato Yellow Leaf Curl Virus and Their Experimental Validation
title_short An Effective Integrated Machine Learning Framework for Identifying Severity of Tomato Yellow Leaf Curl Virus and Their Experimental Validation
title_sort effective integrated machine learning framework for identifying severity of tomato yellow leaf curl virus and their experimental validation
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10013792/
https://www.ncbi.nlm.nih.gov/pubmed/36930763
http://dx.doi.org/10.34133/research.0016
work_keys_str_mv AT bupinattanong aneffectiveintegratedmachinelearningframeworkforidentifyingseverityoftomatoyellowleafcurlvirusandtheirexperimentalvalidation
AT sangarajuvinothkumar aneffectiveintegratedmachinelearningframeworkforidentifyingseverityoftomatoyellowleafcurlvirusandtheirexperimentalvalidation
AT phanlethi aneffectiveintegratedmachinelearningframeworkforidentifyingseverityoftomatoyellowleafcurlvirusandtheirexperimentalvalidation
AT lalaamir aneffectiveintegratedmachinelearningframeworkforidentifyingseverityoftomatoyellowleafcurlvirusandtheirexperimentalvalidation
AT vothuythibich aneffectiveintegratedmachinelearningframeworkforidentifyingseverityoftomatoyellowleafcurlvirusandtheirexperimentalvalidation
AT hophuongthi aneffectiveintegratedmachinelearningframeworkforidentifyingseverityoftomatoyellowleafcurlvirusandtheirexperimentalvalidation
AT qureshimuhammadamir aneffectiveintegratedmachinelearningframeworkforidentifyingseverityoftomatoyellowleafcurlvirusandtheirexperimentalvalidation
AT tabassummarjia aneffectiveintegratedmachinelearningframeworkforidentifyingseverityoftomatoyellowleafcurlvirusandtheirexperimentalvalidation
AT leesukchan aneffectiveintegratedmachinelearningframeworkforidentifyingseverityoftomatoyellowleafcurlvirusandtheirexperimentalvalidation
AT manavalanbalachandran aneffectiveintegratedmachinelearningframeworkforidentifyingseverityoftomatoyellowleafcurlvirusandtheirexperimentalvalidation
AT bupinattanong effectiveintegratedmachinelearningframeworkforidentifyingseverityoftomatoyellowleafcurlvirusandtheirexperimentalvalidation
AT sangarajuvinothkumar effectiveintegratedmachinelearningframeworkforidentifyingseverityoftomatoyellowleafcurlvirusandtheirexperimentalvalidation
AT phanlethi effectiveintegratedmachinelearningframeworkforidentifyingseverityoftomatoyellowleafcurlvirusandtheirexperimentalvalidation
AT lalaamir effectiveintegratedmachinelearningframeworkforidentifyingseverityoftomatoyellowleafcurlvirusandtheirexperimentalvalidation
AT vothuythibich effectiveintegratedmachinelearningframeworkforidentifyingseverityoftomatoyellowleafcurlvirusandtheirexperimentalvalidation
AT hophuongthi effectiveintegratedmachinelearningframeworkforidentifyingseverityoftomatoyellowleafcurlvirusandtheirexperimentalvalidation
AT qureshimuhammadamir effectiveintegratedmachinelearningframeworkforidentifyingseverityoftomatoyellowleafcurlvirusandtheirexperimentalvalidation
AT tabassummarjia effectiveintegratedmachinelearningframeworkforidentifyingseverityoftomatoyellowleafcurlvirusandtheirexperimentalvalidation
AT leesukchan effectiveintegratedmachinelearningframeworkforidentifyingseverityoftomatoyellowleafcurlvirusandtheirexperimentalvalidation
AT manavalanbalachandran effectiveintegratedmachinelearningframeworkforidentifyingseverityoftomatoyellowleafcurlvirusandtheirexperimentalvalidation