Cargando…

Machine learning and analysis of genomic diversity of “Candidatus Liberibacter asiaticus” strains from 20 citrus production states in Mexico

BACKGROUND: Huanglongbing (HLB, yellow shoot disease) is a highly destructive citrus disease associated with a nonculturable bacterium, “Candidatus Liberibacter asiaticus” (CLas), which is transmitted by Asian citrus psyllid (ACP, Diaphorina citri). In Mexico, HLB was first reported in Tizimin, Yuca...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Jiaquan, Alanís-Martínez, Iobana, Kumagai, Lucita, Dai, Zehan, Zheng, Zheng, Perez de Leon, Adalberto A., Chen, Jianchi, Deng, Xiaoling
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9798433/
https://www.ncbi.nlm.nih.gov/pubmed/36589083
http://dx.doi.org/10.3389/fpls.2022.1052680
_version_ 1784860908682477568
author Huang, Jiaquan
Alanís-Martínez, Iobana
Kumagai, Lucita
Dai, Zehan
Zheng, Zheng
Perez de Leon, Adalberto A.
Chen, Jianchi
Deng, Xiaoling
author_facet Huang, Jiaquan
Alanís-Martínez, Iobana
Kumagai, Lucita
Dai, Zehan
Zheng, Zheng
Perez de Leon, Adalberto A.
Chen, Jianchi
Deng, Xiaoling
author_sort Huang, Jiaquan
collection PubMed
description BACKGROUND: Huanglongbing (HLB, yellow shoot disease) is a highly destructive citrus disease associated with a nonculturable bacterium, “Candidatus Liberibacter asiaticus” (CLas), which is transmitted by Asian citrus psyllid (ACP, Diaphorina citri). In Mexico, HLB was first reported in Tizimin, Yucatán, in 2009 and is now endemic in 351 municipalities of 25 states. Understanding the population diversity of CLas is critical for HLB management. Current CLas diversity research is exclusively based on analysis of the bacterial genome, which composed two regions, chromosome (> 1,000 genes) and prophage (about 40 genes). METHODS AND RESULTS: In this study, 40 CLas-infected ACP samples from 20 states in Mexico were collected. CLas was detected and confirmed by PCR assays. A prophage gene(terL)-based typing system (TTS) divided the Mexican CLas strains into two groups: Term-G including four strains from Yucatán and Chiapas, as well as strain psy62 from Florida, USA, and Term-A included all other 36 Mexican strains, as well as strain AHCA1 from California, USA. CLas diversity was further evaluated to include all chromosomal and prophage genes assisted by using machine learning (ML) tools to resolve multidimensional data handling issues. A Term-G strain (YTMX) and a Term-A strain (BCSMX) were sequenced and analyzed. The two Mexican genome sequences along with the CLas genome sequences available in GenBank were studied. An unsupervised ML was implemented through principal component analysis (PCA) on average nucleotide identities (ANIs) of CLas whole genome sequences; And a supervised ML was implemented through sparse partial least squares discriminant analysis (sPLS-DA) on single nucleotide polymorphisms (SNPs) of coding genes of CLas guided by the TTS. Two CLas Geno-groups, Geno-group 1 that extended Term-A and Geno-group 2 that extended Term-G, were established. CONCLUSIONS: This study concluded that: 1) there were at least two different introductions of CLas into Mexico; 2) CLas strains between Mexico and USA are closely related; and 3) The two Geno-groups provide the basis for future CLas subspecies research.
format Online
Article
Text
id pubmed-9798433
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-97984332022-12-30 Machine learning and analysis of genomic diversity of “Candidatus Liberibacter asiaticus” strains from 20 citrus production states in Mexico Huang, Jiaquan Alanís-Martínez, Iobana Kumagai, Lucita Dai, Zehan Zheng, Zheng Perez de Leon, Adalberto A. Chen, Jianchi Deng, Xiaoling Front Plant Sci Plant Science BACKGROUND: Huanglongbing (HLB, yellow shoot disease) is a highly destructive citrus disease associated with a nonculturable bacterium, “Candidatus Liberibacter asiaticus” (CLas), which is transmitted by Asian citrus psyllid (ACP, Diaphorina citri). In Mexico, HLB was first reported in Tizimin, Yucatán, in 2009 and is now endemic in 351 municipalities of 25 states. Understanding the population diversity of CLas is critical for HLB management. Current CLas diversity research is exclusively based on analysis of the bacterial genome, which composed two regions, chromosome (> 1,000 genes) and prophage (about 40 genes). METHODS AND RESULTS: In this study, 40 CLas-infected ACP samples from 20 states in Mexico were collected. CLas was detected and confirmed by PCR assays. A prophage gene(terL)-based typing system (TTS) divided the Mexican CLas strains into two groups: Term-G including four strains from Yucatán and Chiapas, as well as strain psy62 from Florida, USA, and Term-A included all other 36 Mexican strains, as well as strain AHCA1 from California, USA. CLas diversity was further evaluated to include all chromosomal and prophage genes assisted by using machine learning (ML) tools to resolve multidimensional data handling issues. A Term-G strain (YTMX) and a Term-A strain (BCSMX) were sequenced and analyzed. The two Mexican genome sequences along with the CLas genome sequences available in GenBank were studied. An unsupervised ML was implemented through principal component analysis (PCA) on average nucleotide identities (ANIs) of CLas whole genome sequences; And a supervised ML was implemented through sparse partial least squares discriminant analysis (sPLS-DA) on single nucleotide polymorphisms (SNPs) of coding genes of CLas guided by the TTS. Two CLas Geno-groups, Geno-group 1 that extended Term-A and Geno-group 2 that extended Term-G, were established. CONCLUSIONS: This study concluded that: 1) there were at least two different introductions of CLas into Mexico; 2) CLas strains between Mexico and USA are closely related; and 3) The two Geno-groups provide the basis for future CLas subspecies research. Frontiers Media S.A. 2022-12-15 /pmc/articles/PMC9798433/ /pubmed/36589083 http://dx.doi.org/10.3389/fpls.2022.1052680 Text en Copyright © 2022 Huang, Alanís-Martínez, Kumagai, Dai, Zheng, Perez de Leon, Chen and Deng https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Plant Science
Huang, Jiaquan
Alanís-Martínez, Iobana
Kumagai, Lucita
Dai, Zehan
Zheng, Zheng
Perez de Leon, Adalberto A.
Chen, Jianchi
Deng, Xiaoling
Machine learning and analysis of genomic diversity of “Candidatus Liberibacter asiaticus” strains from 20 citrus production states in Mexico
title Machine learning and analysis of genomic diversity of “Candidatus Liberibacter asiaticus” strains from 20 citrus production states in Mexico
title_full Machine learning and analysis of genomic diversity of “Candidatus Liberibacter asiaticus” strains from 20 citrus production states in Mexico
title_fullStr Machine learning and analysis of genomic diversity of “Candidatus Liberibacter asiaticus” strains from 20 citrus production states in Mexico
title_full_unstemmed Machine learning and analysis of genomic diversity of “Candidatus Liberibacter asiaticus” strains from 20 citrus production states in Mexico
title_short Machine learning and analysis of genomic diversity of “Candidatus Liberibacter asiaticus” strains from 20 citrus production states in Mexico
title_sort machine learning and analysis of genomic diversity of “candidatus liberibacter asiaticus” strains from 20 citrus production states in mexico
topic Plant Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9798433/
https://www.ncbi.nlm.nih.gov/pubmed/36589083
http://dx.doi.org/10.3389/fpls.2022.1052680
work_keys_str_mv AT huangjiaquan machinelearningandanalysisofgenomicdiversityofcandidatusliberibacterasiaticusstrainsfrom20citrusproductionstatesinmexico
AT alanismartineziobana machinelearningandanalysisofgenomicdiversityofcandidatusliberibacterasiaticusstrainsfrom20citrusproductionstatesinmexico
AT kumagailucita machinelearningandanalysisofgenomicdiversityofcandidatusliberibacterasiaticusstrainsfrom20citrusproductionstatesinmexico
AT daizehan machinelearningandanalysisofgenomicdiversityofcandidatusliberibacterasiaticusstrainsfrom20citrusproductionstatesinmexico
AT zhengzheng machinelearningandanalysisofgenomicdiversityofcandidatusliberibacterasiaticusstrainsfrom20citrusproductionstatesinmexico
AT perezdeleonadalbertoa machinelearningandanalysisofgenomicdiversityofcandidatusliberibacterasiaticusstrainsfrom20citrusproductionstatesinmexico
AT chenjianchi machinelearningandanalysisofgenomicdiversityofcandidatusliberibacterasiaticusstrainsfrom20citrusproductionstatesinmexico
AT dengxiaoling machinelearningandanalysisofgenomicdiversityofcandidatusliberibacterasiaticusstrainsfrom20citrusproductionstatesinmexico