Machine learning approaches reveal genomic regions associated with sugarcane brown rust resistance

Bibliographic Details
Main authors: Aono, Alexandre Hild; Costa, Estela Araujo; Rody, Hugo Vianna Silva; Nagai, James Shiniti; Pimenta, Ricardo José Gonzaga; Mancini, Melina Cristina; dos Santos, Fernanda Raquel Camilo; Pinto, Luciana Rossini; Landell, Marcos Guimarães de Andrade; de Souza, Anete Pereira; Kuroshu, Reginaldo Massanobu
Format: Online Article Text
Language: English
Published: Nature Publishing Group UK 2020
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7676261/
https://www.ncbi.nlm.nih.gov/pubmed/33208862
http://dx.doi.org/10.1038/s41598-020-77063-5
Description
Summary: Sugarcane is an economically important crop, but its genomic complexity has hindered advances in molecular approaches for genetic breeding. New cultivars are released based on the identification of interesting traits, and for sugarcane, brown rust resistance is a desirable characteristic due to the large economic impact of the disease. Although marker-assisted selection for rust resistance has been successful, the genes involved are still unknown, and the associated regions vary among cultivars, thus restricting methodological generalization. We used genotyping by sequencing of full-sib progeny to relate genomic regions with brown rust phenotypes. We established a pipeline to identify reliable SNPs in complex polyploid data, which were used for phenotypic prediction via machine learning. We identified 14,540 SNPs, which led to a mean prediction accuracy of 50% when using different models. We also tested feature selection algorithms to increase predictive accuracy, resulting in a reduced dataset with more explanatory power for rust phenotypes. As a result of this approach, we achieved an accuracy of up to 95% with a dataset of 131 SNPs related to brown rust QTL regions and auxiliary genes. Therefore, our novel strategy has the potential to assist studies of the genomic organization of brown rust resistance in sugarcane.
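The general idea in the summary, selecting a small subset of SNPs before training a phenotype classifier, can be illustrated with a minimal sketch. This is not the authors' pipeline: the data are synthetic, the 0/1/2 dosage coding is a diploid simplification of polyploid sugarcane genotypes, and the mean-difference filter and nearest-centroid classifier are simple stand-ins for the feature selection and machine learning methods the summary mentions without naming. All function and variable names are hypothetical.

```python
import random

def make_synthetic_genotypes(n_samples=200, n_snps=500, n_causal=5, seed=0):
    """Build a toy SNP matrix (dosages 0/1/2) with a few phenotype-linked SNPs."""
    rng = random.Random(seed)
    X, y = [], []
    for i in range(n_samples):
        label = i % 2  # assumption: 0 = susceptible, 1 = resistant
        row = [rng.choice([0, 1, 2]) for _ in range(n_snps)]
        for j in range(n_causal):
            # The first n_causal SNPs track the phenotype 90% of the time.
            if rng.random() < 0.9:
                row[j] = 2 if label == 1 else 0
        X.append(row)
        y.append(label)
    return X, y

def score_snps(X, y):
    """Score each SNP by the absolute difference in mean dosage between classes."""
    scores = []
    for j in range(len(X[0])):
        g1 = [row[j] for row, lab in zip(X, y) if lab == 1]
        g0 = [row[j] for row, lab in zip(X, y) if lab == 0]
        scores.append(abs(sum(g1) / len(g1) - sum(g0) / len(g0)))
    return scores

def select_top_k(scores, k):
    """Return the indices of the k highest-scoring SNPs."""
    return sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)[:k]

def fit_centroids(X, y, features):
    """Compute the per-class mean dosage over the selected SNPs."""
    centroids = {}
    for lab in (0, 1):
        rows = [row for row, l in zip(X, y) if l == lab]
        centroids[lab] = [sum(r[j] for r in rows) / len(rows) for j in features]
    return centroids

def predict(row, centroids, features):
    """Assign the class whose centroid is closest in squared Euclidean distance."""
    def dist(lab):
        return sum((row[j] - c) ** 2 for j, c in zip(features, centroids[lab]))
    return min(centroids, key=dist)

X, y = make_synthetic_genotypes()
X_train, y_train = X[:150], y[:150]
X_test, y_test = X[150:], y[150:]

# Select SNPs on the training split only, so the test split stays unseen.
selected = select_top_k(score_snps(X_train, y_train), k=5)
centroids = fit_centroids(X_train, y_train, selected)

hits = sum(predict(row, centroids, selected) == lab
           for row, lab in zip(X_test, y_test))
acc = hits / len(y_test)
print("selected SNP indices:", sorted(selected))
print("held-out accuracy:", acc)
```

Scoring and selecting SNPs on the training split alone is the key design choice here: if the selection step sees the held-out samples, the reported accuracy is optimistically biased.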