Cargando…

Application of machine learning to explore the genomic prediction accuracy of fall dormancy in autotetraploid alfalfa

Fall dormancy (FD) is an essential trait to overcome winter damage and for alfalfa (Medicago sativa) cultivar selection. The plant regrowth height after autumn clipping is an indirect way to evaluate FD. Transcriptomics, proteomics, and quantitative trait locus mapping have revealed crucial genes co...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Fan, Kang, Junmei, Long, Ruicai, Li, Mingna, Sun, Yan, He, Fei, Jiang, Xueqian, Yang, Changfu, Yang, Xijiang, Kong, Jie, Wang, Yiwen, Wang, Zhen, Zhang, Zhiwu, Yang, Qingchuan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9832841/
https://www.ncbi.nlm.nih.gov/pubmed/36643744
http://dx.doi.org/10.1093/hr/uhac225
_version_ 1784868138136895488
author Zhang, Fan
Kang, Junmei
Long, Ruicai
Li, Mingna
Sun, Yan
He, Fei
Jiang, Xueqian
Yang, Changfu
Yang, Xijiang
Kong, Jie
Wang, Yiwen
Wang, Zhen
Zhang, Zhiwu
Yang, Qingchuan
author_facet Zhang, Fan
Kang, Junmei
Long, Ruicai
Li, Mingna
Sun, Yan
He, Fei
Jiang, Xueqian
Yang, Changfu
Yang, Xijiang
Kong, Jie
Wang, Yiwen
Wang, Zhen
Zhang, Zhiwu
Yang, Qingchuan
author_sort Zhang, Fan
collection PubMed
description Fall dormancy (FD) is an essential trait to overcome winter damage and for alfalfa (Medicago sativa) cultivar selection. The plant regrowth height after autumn clipping is an indirect way to evaluate FD. Transcriptomics, proteomics, and quantitative trait locus mapping have revealed crucial genes correlated with FD; however, these genes cannot predict alfalfa FD very well. Here, we conducted genomic prediction of FD using whole-genome SNP markers based on machine learning-related methods, including support vector machine (SVM) regression, and regularization-related methods, such as Lasso and ridge regression. The results showed that using SVM regression with linear kernel and the top 3000 genome-wide association study (GWAS)-associated markers achieved the highest prediction accuracy for FD of 64.1%. For plant regrowth height, the prediction accuracy was 59.0% using the 3000 GWAS-associated markers and the SVM linear model. This was better than the results using whole-genome markers (25.0%). Therefore, the method we explored for alfalfa FD prediction outperformed the other models, such as Lasso and ElasticNet. The study suggests the feasibility of using machine learning to predict FD with GWAS-associated markers, and the GWAS-associated markers combined with machine learning would benefit FD-related traits as well. Application of the methodology may provide potential targets for FD selection, which would accelerate genetic research and molecular breeding of alfalfa with optimized FD.
format Online
Article
Text
id pubmed-9832841
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-98328412023-01-12 Application of machine learning to explore the genomic prediction accuracy of fall dormancy in autotetraploid alfalfa Zhang, Fan Kang, Junmei Long, Ruicai Li, Mingna Sun, Yan He, Fei Jiang, Xueqian Yang, Changfu Yang, Xijiang Kong, Jie Wang, Yiwen Wang, Zhen Zhang, Zhiwu Yang, Qingchuan Hortic Res Article Fall dormancy (FD) is an essential trait to overcome winter damage and for alfalfa (Medicago sativa) cultivar selection. The plant regrowth height after autumn clipping is an indirect way to evaluate FD. Transcriptomics, proteomics, and quantitative trait locus mapping have revealed crucial genes correlated with FD; however, these genes cannot predict alfalfa FD very well. Here, we conducted genomic prediction of FD using whole-genome SNP markers based on machine learning-related methods, including support vector machine (SVM) regression, and regularization-related methods, such as Lasso and ridge regression. The results showed that using SVM regression with linear kernel and the top 3000 genome-wide association study (GWAS)-associated markers achieved the highest prediction accuracy for FD of 64.1%. For plant regrowth height, the prediction accuracy was 59.0% using the 3000 GWAS-associated markers and the SVM linear model. This was better than the results using whole-genome markers (25.0%). Therefore, the method we explored for alfalfa FD prediction outperformed the other models, such as Lasso and ElasticNet. The study suggests the feasibility of using machine learning to predict FD with GWAS-associated markers, and the GWAS-associated markers combined with machine learning would benefit FD-related traits as well. Application of the methodology may provide potential targets for FD selection, which would accelerate genetic research and molecular breeding of alfalfa with optimized FD. Oxford University Press 2022-10-07 /pmc/articles/PMC9832841/ /pubmed/36643744 http://dx.doi.org/10.1093/hr/uhac225 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of Nanjing Agricultural University. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Article
Zhang, Fan
Kang, Junmei
Long, Ruicai
Li, Mingna
Sun, Yan
He, Fei
Jiang, Xueqian
Yang, Changfu
Yang, Xijiang
Kong, Jie
Wang, Yiwen
Wang, Zhen
Zhang, Zhiwu
Yang, Qingchuan
Application of machine learning to explore the genomic prediction accuracy of fall dormancy in autotetraploid alfalfa
title Application of machine learning to explore the genomic prediction accuracy of fall dormancy in autotetraploid alfalfa
title_full Application of machine learning to explore the genomic prediction accuracy of fall dormancy in autotetraploid alfalfa
title_fullStr Application of machine learning to explore the genomic prediction accuracy of fall dormancy in autotetraploid alfalfa
title_full_unstemmed Application of machine learning to explore the genomic prediction accuracy of fall dormancy in autotetraploid alfalfa
title_short Application of machine learning to explore the genomic prediction accuracy of fall dormancy in autotetraploid alfalfa
title_sort application of machine learning to explore the genomic prediction accuracy of fall dormancy in autotetraploid alfalfa
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9832841/
https://www.ncbi.nlm.nih.gov/pubmed/36643744
http://dx.doi.org/10.1093/hr/uhac225
work_keys_str_mv AT zhangfan applicationofmachinelearningtoexplorethegenomicpredictionaccuracyoffalldormancyinautotetraploidalfalfa
AT kangjunmei applicationofmachinelearningtoexplorethegenomicpredictionaccuracyoffalldormancyinautotetraploidalfalfa
AT longruicai applicationofmachinelearningtoexplorethegenomicpredictionaccuracyoffalldormancyinautotetraploidalfalfa
AT limingna applicationofmachinelearningtoexplorethegenomicpredictionaccuracyoffalldormancyinautotetraploidalfalfa
AT sunyan applicationofmachinelearningtoexplorethegenomicpredictionaccuracyoffalldormancyinautotetraploidalfalfa
AT hefei applicationofmachinelearningtoexplorethegenomicpredictionaccuracyoffalldormancyinautotetraploidalfalfa
AT jiangxueqian applicationofmachinelearningtoexplorethegenomicpredictionaccuracyoffalldormancyinautotetraploidalfalfa
AT yangchangfu applicationofmachinelearningtoexplorethegenomicpredictionaccuracyoffalldormancyinautotetraploidalfalfa
AT yangxijiang applicationofmachinelearningtoexplorethegenomicpredictionaccuracyoffalldormancyinautotetraploidalfalfa
AT kongjie applicationofmachinelearningtoexplorethegenomicpredictionaccuracyoffalldormancyinautotetraploidalfalfa
AT wangyiwen applicationofmachinelearningtoexplorethegenomicpredictionaccuracyoffalldormancyinautotetraploidalfalfa
AT wangzhen applicationofmachinelearningtoexplorethegenomicpredictionaccuracyoffalldormancyinautotetraploidalfalfa
AT zhangzhiwu applicationofmachinelearningtoexplorethegenomicpredictionaccuracyoffalldormancyinautotetraploidalfalfa
AT yangqingchuan applicationofmachinelearningtoexplorethegenomicpredictionaccuracyoffalldormancyinautotetraploidalfalfa