Cargando…

Accuracies of genomic predictions for disease resistance of striped catfish to Edwardsiella ictaluri using artificial intelligence algorithms

Assessments of genomic prediction accuracies using artificial intelligent (AI) algorithms (i.e., machine and deep learning methods) are currently not available or very limited in aquaculture species. The principal aim of this study was to examine the predictive performance of these new methods for d...

Descripción completa

Detalles Bibliográficos
Autores principales: Vu, Nguyen Thanh, Phuc, Tran Huu, Oanh, Kim Thi Phuong, Sang, Nguyen Van, Trang, Trinh Thi, Nguyen, Nguyen Hong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8727988/
https://www.ncbi.nlm.nih.gov/pubmed/34788431
http://dx.doi.org/10.1093/g3journal/jkab361
_version_ 1784626631696973824
author Vu, Nguyen Thanh
Phuc, Tran Huu
Oanh, Kim Thi Phuong
Sang, Nguyen Van
Trang, Trinh Thi
Nguyen, Nguyen Hong
author_facet Vu, Nguyen Thanh
Phuc, Tran Huu
Oanh, Kim Thi Phuong
Sang, Nguyen Van
Trang, Trinh Thi
Nguyen, Nguyen Hong
author_sort Vu, Nguyen Thanh
collection PubMed
description Assessments of genomic prediction accuracies using artificial intelligent (AI) algorithms (i.e., machine and deep learning methods) are currently not available or very limited in aquaculture species. The principal aim of this study was to examine the predictive performance of these new methods for disease resistance to Edwardsiella ictaluri in a population of striped catfish Pangasianodon hypophthalmus and to make comparisons with four common methods, i.e., pedigree-based best linear unbiased prediction (PBLUP), genomic-based best linear unbiased prediction (GBLUP), single-step GBLUP (ssGBLUP) and a nonlinear Bayesian approach (notably BayesR). Our analyses using machine learning (i.e., ML-KAML) and deep learning (i.e., DL-MLP and DL-CNN) together with the four common methods (PBLUP, GBLUP, ssGBLUP, and BayesR) were conducted for two main disease resistance traits (i.e., survival status coded as 0 and 1 and survival time, i.e., days that the animals were still alive after the challenge test) in a pedigree consisting of 560 individual animals (490 offspring and 70 parents) genotyped for 14,154 single nucleotide polymorphism (SNPs). The results using 6,470 SNPs after quality control showed that machine learning methods outperformed PBLUP, GBLUP, and ssGBLUP, with the increases in the prediction accuracies for both traits by 9.1–15.4%. However, the prediction accuracies obtained from machine learning methods were comparable to those estimated using BayesR. Imputation of missing genotypes using AlphaFamImpute increased the prediction accuracies by 5.3–19.2% in all the methods and data used. On the other hand, there were insignificant decreases (0.3–5.6%) in the prediction accuracies for both survival status and survival time when multivariate models were used in comparison to univariate analyses. Interestingly, the genomic prediction accuracies based on only highly significant SNPs (P < 0.00001, 318–400 SNPs for survival status and 1,362–1,589 SNPs for survival time) were somewhat lower (0.3–15.6%) than those obtained from the whole set of 6,470 SNPs. In most of our analyses, the accuracies of genomic prediction were somewhat higher for survival time than survival status (0/1 data). It is concluded that although there are prospects for the application of genomic selection to increase disease resistance to E. ictaluri in striped catfish breeding programs, further evaluation of these methods should be made in independent families/populations when more data are accumulated in future generations to avoid possible biases in the genetic parameters estimates and prediction accuracies for the disease-resistant traits studied in this population of striped catfish P. hypophthalmus.
format Online
Article
Text
id pubmed-8727988
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-87279882022-01-05 Accuracies of genomic predictions for disease resistance of striped catfish to Edwardsiella ictaluri using artificial intelligence algorithms Vu, Nguyen Thanh Phuc, Tran Huu Oanh, Kim Thi Phuong Sang, Nguyen Van Trang, Trinh Thi Nguyen, Nguyen Hong G3 (Bethesda) Investigation Assessments of genomic prediction accuracies using artificial intelligent (AI) algorithms (i.e., machine and deep learning methods) are currently not available or very limited in aquaculture species. The principal aim of this study was to examine the predictive performance of these new methods for disease resistance to Edwardsiella ictaluri in a population of striped catfish Pangasianodon hypophthalmus and to make comparisons with four common methods, i.e., pedigree-based best linear unbiased prediction (PBLUP), genomic-based best linear unbiased prediction (GBLUP), single-step GBLUP (ssGBLUP) and a nonlinear Bayesian approach (notably BayesR). Our analyses using machine learning (i.e., ML-KAML) and deep learning (i.e., DL-MLP and DL-CNN) together with the four common methods (PBLUP, GBLUP, ssGBLUP, and BayesR) were conducted for two main disease resistance traits (i.e., survival status coded as 0 and 1 and survival time, i.e., days that the animals were still alive after the challenge test) in a pedigree consisting of 560 individual animals (490 offspring and 70 parents) genotyped for 14,154 single nucleotide polymorphism (SNPs). The results using 6,470 SNPs after quality control showed that machine learning methods outperformed PBLUP, GBLUP, and ssGBLUP, with the increases in the prediction accuracies for both traits by 9.1–15.4%. However, the prediction accuracies obtained from machine learning methods were comparable to those estimated using BayesR. Imputation of missing genotypes using AlphaFamImpute increased the prediction accuracies by 5.3–19.2% in all the methods and data used. On the other hand, there were insignificant decreases (0.3–5.6%) in the prediction accuracies for both survival status and survival time when multivariate models were used in comparison to univariate analyses. Interestingly, the genomic prediction accuracies based on only highly significant SNPs (P < 0.00001, 318–400 SNPs for survival status and 1,362–1,589 SNPs for survival time) were somewhat lower (0.3–15.6%) than those obtained from the whole set of 6,470 SNPs. In most of our analyses, the accuracies of genomic prediction were somewhat higher for survival time than survival status (0/1 data). It is concluded that although there are prospects for the application of genomic selection to increase disease resistance to E. ictaluri in striped catfish breeding programs, further evaluation of these methods should be made in independent families/populations when more data are accumulated in future generations to avoid possible biases in the genetic parameters estimates and prediction accuracies for the disease-resistant traits studied in this population of striped catfish P. hypophthalmus. Oxford University Press 2021-10-22 /pmc/articles/PMC8727988/ /pubmed/34788431 http://dx.doi.org/10.1093/g3journal/jkab361 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of Genetics Society of America. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Investigation
Vu, Nguyen Thanh
Phuc, Tran Huu
Oanh, Kim Thi Phuong
Sang, Nguyen Van
Trang, Trinh Thi
Nguyen, Nguyen Hong
Accuracies of genomic predictions for disease resistance of striped catfish to Edwardsiella ictaluri using artificial intelligence algorithms
title Accuracies of genomic predictions for disease resistance of striped catfish to Edwardsiella ictaluri using artificial intelligence algorithms
title_full Accuracies of genomic predictions for disease resistance of striped catfish to Edwardsiella ictaluri using artificial intelligence algorithms
title_fullStr Accuracies of genomic predictions for disease resistance of striped catfish to Edwardsiella ictaluri using artificial intelligence algorithms
title_full_unstemmed Accuracies of genomic predictions for disease resistance of striped catfish to Edwardsiella ictaluri using artificial intelligence algorithms
title_short Accuracies of genomic predictions for disease resistance of striped catfish to Edwardsiella ictaluri using artificial intelligence algorithms
title_sort accuracies of genomic predictions for disease resistance of striped catfish to edwardsiella ictaluri using artificial intelligence algorithms
topic Investigation
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8727988/
https://www.ncbi.nlm.nih.gov/pubmed/34788431
http://dx.doi.org/10.1093/g3journal/jkab361
work_keys_str_mv AT vunguyenthanh accuraciesofgenomicpredictionsfordiseaseresistanceofstripedcatfishtoedwardsiellaictaluriusingartificialintelligencealgorithms
AT phuctranhuu accuraciesofgenomicpredictionsfordiseaseresistanceofstripedcatfishtoedwardsiellaictaluriusingartificialintelligencealgorithms
AT oanhkimthiphuong accuraciesofgenomicpredictionsfordiseaseresistanceofstripedcatfishtoedwardsiellaictaluriusingartificialintelligencealgorithms
AT sangnguyenvan accuraciesofgenomicpredictionsfordiseaseresistanceofstripedcatfishtoedwardsiellaictaluriusingartificialintelligencealgorithms
AT trangtrinhthi accuraciesofgenomicpredictionsfordiseaseresistanceofstripedcatfishtoedwardsiellaictaluriusingartificialintelligencealgorithms
AT nguyennguyenhong accuraciesofgenomicpredictionsfordiseaseresistanceofstripedcatfishtoedwardsiellaictaluriusingartificialintelligencealgorithms