Cargando…

Incorporating kernelized multi-omics data improves the accuracy of genomic prediction

BACKGROUND: Genomic selection (GS) has revolutionized animal and plant breeding after the first implementation via early selection before measuring phenotypes. Besides genome, transcriptome and metabolome information are increasingly considered new sources for GS. Difficulties in building the model...

Descripción completa

Detalles Bibliográficos
Autores principales: Liang, Mang, An, Bingxing, Chang, Tianpeng, Deng, Tianyu, Du, Lili, Li, Keanning, Cao, Sheng, Du, Yueying, Xu, Lingyang, Zhang, Lupei, Gao, Xue, Li, Junya, Gao, Huijiang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9490992/
https://www.ncbi.nlm.nih.gov/pubmed/36127743
http://dx.doi.org/10.1186/s40104-022-00756-6
_version_ 1784793200664248320
author Liang, Mang
An, Bingxing
Chang, Tianpeng
Deng, Tianyu
Du, Lili
Li, Keanning
Cao, Sheng
Du, Yueying
Xu, Lingyang
Zhang, Lupei
Gao, Xue
Li, Junya
Gao, Huijiang
author_facet Liang, Mang
An, Bingxing
Chang, Tianpeng
Deng, Tianyu
Du, Lili
Li, Keanning
Cao, Sheng
Du, Yueying
Xu, Lingyang
Zhang, Lupei
Gao, Xue
Li, Junya
Gao, Huijiang
author_sort Liang, Mang
collection PubMed
description BACKGROUND: Genomic selection (GS) has revolutionized animal and plant breeding after the first implementation via early selection before measuring phenotypes. Besides genome, transcriptome and metabolome information are increasingly considered new sources for GS. Difficulties in building the model with multi-omics data for GS and the limit of specimen availability have both delayed the progress of investigating multi-omics. RESULTS: We utilized the Cosine kernel to map genomic and transcriptomic data as [Formula: see text] symmetric matrix (G matrix and T matrix), combined with the best linear unbiased prediction (BLUP) for GS. Here, we defined five kernel-based prediction models: genomic BLUP (GBLUP), transcriptome-BLUP (TBLUP), multi-omics BLUP (MBLUP, [Formula: see text] ), multi-omics single-step BLUP (mssBLUP), and weighted multi-omics single-step BLUP (wmssBLUP) to integrate transcribed individuals and genotyped resource population. The predictive accuracy evaluations in four traits of the Chinese Simmental beef cattle population showed that (1) MBLUP was far preferred to GBLUP (ratio = 1.0), (2) the prediction accuracy of wmssBLUP and mssBLUP had 4.18% and 3.37% average improvement over GBLUP, (3) We also found the accuracy of wmssBLUP increased with the growing proportion of transcribed cattle in the whole resource population. CONCLUSIONS: We concluded that the inclusion of transcriptome data in GS had the potential to improve accuracy. Moreover, wmssBLUP is accepted to be a promising alternative for the present situation in which plenty of individuals are genotyped when fewer are transcribed. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40104-022-00756-6.
format Online
Article
Text
id pubmed-9490992
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-94909922022-09-22 Incorporating kernelized multi-omics data improves the accuracy of genomic prediction Liang, Mang An, Bingxing Chang, Tianpeng Deng, Tianyu Du, Lili Li, Keanning Cao, Sheng Du, Yueying Xu, Lingyang Zhang, Lupei Gao, Xue Li, Junya Gao, Huijiang J Anim Sci Biotechnol Research BACKGROUND: Genomic selection (GS) has revolutionized animal and plant breeding after the first implementation via early selection before measuring phenotypes. Besides genome, transcriptome and metabolome information are increasingly considered new sources for GS. Difficulties in building the model with multi-omics data for GS and the limit of specimen availability have both delayed the progress of investigating multi-omics. RESULTS: We utilized the Cosine kernel to map genomic and transcriptomic data as [Formula: see text] symmetric matrix (G matrix and T matrix), combined with the best linear unbiased prediction (BLUP) for GS. Here, we defined five kernel-based prediction models: genomic BLUP (GBLUP), transcriptome-BLUP (TBLUP), multi-omics BLUP (MBLUP, [Formula: see text] ), multi-omics single-step BLUP (mssBLUP), and weighted multi-omics single-step BLUP (wmssBLUP) to integrate transcribed individuals and genotyped resource population. The predictive accuracy evaluations in four traits of the Chinese Simmental beef cattle population showed that (1) MBLUP was far preferred to GBLUP (ratio = 1.0), (2) the prediction accuracy of wmssBLUP and mssBLUP had 4.18% and 3.37% average improvement over GBLUP, (3) We also found the accuracy of wmssBLUP increased with the growing proportion of transcribed cattle in the whole resource population. CONCLUSIONS: We concluded that the inclusion of transcriptome data in GS had the potential to improve accuracy. Moreover, wmssBLUP is accepted to be a promising alternative for the present situation in which plenty of individuals are genotyped when fewer are transcribed. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40104-022-00756-6. BioMed Central 2022-09-20 /pmc/articles/PMC9490992/ /pubmed/36127743 http://dx.doi.org/10.1186/s40104-022-00756-6 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Liang, Mang
An, Bingxing
Chang, Tianpeng
Deng, Tianyu
Du, Lili
Li, Keanning
Cao, Sheng
Du, Yueying
Xu, Lingyang
Zhang, Lupei
Gao, Xue
Li, Junya
Gao, Huijiang
Incorporating kernelized multi-omics data improves the accuracy of genomic prediction
title Incorporating kernelized multi-omics data improves the accuracy of genomic prediction
title_full Incorporating kernelized multi-omics data improves the accuracy of genomic prediction
title_fullStr Incorporating kernelized multi-omics data improves the accuracy of genomic prediction
title_full_unstemmed Incorporating kernelized multi-omics data improves the accuracy of genomic prediction
title_short Incorporating kernelized multi-omics data improves the accuracy of genomic prediction
title_sort incorporating kernelized multi-omics data improves the accuracy of genomic prediction
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9490992/
https://www.ncbi.nlm.nih.gov/pubmed/36127743
http://dx.doi.org/10.1186/s40104-022-00756-6
work_keys_str_mv AT liangmang incorporatingkernelizedmultiomicsdataimprovestheaccuracyofgenomicprediction
AT anbingxing incorporatingkernelizedmultiomicsdataimprovestheaccuracyofgenomicprediction
AT changtianpeng incorporatingkernelizedmultiomicsdataimprovestheaccuracyofgenomicprediction
AT dengtianyu incorporatingkernelizedmultiomicsdataimprovestheaccuracyofgenomicprediction
AT dulili incorporatingkernelizedmultiomicsdataimprovestheaccuracyofgenomicprediction
AT likeanning incorporatingkernelizedmultiomicsdataimprovestheaccuracyofgenomicprediction
AT caosheng incorporatingkernelizedmultiomicsdataimprovestheaccuracyofgenomicprediction
AT duyueying incorporatingkernelizedmultiomicsdataimprovestheaccuracyofgenomicprediction
AT xulingyang incorporatingkernelizedmultiomicsdataimprovestheaccuracyofgenomicprediction
AT zhanglupei incorporatingkernelizedmultiomicsdataimprovestheaccuracyofgenomicprediction
AT gaoxue incorporatingkernelizedmultiomicsdataimprovestheaccuracyofgenomicprediction
AT lijunya incorporatingkernelizedmultiomicsdataimprovestheaccuracyofgenomicprediction
AT gaohuijiang incorporatingkernelizedmultiomicsdataimprovestheaccuracyofgenomicprediction