Cargando…

Multi-omics-data-assisted genomic feature markers preselection improves the accuracy of genomic prediction

BACKGROUND: Presently, multi-omics data (e.g., genomics, transcriptomics, proteomics, and metabolomics) are available to improve genomic predictors. Omics data not only offers new data layers for genomic prediction but also provides a bridge between organismal phenotypes and genome variation that ca...

Descripción completa

Detalles Bibliográficos
Autores principales: Ye, Shaopan, Li, Jiaqi, Zhang, Zhe
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7708144/
https://www.ncbi.nlm.nih.gov/pubmed/33292577
http://dx.doi.org/10.1186/s40104-020-00515-5
_version_ 1783617503871631360
author Ye, Shaopan
Li, Jiaqi
Zhang, Zhe
author_facet Ye, Shaopan
Li, Jiaqi
Zhang, Zhe
author_sort Ye, Shaopan
collection PubMed
description BACKGROUND: Presently, multi-omics data (e.g., genomics, transcriptomics, proteomics, and metabolomics) are available to improve genomic predictors. Omics data not only offers new data layers for genomic prediction but also provides a bridge between organismal phenotypes and genome variation that cannot be readily captured at the genome sequence level. Therefore, using multi-omics data to select feature markers is a feasible strategy to improve the accuracy of genomic prediction. In this study, simultaneously using whole-genome sequencing (WGS) and gene expression level data, four strategies for single-nucleotide polymorphism (SNP) preselection were investigated for genomic predictions in the Drosophila Genetic Reference Panel. RESULTS: Using genomic best linear unbiased prediction (GBLUP) with complete WGS data, the prediction accuracies were 0.208 ± 0.020 (0.181 ± 0.022) for the startle response and 0.272 ± 0.017 (0.307 ± 0.015) for starvation resistance in the female (male) lines. Compared with GBLUP using complete WGS data, both GBLUP and the genomic feature BLUP (GFBLUP) did not improve the prediction accuracy using SNPs preselected from complete WGS data based on the results of genome-wide association studies (GWASs) or transcriptome-wide association studies (TWASs). Furthermore, by using SNPs preselected from the WGS data based on the results of the expression quantitative trait locus (eQTL) mapping of all genes, only the startle response had greater accuracy than GBLUP with the complete WGS data. The best accuracy values in the female and male lines were 0.243 ± 0.020 and 0.220 ± 0.022, respectively. Importantly, by using SNPs preselected based on the results of the eQTL mapping of significant genes from TWAS, both GBLUP and GFBLUP resulted in great accuracy and small bias of genomic prediction. Compared with the GBLUP using complete WGS data, the best accuracy values represented increases of 60.66% and 39.09% for the starvation resistance and 27.40% and 35.36% for startle response in the female and male lines, respectively. CONCLUSIONS: Overall, multi-omics data can assist genomic feature preselection and improve the performance of genomic prediction. The new knowledge gained from this study will enrich the use of multi-omics in genomic prediction.
format Online
Article
Text
id pubmed-7708144
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-77081442020-12-02 Multi-omics-data-assisted genomic feature markers preselection improves the accuracy of genomic prediction Ye, Shaopan Li, Jiaqi Zhang, Zhe J Anim Sci Biotechnol Research BACKGROUND: Presently, multi-omics data (e.g., genomics, transcriptomics, proteomics, and metabolomics) are available to improve genomic predictors. Omics data not only offers new data layers for genomic prediction but also provides a bridge between organismal phenotypes and genome variation that cannot be readily captured at the genome sequence level. Therefore, using multi-omics data to select feature markers is a feasible strategy to improve the accuracy of genomic prediction. In this study, simultaneously using whole-genome sequencing (WGS) and gene expression level data, four strategies for single-nucleotide polymorphism (SNP) preselection were investigated for genomic predictions in the Drosophila Genetic Reference Panel. RESULTS: Using genomic best linear unbiased prediction (GBLUP) with complete WGS data, the prediction accuracies were 0.208 ± 0.020 (0.181 ± 0.022) for the startle response and 0.272 ± 0.017 (0.307 ± 0.015) for starvation resistance in the female (male) lines. Compared with GBLUP using complete WGS data, both GBLUP and the genomic feature BLUP (GFBLUP) did not improve the prediction accuracy using SNPs preselected from complete WGS data based on the results of genome-wide association studies (GWASs) or transcriptome-wide association studies (TWASs). Furthermore, by using SNPs preselected from the WGS data based on the results of the expression quantitative trait locus (eQTL) mapping of all genes, only the startle response had greater accuracy than GBLUP with the complete WGS data. The best accuracy values in the female and male lines were 0.243 ± 0.020 and 0.220 ± 0.022, respectively. Importantly, by using SNPs preselected based on the results of the eQTL mapping of significant genes from TWAS, both GBLUP and GFBLUP resulted in great accuracy and small bias of genomic prediction. Compared with the GBLUP using complete WGS data, the best accuracy values represented increases of 60.66% and 39.09% for the starvation resistance and 27.40% and 35.36% for startle response in the female and male lines, respectively. CONCLUSIONS: Overall, multi-omics data can assist genomic feature preselection and improve the performance of genomic prediction. The new knowledge gained from this study will enrich the use of multi-omics in genomic prediction. BioMed Central 2020-12-01 /pmc/articles/PMC7708144/ /pubmed/33292577 http://dx.doi.org/10.1186/s40104-020-00515-5 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Ye, Shaopan
Li, Jiaqi
Zhang, Zhe
Multi-omics-data-assisted genomic feature markers preselection improves the accuracy of genomic prediction
title Multi-omics-data-assisted genomic feature markers preselection improves the accuracy of genomic prediction
title_full Multi-omics-data-assisted genomic feature markers preselection improves the accuracy of genomic prediction
title_fullStr Multi-omics-data-assisted genomic feature markers preselection improves the accuracy of genomic prediction
title_full_unstemmed Multi-omics-data-assisted genomic feature markers preselection improves the accuracy of genomic prediction
title_short Multi-omics-data-assisted genomic feature markers preselection improves the accuracy of genomic prediction
title_sort multi-omics-data-assisted genomic feature markers preselection improves the accuracy of genomic prediction
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7708144/
https://www.ncbi.nlm.nih.gov/pubmed/33292577
http://dx.doi.org/10.1186/s40104-020-00515-5
work_keys_str_mv AT yeshaopan multiomicsdataassistedgenomicfeaturemarkerspreselectionimprovestheaccuracyofgenomicprediction
AT lijiaqi multiomicsdataassistedgenomicfeaturemarkerspreselectionimprovestheaccuracyofgenomicprediction
AT zhangzhe multiomicsdataassistedgenomicfeaturemarkerspreselectionimprovestheaccuracyofgenomicprediction