Cargando…
Multi-omics-data-assisted genomic feature markers preselection improves the accuracy of genomic prediction
BACKGROUND: Presently, multi-omics data (e.g., genomics, transcriptomics, proteomics, and metabolomics) are available to improve genomic predictors. Omics data not only offers new data layers for genomic prediction but also provides a bridge between organismal phenotypes and genome variation that ca...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7708144/ https://www.ncbi.nlm.nih.gov/pubmed/33292577 http://dx.doi.org/10.1186/s40104-020-00515-5 |
_version_ | 1783617503871631360 |
---|---|
author | Ye, Shaopan Li, Jiaqi Zhang, Zhe |
author_facet | Ye, Shaopan Li, Jiaqi Zhang, Zhe |
author_sort | Ye, Shaopan |
collection | PubMed |
description | BACKGROUND: Presently, multi-omics data (e.g., genomics, transcriptomics, proteomics, and metabolomics) are available to improve genomic predictors. Omics data not only offers new data layers for genomic prediction but also provides a bridge between organismal phenotypes and genome variation that cannot be readily captured at the genome sequence level. Therefore, using multi-omics data to select feature markers is a feasible strategy to improve the accuracy of genomic prediction. In this study, simultaneously using whole-genome sequencing (WGS) and gene expression level data, four strategies for single-nucleotide polymorphism (SNP) preselection were investigated for genomic predictions in the Drosophila Genetic Reference Panel. RESULTS: Using genomic best linear unbiased prediction (GBLUP) with complete WGS data, the prediction accuracies were 0.208 ± 0.020 (0.181 ± 0.022) for the startle response and 0.272 ± 0.017 (0.307 ± 0.015) for starvation resistance in the female (male) lines. Compared with GBLUP using complete WGS data, both GBLUP and the genomic feature BLUP (GFBLUP) did not improve the prediction accuracy using SNPs preselected from complete WGS data based on the results of genome-wide association studies (GWASs) or transcriptome-wide association studies (TWASs). Furthermore, by using SNPs preselected from the WGS data based on the results of the expression quantitative trait locus (eQTL) mapping of all genes, only the startle response had greater accuracy than GBLUP with the complete WGS data. The best accuracy values in the female and male lines were 0.243 ± 0.020 and 0.220 ± 0.022, respectively. Importantly, by using SNPs preselected based on the results of the eQTL mapping of significant genes from TWAS, both GBLUP and GFBLUP resulted in great accuracy and small bias of genomic prediction. Compared with the GBLUP using complete WGS data, the best accuracy values represented increases of 60.66% and 39.09% for the starvation resistance and 27.40% and 35.36% for startle response in the female and male lines, respectively. CONCLUSIONS: Overall, multi-omics data can assist genomic feature preselection and improve the performance of genomic prediction. The new knowledge gained from this study will enrich the use of multi-omics in genomic prediction. |
format | Online Article Text |
id | pubmed-7708144 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-77081442020-12-02 Multi-omics-data-assisted genomic feature markers preselection improves the accuracy of genomic prediction Ye, Shaopan Li, Jiaqi Zhang, Zhe J Anim Sci Biotechnol Research BACKGROUND: Presently, multi-omics data (e.g., genomics, transcriptomics, proteomics, and metabolomics) are available to improve genomic predictors. Omics data not only offers new data layers for genomic prediction but also provides a bridge between organismal phenotypes and genome variation that cannot be readily captured at the genome sequence level. Therefore, using multi-omics data to select feature markers is a feasible strategy to improve the accuracy of genomic prediction. In this study, simultaneously using whole-genome sequencing (WGS) and gene expression level data, four strategies for single-nucleotide polymorphism (SNP) preselection were investigated for genomic predictions in the Drosophila Genetic Reference Panel. RESULTS: Using genomic best linear unbiased prediction (GBLUP) with complete WGS data, the prediction accuracies were 0.208 ± 0.020 (0.181 ± 0.022) for the startle response and 0.272 ± 0.017 (0.307 ± 0.015) for starvation resistance in the female (male) lines. Compared with GBLUP using complete WGS data, both GBLUP and the genomic feature BLUP (GFBLUP) did not improve the prediction accuracy using SNPs preselected from complete WGS data based on the results of genome-wide association studies (GWASs) or transcriptome-wide association studies (TWASs). Furthermore, by using SNPs preselected from the WGS data based on the results of the expression quantitative trait locus (eQTL) mapping of all genes, only the startle response had greater accuracy than GBLUP with the complete WGS data. The best accuracy values in the female and male lines were 0.243 ± 0.020 and 0.220 ± 0.022, respectively. Importantly, by using SNPs preselected based on the results of the eQTL mapping of significant genes from TWAS, both GBLUP and GFBLUP resulted in great accuracy and small bias of genomic prediction. Compared with the GBLUP using complete WGS data, the best accuracy values represented increases of 60.66% and 39.09% for the starvation resistance and 27.40% and 35.36% for startle response in the female and male lines, respectively. CONCLUSIONS: Overall, multi-omics data can assist genomic feature preselection and improve the performance of genomic prediction. The new knowledge gained from this study will enrich the use of multi-omics in genomic prediction. BioMed Central 2020-12-01 /pmc/articles/PMC7708144/ /pubmed/33292577 http://dx.doi.org/10.1186/s40104-020-00515-5 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Ye, Shaopan Li, Jiaqi Zhang, Zhe Multi-omics-data-assisted genomic feature markers preselection improves the accuracy of genomic prediction |
title | Multi-omics-data-assisted genomic feature markers preselection improves the accuracy of genomic prediction |
title_full | Multi-omics-data-assisted genomic feature markers preselection improves the accuracy of genomic prediction |
title_fullStr | Multi-omics-data-assisted genomic feature markers preselection improves the accuracy of genomic prediction |
title_full_unstemmed | Multi-omics-data-assisted genomic feature markers preselection improves the accuracy of genomic prediction |
title_short | Multi-omics-data-assisted genomic feature markers preselection improves the accuracy of genomic prediction |
title_sort | multi-omics-data-assisted genomic feature markers preselection improves the accuracy of genomic prediction |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7708144/ https://www.ncbi.nlm.nih.gov/pubmed/33292577 http://dx.doi.org/10.1186/s40104-020-00515-5 |
work_keys_str_mv | AT yeshaopan multiomicsdataassistedgenomicfeaturemarkerspreselectionimprovestheaccuracyofgenomicprediction AT lijiaqi multiomicsdataassistedgenomicfeaturemarkerspreselectionimprovestheaccuracyofgenomicprediction AT zhangzhe multiomicsdataassistedgenomicfeaturemarkerspreselectionimprovestheaccuracyofgenomicprediction |