Cargando…

Using Whole-Genome Sequence Data to Predict Quantitative Trait Phenotypes in Drosophila melanogaster

Predicting organismal phenotypes from genotype data is important for plant and animal breeding, medicine, and evolutionary biology. Genomic-based phenotype prediction has been applied for single-nucleotide polymorphism (SNP) genotyping platforms, but not using complete genome sequences. Here, we rep...

Descripción completa

Detalles Bibliográficos
Autores principales: Ober, Ulrike, Ayroles, Julien F., Stone, Eric A., Richards, Stephen, Zhu, Dianhui, Gibbs, Richard A., Stricker, Christian, Gianola, Daniel, Schlather, Martin, Mackay, Trudy F. C., Simianer, Henner
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3342952/
https://www.ncbi.nlm.nih.gov/pubmed/22570636
http://dx.doi.org/10.1371/journal.pgen.1002685
_version_ 1782231756968034304
author Ober, Ulrike
Ayroles, Julien F.
Stone, Eric A.
Richards, Stephen
Zhu, Dianhui
Gibbs, Richard A.
Stricker, Christian
Gianola, Daniel
Schlather, Martin
Mackay, Trudy F. C.
Simianer, Henner
author_facet Ober, Ulrike
Ayroles, Julien F.
Stone, Eric A.
Richards, Stephen
Zhu, Dianhui
Gibbs, Richard A.
Stricker, Christian
Gianola, Daniel
Schlather, Martin
Mackay, Trudy F. C.
Simianer, Henner
author_sort Ober, Ulrike
collection PubMed
description Predicting organismal phenotypes from genotype data is important for plant and animal breeding, medicine, and evolutionary biology. Genomic-based phenotype prediction has been applied for single-nucleotide polymorphism (SNP) genotyping platforms, but not using complete genome sequences. Here, we report genomic prediction for starvation stress resistance and startle response in Drosophila melanogaster, using ∼2.5 million SNPs determined by sequencing the Drosophila Genetic Reference Panel population of inbred lines. We constructed a genomic relationship matrix from the SNP data and used it in a genomic best linear unbiased prediction (GBLUP) model. We assessed predictive ability as the correlation between predicted genetic values and observed phenotypes by cross-validation, and found a predictive ability of 0.239±0.008 (0.230±0.012) for starvation resistance (startle response). The predictive ability of BayesB, a Bayesian method with internal SNP selection, was not greater than GBLUP. Selection of the 5% SNPs with either the highest absolute effect or variance explained did not improve predictive ability. Predictive ability decreased only when fewer than 150,000 SNPs were used to construct the genomic relationship matrix. We hypothesize that predictive power in this population stems from the SNP–based modeling of the subtle relationship structure caused by long-range linkage disequilibrium and not from population structure or SNPs in linkage disequilibrium with causal variants. We discuss the implications of these results for genomic prediction in other organisms.
format Online
Article
Text
id pubmed-3342952
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-33429522012-05-08 Using Whole-Genome Sequence Data to Predict Quantitative Trait Phenotypes in Drosophila melanogaster Ober, Ulrike Ayroles, Julien F. Stone, Eric A. Richards, Stephen Zhu, Dianhui Gibbs, Richard A. Stricker, Christian Gianola, Daniel Schlather, Martin Mackay, Trudy F. C. Simianer, Henner PLoS Genet Research Article Predicting organismal phenotypes from genotype data is important for plant and animal breeding, medicine, and evolutionary biology. Genomic-based phenotype prediction has been applied for single-nucleotide polymorphism (SNP) genotyping platforms, but not using complete genome sequences. Here, we report genomic prediction for starvation stress resistance and startle response in Drosophila melanogaster, using ∼2.5 million SNPs determined by sequencing the Drosophila Genetic Reference Panel population of inbred lines. We constructed a genomic relationship matrix from the SNP data and used it in a genomic best linear unbiased prediction (GBLUP) model. We assessed predictive ability as the correlation between predicted genetic values and observed phenotypes by cross-validation, and found a predictive ability of 0.239±0.008 (0.230±0.012) for starvation resistance (startle response). The predictive ability of BayesB, a Bayesian method with internal SNP selection, was not greater than GBLUP. Selection of the 5% SNPs with either the highest absolute effect or variance explained did not improve predictive ability. Predictive ability decreased only when fewer than 150,000 SNPs were used to construct the genomic relationship matrix. We hypothesize that predictive power in this population stems from the SNP–based modeling of the subtle relationship structure caused by long-range linkage disequilibrium and not from population structure or SNPs in linkage disequilibrium with causal variants. We discuss the implications of these results for genomic prediction in other organisms. Public Library of Science 2012-05-03 /pmc/articles/PMC3342952/ /pubmed/22570636 http://dx.doi.org/10.1371/journal.pgen.1002685 Text en Ober et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Ober, Ulrike
Ayroles, Julien F.
Stone, Eric A.
Richards, Stephen
Zhu, Dianhui
Gibbs, Richard A.
Stricker, Christian
Gianola, Daniel
Schlather, Martin
Mackay, Trudy F. C.
Simianer, Henner
Using Whole-Genome Sequence Data to Predict Quantitative Trait Phenotypes in Drosophila melanogaster
title Using Whole-Genome Sequence Data to Predict Quantitative Trait Phenotypes in Drosophila melanogaster
title_full Using Whole-Genome Sequence Data to Predict Quantitative Trait Phenotypes in Drosophila melanogaster
title_fullStr Using Whole-Genome Sequence Data to Predict Quantitative Trait Phenotypes in Drosophila melanogaster
title_full_unstemmed Using Whole-Genome Sequence Data to Predict Quantitative Trait Phenotypes in Drosophila melanogaster
title_short Using Whole-Genome Sequence Data to Predict Quantitative Trait Phenotypes in Drosophila melanogaster
title_sort using whole-genome sequence data to predict quantitative trait phenotypes in drosophila melanogaster
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3342952/
https://www.ncbi.nlm.nih.gov/pubmed/22570636
http://dx.doi.org/10.1371/journal.pgen.1002685
work_keys_str_mv AT oberulrike usingwholegenomesequencedatatopredictquantitativetraitphenotypesindrosophilamelanogaster
AT ayrolesjulienf usingwholegenomesequencedatatopredictquantitativetraitphenotypesindrosophilamelanogaster
AT stoneerica usingwholegenomesequencedatatopredictquantitativetraitphenotypesindrosophilamelanogaster
AT richardsstephen usingwholegenomesequencedatatopredictquantitativetraitphenotypesindrosophilamelanogaster
AT zhudianhui usingwholegenomesequencedatatopredictquantitativetraitphenotypesindrosophilamelanogaster
AT gibbsricharda usingwholegenomesequencedatatopredictquantitativetraitphenotypesindrosophilamelanogaster
AT strickerchristian usingwholegenomesequencedatatopredictquantitativetraitphenotypesindrosophilamelanogaster
AT gianoladaniel usingwholegenomesequencedatatopredictquantitativetraitphenotypesindrosophilamelanogaster
AT schlathermartin usingwholegenomesequencedatatopredictquantitativetraitphenotypesindrosophilamelanogaster
AT mackaytrudyfc usingwholegenomesequencedatatopredictquantitativetraitphenotypesindrosophilamelanogaster
AT simianerhenner usingwholegenomesequencedatatopredictquantitativetraitphenotypesindrosophilamelanogaster