Cargando…

Integrative Genomics: Quantifying Significance of Phenotype-Genotype Relationships from Multiple Sources of High-Throughput Data

Given recent advances in the generation of high-throughput data such as whole-genome genetic variation and transcriptome expression, it is critical to come up with novel methods to integrate these heterogeneous datasets and to assess the significance of identified phenotype-genotype relationships. R...

Descripción completa

Detalles Bibliográficos
Autores principales: Gamazon, Eric R., Huang, R. Stephanie, Dolan, M. Eileen, Cox, Nancy J., Im, Hae Kyung
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3668276/
https://www.ncbi.nlm.nih.gov/pubmed/23755062
http://dx.doi.org/10.3389/fgene.2012.00202
_version_ 1782271607216013312
author Gamazon, Eric R.
Huang, R. Stephanie
Dolan, M. Eileen
Cox, Nancy J.
Im, Hae Kyung
author_facet Gamazon, Eric R.
Huang, R. Stephanie
Dolan, M. Eileen
Cox, Nancy J.
Im, Hae Kyung
author_sort Gamazon, Eric R.
collection PubMed
description Given recent advances in the generation of high-throughput data such as whole-genome genetic variation and transcriptome expression, it is critical to come up with novel methods to integrate these heterogeneous datasets and to assess the significance of identified phenotype-genotype relationships. Recent studies show that genome-wide association findings are likely to fall in loci with gene regulatory effects such as expression quantitative trait loci (eQTLs), demonstrating the utility of such integrative approaches. When genotype and gene expression data are available on the same individuals, we and others developed methods wherein top phenotype-associated genetic variants are prioritized if they are associated, as eQTLs, with gene expression traits that are themselves associated with the phenotype. Yet there has been no method to determine an overall p-value for the findings that arise specifically from the integrative nature of the approach. We propose a computationally feasible permutation method that accounts for the assimilative nature of the method and the correlation structure among gene expression traits and among genotypes. We apply the method to data from a study of cellular sensitivity to etoposide, one of the most widely used chemotherapeutic drugs. To our knowledge, this study is the first statistically sound quantification of the overall significance of the genotype-phenotype relationships resulting from applying an integrative approach. This method can be easily extended to cases in which gene expression data are replaced by other molecular phenotypes of interest, e.g., microRNA or proteomic data. This study has important implications for studies seeking to expand on genetic association studies by the use of omics data. Finally, we provide an R code to compute the empirical false discovery rate when p-values for the observed and simulated phenotypes are available.
format Online
Article
Text
id pubmed-3668276
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-36682762013-06-10 Integrative Genomics: Quantifying Significance of Phenotype-Genotype Relationships from Multiple Sources of High-Throughput Data Gamazon, Eric R. Huang, R. Stephanie Dolan, M. Eileen Cox, Nancy J. Im, Hae Kyung Front Genet Genetics Given recent advances in the generation of high-throughput data such as whole-genome genetic variation and transcriptome expression, it is critical to come up with novel methods to integrate these heterogeneous datasets and to assess the significance of identified phenotype-genotype relationships. Recent studies show that genome-wide association findings are likely to fall in loci with gene regulatory effects such as expression quantitative trait loci (eQTLs), demonstrating the utility of such integrative approaches. When genotype and gene expression data are available on the same individuals, we and others developed methods wherein top phenotype-associated genetic variants are prioritized if they are associated, as eQTLs, with gene expression traits that are themselves associated with the phenotype. Yet there has been no method to determine an overall p-value for the findings that arise specifically from the integrative nature of the approach. We propose a computationally feasible permutation method that accounts for the assimilative nature of the method and the correlation structure among gene expression traits and among genotypes. We apply the method to data from a study of cellular sensitivity to etoposide, one of the most widely used chemotherapeutic drugs. To our knowledge, this study is the first statistically sound quantification of the overall significance of the genotype-phenotype relationships resulting from applying an integrative approach. This method can be easily extended to cases in which gene expression data are replaced by other molecular phenotypes of interest, e.g., microRNA or proteomic data. This study has important implications for studies seeking to expand on genetic association studies by the use of omics data. Finally, we provide an R code to compute the empirical false discovery rate when p-values for the observed and simulated phenotypes are available. Frontiers Media S.A. 2013-05-31 /pmc/articles/PMC3668276/ /pubmed/23755062 http://dx.doi.org/10.3389/fgene.2012.00202 Text en Copyright © 2013 Gamazon, Huang, Dolan, Cox and Im. http://creativecommons.org/licenses/by/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in other forums, provided the original authors and source are credited and subject to any copyright notices concerning any third-party graphics etc.
spellingShingle Genetics
Gamazon, Eric R.
Huang, R. Stephanie
Dolan, M. Eileen
Cox, Nancy J.
Im, Hae Kyung
Integrative Genomics: Quantifying Significance of Phenotype-Genotype Relationships from Multiple Sources of High-Throughput Data
title Integrative Genomics: Quantifying Significance of Phenotype-Genotype Relationships from Multiple Sources of High-Throughput Data
title_full Integrative Genomics: Quantifying Significance of Phenotype-Genotype Relationships from Multiple Sources of High-Throughput Data
title_fullStr Integrative Genomics: Quantifying Significance of Phenotype-Genotype Relationships from Multiple Sources of High-Throughput Data
title_full_unstemmed Integrative Genomics: Quantifying Significance of Phenotype-Genotype Relationships from Multiple Sources of High-Throughput Data
title_short Integrative Genomics: Quantifying Significance of Phenotype-Genotype Relationships from Multiple Sources of High-Throughput Data
title_sort integrative genomics: quantifying significance of phenotype-genotype relationships from multiple sources of high-throughput data
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3668276/
https://www.ncbi.nlm.nih.gov/pubmed/23755062
http://dx.doi.org/10.3389/fgene.2012.00202
work_keys_str_mv AT gamazonericr integrativegenomicsquantifyingsignificanceofphenotypegenotyperelationshipsfrommultiplesourcesofhighthroughputdata
AT huangrstephanie integrativegenomicsquantifyingsignificanceofphenotypegenotyperelationshipsfrommultiplesourcesofhighthroughputdata
AT dolanmeileen integrativegenomicsquantifyingsignificanceofphenotypegenotyperelationshipsfrommultiplesourcesofhighthroughputdata
AT coxnancyj integrativegenomicsquantifyingsignificanceofphenotypegenotyperelationshipsfrommultiplesourcesofhighthroughputdata
AT imhaekyung integrativegenomicsquantifyingsignificanceofphenotypegenotyperelationshipsfrommultiplesourcesofhighthroughputdata