Cargando…
Are quantitative trait-dependent sampling designs cost-effective for analysis of rare and common variants?
Use of trait-dependent sampling designs in whole-genome association studies of sequence data can reduce total sequencing costs with modest losses of statistical efficiency. In a quantitative trait (QT) analysis of data from the Genetic Analysis Workshop 17 mini-exome for unrelated individuals in the...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287835/ https://www.ncbi.nlm.nih.gov/pubmed/22373146 http://dx.doi.org/10.1186/1753-6561-5-S9-S111 |
_version_ | 1782224754479988736 |
---|---|
author | Yilmaz, Yildiz E Bull, Shelley B |
author_facet | Yilmaz, Yildiz E Bull, Shelley B |
author_sort | Yilmaz, Yildiz E |
collection | PubMed |
description | Use of trait-dependent sampling designs in whole-genome association studies of sequence data can reduce total sequencing costs with modest losses of statistical efficiency. In a quantitative trait (QT) analysis of data from the Genetic Analysis Workshop 17 mini-exome for unrelated individuals in the Asian subpopulation, we investigate alternative designs that sequence only 50% of the entire cohort. In addition to a simple random sampling design, we consider extreme-phenotype designs that are of increasing interest in genetic association analysis of QTs, especially in studies concerned with the detection of rare genetic variants. We also evaluate a novel sampling design in which all individuals have a nonzero probability of being selected into the sample but in which individuals with extreme phenotypes have a proportionately larger probability. We take differential sampling of individuals with informative trait values into account by inverse probability weighting using standard survey methods which thus generalizes to the source population. In replicate 1 data, we applied the designs in association analysis of Q1 with both rare and common variants in the FLT1 gene, based on knowledge of the generating model. Using all 200 replicate data sets, we similarly analyzed Q1 and Q4 (which is known to be free of association with FLT1) to evaluate relative efficiency, type I error, and power. Simulation study results suggest that the QT-dependent selection designs generally yield greater than 50% relative efficiency compared to using the entire cohort, implying cost-effectiveness of 50% sample selection and worthwhile reduction of sequencing costs. |
format | Online Article Text |
id | pubmed-3287835 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-32878352012-02-28 Are quantitative trait-dependent sampling designs cost-effective for analysis of rare and common variants? Yilmaz, Yildiz E Bull, Shelley B BMC Proc Proceedings Use of trait-dependent sampling designs in whole-genome association studies of sequence data can reduce total sequencing costs with modest losses of statistical efficiency. In a quantitative trait (QT) analysis of data from the Genetic Analysis Workshop 17 mini-exome for unrelated individuals in the Asian subpopulation, we investigate alternative designs that sequence only 50% of the entire cohort. In addition to a simple random sampling design, we consider extreme-phenotype designs that are of increasing interest in genetic association analysis of QTs, especially in studies concerned with the detection of rare genetic variants. We also evaluate a novel sampling design in which all individuals have a nonzero probability of being selected into the sample but in which individuals with extreme phenotypes have a proportionately larger probability. We take differential sampling of individuals with informative trait values into account by inverse probability weighting using standard survey methods which thus generalizes to the source population. In replicate 1 data, we applied the designs in association analysis of Q1 with both rare and common variants in the FLT1 gene, based on knowledge of the generating model. Using all 200 replicate data sets, we similarly analyzed Q1 and Q4 (which is known to be free of association with FLT1) to evaluate relative efficiency, type I error, and power. Simulation study results suggest that the QT-dependent selection designs generally yield greater than 50% relative efficiency compared to using the entire cohort, implying cost-effectiveness of 50% sample selection and worthwhile reduction of sequencing costs. BioMed Central 2011-11-29 /pmc/articles/PMC3287835/ /pubmed/22373146 http://dx.doi.org/10.1186/1753-6561-5-S9-S111 Text en Copyright ©2011 Yilmaz and Bull; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Yilmaz, Yildiz E Bull, Shelley B Are quantitative trait-dependent sampling designs cost-effective for analysis of rare and common variants? |
title | Are quantitative trait-dependent sampling designs cost-effective for analysis of rare and common variants? |
title_full | Are quantitative trait-dependent sampling designs cost-effective for analysis of rare and common variants? |
title_fullStr | Are quantitative trait-dependent sampling designs cost-effective for analysis of rare and common variants? |
title_full_unstemmed | Are quantitative trait-dependent sampling designs cost-effective for analysis of rare and common variants? |
title_short | Are quantitative trait-dependent sampling designs cost-effective for analysis of rare and common variants? |
title_sort | are quantitative trait-dependent sampling designs cost-effective for analysis of rare and common variants? |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287835/ https://www.ncbi.nlm.nih.gov/pubmed/22373146 http://dx.doi.org/10.1186/1753-6561-5-S9-S111 |
work_keys_str_mv | AT yilmazyildize arequantitativetraitdependentsamplingdesignscosteffectiveforanalysisofrareandcommonvariants AT bullshelleyb arequantitativetraitdependentsamplingdesignscosteffectiveforanalysisofrareandcommonvariants |