Cargando…

Two-phase and family-based designs for next-generation sequencing studies

The cost of next-generation sequencing is now approaching that of early GWAS panels, but is still out of reach for large epidemiologic studies and the millions of rare variants expected poses challenges for distinguishing causal from non-causal variants. We review two types of designs for sequencing...

Descripción completa

Detalles Bibliográficos
Autores principales:	Thomas, Duncan C., Yang, Zhao, Yang, Fan
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2013
Materias:	Genetics
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3861783/ https://www.ncbi.nlm.nih.gov/pubmed/24379824 http://dx.doi.org/10.3389/fgene.2013.00276

_version_	1782295685783093248
author	Thomas, Duncan C. Yang, Zhao Yang, Fan
author_facet	Thomas, Duncan C. Yang, Zhao Yang, Fan
author_sort	Thomas, Duncan C.
collection	PubMed
description	The cost of next-generation sequencing is now approaching that of early GWAS panels, but is still out of reach for large epidemiologic studies and the millions of rare variants expected poses challenges for distinguishing causal from non-causal variants. We review two types of designs for sequencing studies: two-phase designs for targeted follow-up of genomewide association studies using unrelated individuals; and family-based designs exploiting co-segregation for prioritizing variants and genes. Two-phase designs subsample subjects for sequencing from a larger case-control study jointly on the basis of their disease and carrier status; the discovered variants are then tested for association in the parent study. The analysis combines the full sequence data from the substudy with the more limited SNP data from the main study. We discuss various methods for selecting this subset of variants and describe the expected yield of true positive associations in the context of an on-going study of second breast cancers following radiotherapy. While the sharing of variants within families means that family-based designs are less efficient for discovery than sequencing unrelated individuals, the ability to exploit co-segregation of variants with disease within families helps distinguish causal from non-causal ones. Furthermore, by enriching for family history, the yield of causal variants can be improved and use of identity-by-descent information improves imputation of genotypes for other family members. We compare the relative efficiency of these designs with those using unrelated individuals for discovering and prioritizing variants or genes for testing association in larger studies. While associations can be tested with single variants, power is low for rare ones. Recent generalizations of burden or kernel tests for gene-level associations to family-based data are appealing. These approaches are illustrated in the context of a family-based study of colorectal cancer.
format	Online Article Text
id	pubmed-3861783
institution	National Center for Biotechnology Information
language	English
publishDate	2013
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-38617832013-12-30 Two-phase and family-based designs for next-generation sequencing studies Thomas, Duncan C. Yang, Zhao Yang, Fan Front Genet Genetics The cost of next-generation sequencing is now approaching that of early GWAS panels, but is still out of reach for large epidemiologic studies and the millions of rare variants expected poses challenges for distinguishing causal from non-causal variants. We review two types of designs for sequencing studies: two-phase designs for targeted follow-up of genomewide association studies using unrelated individuals; and family-based designs exploiting co-segregation for prioritizing variants and genes. Two-phase designs subsample subjects for sequencing from a larger case-control study jointly on the basis of their disease and carrier status; the discovered variants are then tested for association in the parent study. The analysis combines the full sequence data from the substudy with the more limited SNP data from the main study. We discuss various methods for selecting this subset of variants and describe the expected yield of true positive associations in the context of an on-going study of second breast cancers following radiotherapy. While the sharing of variants within families means that family-based designs are less efficient for discovery than sequencing unrelated individuals, the ability to exploit co-segregation of variants with disease within families helps distinguish causal from non-causal ones. Furthermore, by enriching for family history, the yield of causal variants can be improved and use of identity-by-descent information improves imputation of genotypes for other family members. We compare the relative efficiency of these designs with those using unrelated individuals for discovering and prioritizing variants or genes for testing association in larger studies. While associations can be tested with single variants, power is low for rare ones. Recent generalizations of burden or kernel tests for gene-level associations to family-based data are appealing. These approaches are illustrated in the context of a family-based study of colorectal cancer. Frontiers Media S.A. 2013-12-13 /pmc/articles/PMC3861783/ /pubmed/24379824 http://dx.doi.org/10.3389/fgene.2013.00276 Text en Copyright © 2013 Thomas, Yang and Yang. http://creativecommons.org/licenses/by/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Genetics Thomas, Duncan C. Yang, Zhao Yang, Fan Two-phase and family-based designs for next-generation sequencing studies
title	Two-phase and family-based designs for next-generation sequencing studies
title_full	Two-phase and family-based designs for next-generation sequencing studies
title_fullStr	Two-phase and family-based designs for next-generation sequencing studies
title_full_unstemmed	Two-phase and family-based designs for next-generation sequencing studies
title_short	Two-phase and family-based designs for next-generation sequencing studies
title_sort	two-phase and family-based designs for next-generation sequencing studies
topic	Genetics
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3861783/ https://www.ncbi.nlm.nih.gov/pubmed/24379824 http://dx.doi.org/10.3389/fgene.2013.00276
work_keys_str_mv	AT thomasduncanc twophaseandfamilybaseddesignsfornextgenerationsequencingstudies AT yangzhao twophaseandfamilybaseddesignsfornextgenerationsequencingstudies AT yangfan twophaseandfamilybaseddesignsfornextgenerationsequencingstudies

Two-phase and family-based designs for next-generation sequencing studies

Ejemplares similares