Cargando…

Medium-coverage DNA sequencing in the design of the genetic association study

DNA sequencing is a widely used tool in genetic association study. Sequencing cost remains a major concern in sequencing-based study, although the application of next generation sequencing has dramatically decreased the sequencing cost and increased the efficiency. The choice of sequencing depth and...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Chao, Zhang, Ruiyuan, Shen, Hui, Deng, Hong-Wen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7608440/
https://www.ncbi.nlm.nih.gov/pubmed/32457519
http://dx.doi.org/10.1038/s41431-020-0656-2
_version_ 1783604839832354816
author Xu, Chao
Zhang, Ruiyuan
Shen, Hui
Deng, Hong-Wen
author_facet Xu, Chao
Zhang, Ruiyuan
Shen, Hui
Deng, Hong-Wen
author_sort Xu, Chao
collection PubMed
description DNA sequencing is a widely used tool in genetic association study. Sequencing cost remains a major concern in sequencing-based study, although the application of next generation sequencing has dramatically decreased the sequencing cost and increased the efficiency. The choice of sequencing depth and the sequencing sample size will largely determine the final study investment and performance. Many studies have been conducted to find a cost-effective design of sequencing depth that can achieve certain sequencing accuracy using minimal sequencing cost. The strategies previously studied can be classified into two groups: (1) single-stage to sequence all the samples using either high (>~30×) or low (<~10×) sequencing depth; and (2) two-stage to sequence an affordable number of individuals at a high-coverage followed by a large sample of low-coverage sequencing. However, limited studies examined the performance of the medium-coverage (10–30×) sequencing depth for a genetic association study, where the optimum sequencing depth may exist. In this study, using a published simulation framework, we comprehensively compared the medium-coverage sequencing (MCS) to the single- and two-stage high/low-coverage sequencing in terms of the power and type I error of the variant discovery and association testing. We found, given certain sequencing effort, MCS yielded a comparable discovery power and better type I error control compared with the best (highest power) scenarios using other high- and low-coverage single-stage or two-stage designs. However, MCS was not as competent as other designs with respect to the association power, especially for the rare variants and when the sequencing investment was limited.
format Online
Article
Text
id pubmed-7608440
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-76084402020-11-05 Medium-coverage DNA sequencing in the design of the genetic association study Xu, Chao Zhang, Ruiyuan Shen, Hui Deng, Hong-Wen Eur J Hum Genet Article DNA sequencing is a widely used tool in genetic association study. Sequencing cost remains a major concern in sequencing-based study, although the application of next generation sequencing has dramatically decreased the sequencing cost and increased the efficiency. The choice of sequencing depth and the sequencing sample size will largely determine the final study investment and performance. Many studies have been conducted to find a cost-effective design of sequencing depth that can achieve certain sequencing accuracy using minimal sequencing cost. The strategies previously studied can be classified into two groups: (1) single-stage to sequence all the samples using either high (>~30×) or low (<~10×) sequencing depth; and (2) two-stage to sequence an affordable number of individuals at a high-coverage followed by a large sample of low-coverage sequencing. However, limited studies examined the performance of the medium-coverage (10–30×) sequencing depth for a genetic association study, where the optimum sequencing depth may exist. In this study, using a published simulation framework, we comprehensively compared the medium-coverage sequencing (MCS) to the single- and two-stage high/low-coverage sequencing in terms of the power and type I error of the variant discovery and association testing. We found, given certain sequencing effort, MCS yielded a comparable discovery power and better type I error control compared with the best (highest power) scenarios using other high- and low-coverage single-stage or two-stage designs. However, MCS was not as competent as other designs with respect to the association power, especially for the rare variants and when the sequencing investment was limited. Springer International Publishing 2020-05-26 2020-10 /pmc/articles/PMC7608440/ /pubmed/32457519 http://dx.doi.org/10.1038/s41431-020-0656-2 Text en © The Author(s), under exclusive licence to European Society of Human Genetics 2020
spellingShingle Article
Xu, Chao
Zhang, Ruiyuan
Shen, Hui
Deng, Hong-Wen
Medium-coverage DNA sequencing in the design of the genetic association study
title Medium-coverage DNA sequencing in the design of the genetic association study
title_full Medium-coverage DNA sequencing in the design of the genetic association study
title_fullStr Medium-coverage DNA sequencing in the design of the genetic association study
title_full_unstemmed Medium-coverage DNA sequencing in the design of the genetic association study
title_short Medium-coverage DNA sequencing in the design of the genetic association study
title_sort medium-coverage dna sequencing in the design of the genetic association study
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7608440/
https://www.ncbi.nlm.nih.gov/pubmed/32457519
http://dx.doi.org/10.1038/s41431-020-0656-2
work_keys_str_mv AT xuchao mediumcoveragednasequencinginthedesignofthegeneticassociationstudy
AT zhangruiyuan mediumcoveragednasequencinginthedesignofthegeneticassociationstudy
AT shenhui mediumcoveragednasequencinginthedesignofthegeneticassociationstudy
AT denghongwen mediumcoveragednasequencinginthedesignofthegeneticassociationstudy