Cargando…

A review of study designs and statistical methods for genomic epidemiology studies using next generation sequencing

Results from numerous linkage and association studies have greatly deepened scientists’ understanding of the genetic basis of many human diseases, yet some important questions remain unanswered. For example, although a large number of disease-associated loci have been identified from genome-wide ass...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Qian, Lu, Qiongshi, Zhao, Hongyu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4403555/
https://www.ncbi.nlm.nih.gov/pubmed/25941534
http://dx.doi.org/10.3389/fgene.2015.00149
_version_ 1782367348499415040
author Wang, Qian
Lu, Qiongshi
Zhao, Hongyu
author_facet Wang, Qian
Lu, Qiongshi
Zhao, Hongyu
author_sort Wang, Qian
collection PubMed
description Results from numerous linkage and association studies have greatly deepened scientists’ understanding of the genetic basis of many human diseases, yet some important questions remain unanswered. For example, although a large number of disease-associated loci have been identified from genome-wide association studies in the past 10 years, it is challenging to interpret these results as most disease-associated markers have no clear functional roles in disease etiology, and all the identified genomic factors only explain a small portion of disease heritability. With the help of next-generation sequencing (NGS), diverse types of genomic and epigenetic variations can be detected with high accuracy. More importantly, instead of using linkage disequilibrium to detect association signals based on a set of pre-set probes, NGS allows researchers to directly study all the variants in each individual, therefore promises opportunities for identifying functional variants and a more comprehensive dissection of disease heritability. Although the current scale of NGS studies is still limited due to the high cost, the success of several recent studies suggests the great potential for applying NGS in genomic epidemiology, especially as the cost of sequencing continues to drop. In this review, we discuss several pioneer applications of NGS, summarize scientific discoveries for rare and complex diseases, and compare various study designs including targeted sequencing and whole-genome sequencing using population-based and family-based cohorts. Finally, we highlight recent advancements in statistical methods proposed for sequencing analysis, including group-based association tests, meta-analysis techniques, and annotation tools for variant prioritization.
format Online
Article
Text
id pubmed-4403555
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-44035552015-05-04 A review of study designs and statistical methods for genomic epidemiology studies using next generation sequencing Wang, Qian Lu, Qiongshi Zhao, Hongyu Front Genet Genetics Results from numerous linkage and association studies have greatly deepened scientists’ understanding of the genetic basis of many human diseases, yet some important questions remain unanswered. For example, although a large number of disease-associated loci have been identified from genome-wide association studies in the past 10 years, it is challenging to interpret these results as most disease-associated markers have no clear functional roles in disease etiology, and all the identified genomic factors only explain a small portion of disease heritability. With the help of next-generation sequencing (NGS), diverse types of genomic and epigenetic variations can be detected with high accuracy. More importantly, instead of using linkage disequilibrium to detect association signals based on a set of pre-set probes, NGS allows researchers to directly study all the variants in each individual, therefore promises opportunities for identifying functional variants and a more comprehensive dissection of disease heritability. Although the current scale of NGS studies is still limited due to the high cost, the success of several recent studies suggests the great potential for applying NGS in genomic epidemiology, especially as the cost of sequencing continues to drop. In this review, we discuss several pioneer applications of NGS, summarize scientific discoveries for rare and complex diseases, and compare various study designs including targeted sequencing and whole-genome sequencing using population-based and family-based cohorts. Finally, we highlight recent advancements in statistical methods proposed for sequencing analysis, including group-based association tests, meta-analysis techniques, and annotation tools for variant prioritization. Frontiers Media S.A. 2015-04-20 /pmc/articles/PMC4403555/ /pubmed/25941534 http://dx.doi.org/10.3389/fgene.2015.00149 Text en Copyright © 2015 Wang, Lu and Zhao. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Wang, Qian
Lu, Qiongshi
Zhao, Hongyu
A review of study designs and statistical methods for genomic epidemiology studies using next generation sequencing
title A review of study designs and statistical methods for genomic epidemiology studies using next generation sequencing
title_full A review of study designs and statistical methods for genomic epidemiology studies using next generation sequencing
title_fullStr A review of study designs and statistical methods for genomic epidemiology studies using next generation sequencing
title_full_unstemmed A review of study designs and statistical methods for genomic epidemiology studies using next generation sequencing
title_short A review of study designs and statistical methods for genomic epidemiology studies using next generation sequencing
title_sort review of study designs and statistical methods for genomic epidemiology studies using next generation sequencing
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4403555/
https://www.ncbi.nlm.nih.gov/pubmed/25941534
http://dx.doi.org/10.3389/fgene.2015.00149
work_keys_str_mv AT wangqian areviewofstudydesignsandstatisticalmethodsforgenomicepidemiologystudiesusingnextgenerationsequencing
AT luqiongshi areviewofstudydesignsandstatisticalmethodsforgenomicepidemiologystudiesusingnextgenerationsequencing
AT zhaohongyu areviewofstudydesignsandstatisticalmethodsforgenomicepidemiologystudiesusingnextgenerationsequencing
AT wangqian reviewofstudydesignsandstatisticalmethodsforgenomicepidemiologystudiesusingnextgenerationsequencing
AT luqiongshi reviewofstudydesignsandstatisticalmethodsforgenomicepidemiologystudiesusingnextgenerationsequencing
AT zhaohongyu reviewofstudydesignsandstatisticalmethodsforgenomicepidemiologystudiesusingnextgenerationsequencing