Cargando…

The impact of genetic structure on sequencing analysis

BACKGROUND: Genome-wide association studies have made substantial progress in identifying common variants associated with human diseases. Despite such success, a large portion of heritability remains unexplained. Evolutionary theory and empirical studies suggest that rare mutations could play an imp...

Descripción completa

Detalles Bibliográficos
Autores principales:	Jadhav, Sneha, Vsevolozhskaya, Olga A., Tong, Xiaoran, Lu, Qing
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2016
Materias:	Proceedings
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5133514/ https://www.ncbi.nlm.nih.gov/pubmed/27980631 http://dx.doi.org/10.1186/s12919-016-0025-x

_version_	1782471278579417088
author	Jadhav, Sneha Vsevolozhskaya, Olga A. Tong, Xiaoran Lu, Qing
author_facet	Jadhav, Sneha Vsevolozhskaya, Olga A. Tong, Xiaoran Lu, Qing
author_sort	Jadhav, Sneha
collection	PubMed
description	BACKGROUND: Genome-wide association studies have made substantial progress in identifying common variants associated with human diseases. Despite such success, a large portion of heritability remains unexplained. Evolutionary theory and empirical studies suggest that rare mutations could play an important role in human diseases, which motivates comprehensive investigation of rare variants in sequencing studies. To explore the association of rare variants with human diseases, many statistical approaches have been developed with different ways of modeling genetic structure (ie, linkage disequilibrium). Nevertheless, the appropriate strategy to model genetic structure of sequencing data and its effect on association analysis have not been well studied. METHODS: We investigate 3 statistical approaches that use 3 different strategies to model the genetic structure of sequencing data. We proceed by comparing a burden test that assumes independence among sequencing variants, a burden test that considers pairwise linkage disequilibrium (LD), and a functional analysis of variance (FANOVA) test that models genetic data through fitting continuous curves on individuals’ genotypes. RESULTS: Through simulations, we find that FANOVA attains better or comparable performance to the 2 burden tests. Overall, the burden test that considers pairwise LD has comparable performance to the burden test that assumes independence between sequencing variants. However, for 1 gene, where the disease-associated variant is located in an LD block, we find that considering pairwise LD could improve the test’s performance. CONCLUSIONS: The structure of sequencing variants is complex in nature and its patterns vary across the whole genome. In certain cases (eg, a disease-susceptibility variant is in an LD block), ignoring the genetic structure in the association analysis could result in suboptimal performance. Through this study, we show that a functional-based method is promising for modeling the underlying genetic structure of sequencing data, which could lead to better performance.
format	Online Article Text
id	pubmed-5133514
institution	National Center for Biotechnology Information
language	English
publishDate	2016
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-51335142016-12-15 The impact of genetic structure on sequencing analysis Jadhav, Sneha Vsevolozhskaya, Olga A. Tong, Xiaoran Lu, Qing BMC Proc Proceedings BACKGROUND: Genome-wide association studies have made substantial progress in identifying common variants associated with human diseases. Despite such success, a large portion of heritability remains unexplained. Evolutionary theory and empirical studies suggest that rare mutations could play an important role in human diseases, which motivates comprehensive investigation of rare variants in sequencing studies. To explore the association of rare variants with human diseases, many statistical approaches have been developed with different ways of modeling genetic structure (ie, linkage disequilibrium). Nevertheless, the appropriate strategy to model genetic structure of sequencing data and its effect on association analysis have not been well studied. METHODS: We investigate 3 statistical approaches that use 3 different strategies to model the genetic structure of sequencing data. We proceed by comparing a burden test that assumes independence among sequencing variants, a burden test that considers pairwise linkage disequilibrium (LD), and a functional analysis of variance (FANOVA) test that models genetic data through fitting continuous curves on individuals’ genotypes. RESULTS: Through simulations, we find that FANOVA attains better or comparable performance to the 2 burden tests. Overall, the burden test that considers pairwise LD has comparable performance to the burden test that assumes independence between sequencing variants. However, for 1 gene, where the disease-associated variant is located in an LD block, we find that considering pairwise LD could improve the test’s performance. CONCLUSIONS: The structure of sequencing variants is complex in nature and its patterns vary across the whole genome. In certain cases (eg, a disease-susceptibility variant is in an LD block), ignoring the genetic structure in the association analysis could result in suboptimal performance. Through this study, we show that a functional-based method is promising for modeling the underlying genetic structure of sequencing data, which could lead to better performance. BioMed Central 2016-10-18 /pmc/articles/PMC5133514/ /pubmed/27980631 http://dx.doi.org/10.1186/s12919-016-0025-x Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Proceedings Jadhav, Sneha Vsevolozhskaya, Olga A. Tong, Xiaoran Lu, Qing The impact of genetic structure on sequencing analysis
title	The impact of genetic structure on sequencing analysis
title_full	The impact of genetic structure on sequencing analysis
title_fullStr	The impact of genetic structure on sequencing analysis
title_full_unstemmed	The impact of genetic structure on sequencing analysis
title_short	The impact of genetic structure on sequencing analysis
title_sort	impact of genetic structure on sequencing analysis
topic	Proceedings
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5133514/ https://www.ncbi.nlm.nih.gov/pubmed/27980631 http://dx.doi.org/10.1186/s12919-016-0025-x
work_keys_str_mv	AT jadhavsneha theimpactofgeneticstructureonsequencinganalysis AT vsevolozhskayaolgaa theimpactofgeneticstructureonsequencinganalysis AT tongxiaoran theimpactofgeneticstructureonsequencinganalysis AT luqing theimpactofgeneticstructureonsequencinganalysis AT jadhavsneha impactofgeneticstructureonsequencinganalysis AT vsevolozhskayaolgaa impactofgeneticstructureonsequencinganalysis AT tongxiaoran impactofgeneticstructureonsequencinganalysis AT luqing impactofgeneticstructureonsequencinganalysis

The impact of genetic structure on sequencing analysis

Ejemplares similares