Cargando…

The impact of genotype calling errors on family-based studies

Family-based sequencing studies have unique advantages in enriching rare variants, controlling population stratification, and improving genotype calling. Standard genotype calling algorithms are less likely to call rare variants correctly, often mistakenly calling heterozygotes as reference homozygo...

Descripción completa

Detalles Bibliográficos
Autores principales: Yan, Qi, Chen, Rui, Sutcliffe, James S., Cook, Edwin H., Weeks, Daniel E., Li, Bingshan, Chen, Wei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4916415/
https://www.ncbi.nlm.nih.gov/pubmed/27328765
http://dx.doi.org/10.1038/srep28323
_version_ 1782438825574793216
author Yan, Qi
Chen, Rui
Sutcliffe, James S.
Cook, Edwin H.
Weeks, Daniel E.
Li, Bingshan
Chen, Wei
author_facet Yan, Qi
Chen, Rui
Sutcliffe, James S.
Cook, Edwin H.
Weeks, Daniel E.
Li, Bingshan
Chen, Wei
author_sort Yan, Qi
collection PubMed
description Family-based sequencing studies have unique advantages in enriching rare variants, controlling population stratification, and improving genotype calling. Standard genotype calling algorithms are less likely to call rare variants correctly, often mistakenly calling heterozygotes as reference homozygotes. The consequences of such non-random errors on association tests for rare variants are unclear, particularly in transmission-based tests. In this study, we investigated the impact of genotyping errors on rare variant association tests of family-based sequence data. We performed a comprehensive analysis to study how genotype calling errors affect type I error and statistical power of transmission-based association tests using a variety of realistic parameters in family-based sequencing studies. In simulation studies, we found that biased genotype calling errors yielded not only an inflation of type I error but also a power loss of association tests. We further confirmed our observation using exome sequence data from an autism project. We concluded that non-symmetric genotype calling errors need careful consideration in the analysis of family-based sequence data and we provided practical guidance on ameliorating the test bias.
format Online
Article
Text
id pubmed-4916415
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-49164152016-06-27 The impact of genotype calling errors on family-based studies Yan, Qi Chen, Rui Sutcliffe, James S. Cook, Edwin H. Weeks, Daniel E. Li, Bingshan Chen, Wei Sci Rep Article Family-based sequencing studies have unique advantages in enriching rare variants, controlling population stratification, and improving genotype calling. Standard genotype calling algorithms are less likely to call rare variants correctly, often mistakenly calling heterozygotes as reference homozygotes. The consequences of such non-random errors on association tests for rare variants are unclear, particularly in transmission-based tests. In this study, we investigated the impact of genotyping errors on rare variant association tests of family-based sequence data. We performed a comprehensive analysis to study how genotype calling errors affect type I error and statistical power of transmission-based association tests using a variety of realistic parameters in family-based sequencing studies. In simulation studies, we found that biased genotype calling errors yielded not only an inflation of type I error but also a power loss of association tests. We further confirmed our observation using exome sequence data from an autism project. We concluded that non-symmetric genotype calling errors need careful consideration in the analysis of family-based sequence data and we provided practical guidance on ameliorating the test bias. Nature Publishing Group 2016-06-22 /pmc/articles/PMC4916415/ /pubmed/27328765 http://dx.doi.org/10.1038/srep28323 Text en Copyright © 2016, Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Yan, Qi
Chen, Rui
Sutcliffe, James S.
Cook, Edwin H.
Weeks, Daniel E.
Li, Bingshan
Chen, Wei
The impact of genotype calling errors on family-based studies
title The impact of genotype calling errors on family-based studies
title_full The impact of genotype calling errors on family-based studies
title_fullStr The impact of genotype calling errors on family-based studies
title_full_unstemmed The impact of genotype calling errors on family-based studies
title_short The impact of genotype calling errors on family-based studies
title_sort impact of genotype calling errors on family-based studies
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4916415/
https://www.ncbi.nlm.nih.gov/pubmed/27328765
http://dx.doi.org/10.1038/srep28323
work_keys_str_mv AT yanqi theimpactofgenotypecallingerrorsonfamilybasedstudies
AT chenrui theimpactofgenotypecallingerrorsonfamilybasedstudies
AT sutcliffejamess theimpactofgenotypecallingerrorsonfamilybasedstudies
AT cookedwinh theimpactofgenotypecallingerrorsonfamilybasedstudies
AT weeksdaniele theimpactofgenotypecallingerrorsonfamilybasedstudies
AT libingshan theimpactofgenotypecallingerrorsonfamilybasedstudies
AT chenwei theimpactofgenotypecallingerrorsonfamilybasedstudies
AT yanqi impactofgenotypecallingerrorsonfamilybasedstudies
AT chenrui impactofgenotypecallingerrorsonfamilybasedstudies
AT sutcliffejamess impactofgenotypecallingerrorsonfamilybasedstudies
AT cookedwinh impactofgenotypecallingerrorsonfamilybasedstudies
AT weeksdaniele impactofgenotypecallingerrorsonfamilybasedstudies
AT libingshan impactofgenotypecallingerrorsonfamilybasedstudies
AT chenwei impactofgenotypecallingerrorsonfamilybasedstudies