Cargando…
The impact of genotype calling errors on family-based studies
Family-based sequencing studies have unique advantages in enriching rare variants, controlling population stratification, and improving genotype calling. Standard genotype calling algorithms are less likely to call rare variants correctly, often mistakenly calling heterozygotes as reference homozygo...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4916415/ https://www.ncbi.nlm.nih.gov/pubmed/27328765 http://dx.doi.org/10.1038/srep28323 |
_version_ | 1782438825574793216 |
---|---|
author | Yan, Qi Chen, Rui Sutcliffe, James S. Cook, Edwin H. Weeks, Daniel E. Li, Bingshan Chen, Wei |
author_facet | Yan, Qi Chen, Rui Sutcliffe, James S. Cook, Edwin H. Weeks, Daniel E. Li, Bingshan Chen, Wei |
author_sort | Yan, Qi |
collection | PubMed |
description | Family-based sequencing studies have unique advantages in enriching rare variants, controlling population stratification, and improving genotype calling. Standard genotype calling algorithms are less likely to call rare variants correctly, often mistakenly calling heterozygotes as reference homozygotes. The consequences of such non-random errors on association tests for rare variants are unclear, particularly in transmission-based tests. In this study, we investigated the impact of genotyping errors on rare variant association tests of family-based sequence data. We performed a comprehensive analysis to study how genotype calling errors affect type I error and statistical power of transmission-based association tests using a variety of realistic parameters in family-based sequencing studies. In simulation studies, we found that biased genotype calling errors yielded not only an inflation of type I error but also a power loss of association tests. We further confirmed our observation using exome sequence data from an autism project. We concluded that non-symmetric genotype calling errors need careful consideration in the analysis of family-based sequence data and we provided practical guidance on ameliorating the test bias. |
format | Online Article Text |
id | pubmed-4916415 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-49164152016-06-27 The impact of genotype calling errors on family-based studies Yan, Qi Chen, Rui Sutcliffe, James S. Cook, Edwin H. Weeks, Daniel E. Li, Bingshan Chen, Wei Sci Rep Article Family-based sequencing studies have unique advantages in enriching rare variants, controlling population stratification, and improving genotype calling. Standard genotype calling algorithms are less likely to call rare variants correctly, often mistakenly calling heterozygotes as reference homozygotes. The consequences of such non-random errors on association tests for rare variants are unclear, particularly in transmission-based tests. In this study, we investigated the impact of genotyping errors on rare variant association tests of family-based sequence data. We performed a comprehensive analysis to study how genotype calling errors affect type I error and statistical power of transmission-based association tests using a variety of realistic parameters in family-based sequencing studies. In simulation studies, we found that biased genotype calling errors yielded not only an inflation of type I error but also a power loss of association tests. We further confirmed our observation using exome sequence data from an autism project. We concluded that non-symmetric genotype calling errors need careful consideration in the analysis of family-based sequence data and we provided practical guidance on ameliorating the test bias. Nature Publishing Group 2016-06-22 /pmc/articles/PMC4916415/ /pubmed/27328765 http://dx.doi.org/10.1038/srep28323 Text en Copyright © 2016, Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Yan, Qi Chen, Rui Sutcliffe, James S. Cook, Edwin H. Weeks, Daniel E. Li, Bingshan Chen, Wei The impact of genotype calling errors on family-based studies |
title | The impact of genotype calling errors on family-based studies |
title_full | The impact of genotype calling errors on family-based studies |
title_fullStr | The impact of genotype calling errors on family-based studies |
title_full_unstemmed | The impact of genotype calling errors on family-based studies |
title_short | The impact of genotype calling errors on family-based studies |
title_sort | impact of genotype calling errors on family-based studies |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4916415/ https://www.ncbi.nlm.nih.gov/pubmed/27328765 http://dx.doi.org/10.1038/srep28323 |
work_keys_str_mv | AT yanqi theimpactofgenotypecallingerrorsonfamilybasedstudies AT chenrui theimpactofgenotypecallingerrorsonfamilybasedstudies AT sutcliffejamess theimpactofgenotypecallingerrorsonfamilybasedstudies AT cookedwinh theimpactofgenotypecallingerrorsonfamilybasedstudies AT weeksdaniele theimpactofgenotypecallingerrorsonfamilybasedstudies AT libingshan theimpactofgenotypecallingerrorsonfamilybasedstudies AT chenwei theimpactofgenotypecallingerrorsonfamilybasedstudies AT yanqi impactofgenotypecallingerrorsonfamilybasedstudies AT chenrui impactofgenotypecallingerrorsonfamilybasedstudies AT sutcliffejamess impactofgenotypecallingerrorsonfamilybasedstudies AT cookedwinh impactofgenotypecallingerrorsonfamilybasedstudies AT weeksdaniele impactofgenotypecallingerrorsonfamilybasedstudies AT libingshan impactofgenotypecallingerrorsonfamilybasedstudies AT chenwei impactofgenotypecallingerrorsonfamilybasedstudies |