Cargando…

A New Genotype Imputation Method with Tolerance to High Missing Rate and Rare Variants

We report a novel algorithm, iBLUP, to impute missing genotypes by simultaneously and comprehensively using identity by descent and linkage disequilibrium information. The simulation studies showed that the algorithm exhibited drastically tolerance to high missing rate, especially for rare variants...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Yumei, Wang, Qishan, Chen, Qiang, Liao, Rongrong, Zhang, Xiangzhe, Yang, Hongjie, Zheng, Youmin, Zhang, Zhiwu, Pan, Yuchun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4074155/
https://www.ncbi.nlm.nih.gov/pubmed/24972110
http://dx.doi.org/10.1371/journal.pone.0101025
Descripción
Sumario:We report a novel algorithm, iBLUP, to impute missing genotypes by simultaneously and comprehensively using identity by descent and linkage disequilibrium information. The simulation studies showed that the algorithm exhibited drastically tolerance to high missing rate, especially for rare variants than other common imputation methods, e.g. BEAGLE and fastPHASE. At a missing rate of 70%, the accuracy of BEAGLE and fastPHASE dropped to 0.82 and 0.74 respectively while iBLUP retained an accuracy of 0.95. For minor allele, the accuracy of BEAGLE and fastPHASE decreased to −0.1 and 0.03, while iBLUP still had an accuracy of 0.61.We implemented the algorithm in a publicly available software package also named iBLUP. The application of iBLUP for processing real sequencing data in an outbred pig population was demonstrated.