Cargando…

A New Genotype Imputation Method with Tolerance to High Missing Rate and Rare Variants

We report a novel algorithm, iBLUP, to impute missing genotypes by simultaneously and comprehensively using identity by descent and linkage disequilibrium information. The simulation studies showed that the algorithm exhibited drastically tolerance to high missing rate, especially for rare variants...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Yumei, Wang, Qishan, Chen, Qiang, Liao, Rongrong, Zhang, Xiangzhe, Yang, Hongjie, Zheng, Youmin, Zhang, Zhiwu, Pan, Yuchun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4074155/
https://www.ncbi.nlm.nih.gov/pubmed/24972110
http://dx.doi.org/10.1371/journal.pone.0101025
_version_ 1782323190392946688
author Yang, Yumei
Wang, Qishan
Chen, Qiang
Liao, Rongrong
Zhang, Xiangzhe
Yang, Hongjie
Zheng, Youmin
Zhang, Zhiwu
Pan, Yuchun
author_facet Yang, Yumei
Wang, Qishan
Chen, Qiang
Liao, Rongrong
Zhang, Xiangzhe
Yang, Hongjie
Zheng, Youmin
Zhang, Zhiwu
Pan, Yuchun
author_sort Yang, Yumei
collection PubMed
description We report a novel algorithm, iBLUP, to impute missing genotypes by simultaneously and comprehensively using identity by descent and linkage disequilibrium information. The simulation studies showed that the algorithm exhibited drastically tolerance to high missing rate, especially for rare variants than other common imputation methods, e.g. BEAGLE and fastPHASE. At a missing rate of 70%, the accuracy of BEAGLE and fastPHASE dropped to 0.82 and 0.74 respectively while iBLUP retained an accuracy of 0.95. For minor allele, the accuracy of BEAGLE and fastPHASE decreased to −0.1 and 0.03, while iBLUP still had an accuracy of 0.61.We implemented the algorithm in a publicly available software package also named iBLUP. The application of iBLUP for processing real sequencing data in an outbred pig population was demonstrated.
format Online
Article
Text
id pubmed-4074155
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-40741552014-07-02 A New Genotype Imputation Method with Tolerance to High Missing Rate and Rare Variants Yang, Yumei Wang, Qishan Chen, Qiang Liao, Rongrong Zhang, Xiangzhe Yang, Hongjie Zheng, Youmin Zhang, Zhiwu Pan, Yuchun PLoS One Research Article We report a novel algorithm, iBLUP, to impute missing genotypes by simultaneously and comprehensively using identity by descent and linkage disequilibrium information. The simulation studies showed that the algorithm exhibited drastically tolerance to high missing rate, especially for rare variants than other common imputation methods, e.g. BEAGLE and fastPHASE. At a missing rate of 70%, the accuracy of BEAGLE and fastPHASE dropped to 0.82 and 0.74 respectively while iBLUP retained an accuracy of 0.95. For minor allele, the accuracy of BEAGLE and fastPHASE decreased to −0.1 and 0.03, while iBLUP still had an accuracy of 0.61.We implemented the algorithm in a publicly available software package also named iBLUP. The application of iBLUP for processing real sequencing data in an outbred pig population was demonstrated. Public Library of Science 2014-06-27 /pmc/articles/PMC4074155/ /pubmed/24972110 http://dx.doi.org/10.1371/journal.pone.0101025 Text en © 2014 Yang et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Yang, Yumei
Wang, Qishan
Chen, Qiang
Liao, Rongrong
Zhang, Xiangzhe
Yang, Hongjie
Zheng, Youmin
Zhang, Zhiwu
Pan, Yuchun
A New Genotype Imputation Method with Tolerance to High Missing Rate and Rare Variants
title A New Genotype Imputation Method with Tolerance to High Missing Rate and Rare Variants
title_full A New Genotype Imputation Method with Tolerance to High Missing Rate and Rare Variants
title_fullStr A New Genotype Imputation Method with Tolerance to High Missing Rate and Rare Variants
title_full_unstemmed A New Genotype Imputation Method with Tolerance to High Missing Rate and Rare Variants
title_short A New Genotype Imputation Method with Tolerance to High Missing Rate and Rare Variants
title_sort new genotype imputation method with tolerance to high missing rate and rare variants
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4074155/
https://www.ncbi.nlm.nih.gov/pubmed/24972110
http://dx.doi.org/10.1371/journal.pone.0101025
work_keys_str_mv AT yangyumei anewgenotypeimputationmethodwithtolerancetohighmissingrateandrarevariants
AT wangqishan anewgenotypeimputationmethodwithtolerancetohighmissingrateandrarevariants
AT chenqiang anewgenotypeimputationmethodwithtolerancetohighmissingrateandrarevariants
AT liaorongrong anewgenotypeimputationmethodwithtolerancetohighmissingrateandrarevariants
AT zhangxiangzhe anewgenotypeimputationmethodwithtolerancetohighmissingrateandrarevariants
AT yanghongjie anewgenotypeimputationmethodwithtolerancetohighmissingrateandrarevariants
AT zhengyoumin anewgenotypeimputationmethodwithtolerancetohighmissingrateandrarevariants
AT zhangzhiwu anewgenotypeimputationmethodwithtolerancetohighmissingrateandrarevariants
AT panyuchun anewgenotypeimputationmethodwithtolerancetohighmissingrateandrarevariants
AT yangyumei newgenotypeimputationmethodwithtolerancetohighmissingrateandrarevariants
AT wangqishan newgenotypeimputationmethodwithtolerancetohighmissingrateandrarevariants
AT chenqiang newgenotypeimputationmethodwithtolerancetohighmissingrateandrarevariants
AT liaorongrong newgenotypeimputationmethodwithtolerancetohighmissingrateandrarevariants
AT zhangxiangzhe newgenotypeimputationmethodwithtolerancetohighmissingrateandrarevariants
AT yanghongjie newgenotypeimputationmethodwithtolerancetohighmissingrateandrarevariants
AT zhengyoumin newgenotypeimputationmethodwithtolerancetohighmissingrateandrarevariants
AT zhangzhiwu newgenotypeimputationmethodwithtolerancetohighmissingrateandrarevariants
AT panyuchun newgenotypeimputationmethodwithtolerancetohighmissingrateandrarevariants