Cargando…

References for Haplotype Imputation in the Big Data Era

Imputation is a powerful in silico approach to fill in those missing values in the big datasets. This process requires a reference panel, which is a collection of big data from which the missing information can be extracted and imputed. Haplotype imputation requires ethnicity-matched references; a m...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Wenzhi, Xu, Wei, Li, Qiling, Ma, Li, Song, Qing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4888899/
https://www.ncbi.nlm.nih.gov/pubmed/27274952
http://dx.doi.org/10.4172/2168-9547.1000143
_version_ 1782434919374389248
author Li, Wenzhi
Xu, Wei
Li, Qiling
Ma, Li
Song, Qing
author_facet Li, Wenzhi
Xu, Wei
Li, Qiling
Ma, Li
Song, Qing
author_sort Li, Wenzhi
collection PubMed
description Imputation is a powerful in silico approach to fill in those missing values in the big datasets. This process requires a reference panel, which is a collection of big data from which the missing information can be extracted and imputed. Haplotype imputation requires ethnicity-matched references; a mismatched reference panel will significantly reduce the quality of imputation. However, currently existing big datasets cover only a small number of ethnicities, there is a lack of ethnicity-matched references for many ethnic populations in the world, which has hampered the data imputation of haplotypes and its downstream applications. To solve this issue, several approaches have been proposed and explored, including the mixed reference panel, the internal reference panel and genotype-converted reference panel. This review article provides the information and comparison between these approaches. Increasing evidence showed that not just one or two genetic elements dictate the gene activity and functions; instead, cis-interactions of multiple elements dictate gene activity. Cis-interactions require the interacting elements to be on the same chromosome molecule, therefore, haplotype analysis is essential for the investigation of cis-interactions among multiple genetic variants at different loci, and appears to be especially important for studying the common diseases. It will be valuable in a wide spectrum of applications from academic research, to clinical diagnosis, prevention, treatment, and pharmaceutical industry.
format Online
Article
Text
id pubmed-4888899
institution National Center for Biotechnology Information
language English
publishDate 2015
record_format MEDLINE/PubMed
spelling pubmed-48888992016-06-01 References for Haplotype Imputation in the Big Data Era Li, Wenzhi Xu, Wei Li, Qiling Ma, Li Song, Qing Mol Biol (Los Angel) Article Imputation is a powerful in silico approach to fill in those missing values in the big datasets. This process requires a reference panel, which is a collection of big data from which the missing information can be extracted and imputed. Haplotype imputation requires ethnicity-matched references; a mismatched reference panel will significantly reduce the quality of imputation. However, currently existing big datasets cover only a small number of ethnicities, there is a lack of ethnicity-matched references for many ethnic populations in the world, which has hampered the data imputation of haplotypes and its downstream applications. To solve this issue, several approaches have been proposed and explored, including the mixed reference panel, the internal reference panel and genotype-converted reference panel. This review article provides the information and comparison between these approaches. Increasing evidence showed that not just one or two genetic elements dictate the gene activity and functions; instead, cis-interactions of multiple elements dictate gene activity. Cis-interactions require the interacting elements to be on the same chromosome molecule, therefore, haplotype analysis is essential for the investigation of cis-interactions among multiple genetic variants at different loci, and appears to be especially important for studying the common diseases. It will be valuable in a wide spectrum of applications from academic research, to clinical diagnosis, prevention, treatment, and pharmaceutical industry. 2015-10-31 2015-11 /pmc/articles/PMC4888899/ /pubmed/27274952 http://dx.doi.org/10.4172/2168-9547.1000143 Text en http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Article
Li, Wenzhi
Xu, Wei
Li, Qiling
Ma, Li
Song, Qing
References for Haplotype Imputation in the Big Data Era
title References for Haplotype Imputation in the Big Data Era
title_full References for Haplotype Imputation in the Big Data Era
title_fullStr References for Haplotype Imputation in the Big Data Era
title_full_unstemmed References for Haplotype Imputation in the Big Data Era
title_short References for Haplotype Imputation in the Big Data Era
title_sort references for haplotype imputation in the big data era
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4888899/
https://www.ncbi.nlm.nih.gov/pubmed/27274952
http://dx.doi.org/10.4172/2168-9547.1000143
work_keys_str_mv AT liwenzhi referencesforhaplotypeimputationinthebigdataera
AT xuwei referencesforhaplotypeimputationinthebigdataera
AT liqiling referencesforhaplotypeimputationinthebigdataera
AT mali referencesforhaplotypeimputationinthebigdataera
AT songqing referencesforhaplotypeimputationinthebigdataera