
Dealing with missing phase and missing data in phylogeny-based analysis

We recently described a new method to identify disease susceptibility loci, based on the analysis of the evolutionary relationships between haplotypes of cases and controls. However, haplotypes are often unknown and the problem of phase inference is even more crucial when there are missing data. In...


Bibliographic Details
Main authors: Bardel, Claire; Croiseau, Pascal; Génin, Emmanuelle
Format: Text
Language: English
Published: BioMed Central 2007
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2367603/
https://www.ncbi.nlm.nih.gov/pubmed/18466519
author Bardel, Claire
Croiseau, Pascal
Génin, Emmanuelle
collection PubMed
description We recently described a new method to identify disease susceptibility loci, based on the analysis of the evolutionary relationships between haplotypes of cases and controls. However, haplotypes are often unknown and the problem of phase inference is even more crucial when there are missing data. In this work, we suggest using a multiple-imputation algorithm to deal with missing phase and missing data, prior to a phylogeny-based analysis. We used the simulated data of Genetic Analysis Workshop 15 (Problem 3, answer known) to assess the power of the phylogeny-based analysis to detect disease susceptibility loci after reconstruction of haplotypes by a multiple-imputation method. We compare, for various rates of missing data, the performance of the multiple-imputation method with the performance achieved when considering only the most probable haplotypic configurations or the true phase. When only the phase is unknown, all methods perform approximately equally well in identifying disease susceptibility sites. In the presence of missing data, however, the detection of disease susceptibility sites is significantly better when reconstructing haplotypes by multiple imputation than when considering only the best haplotype configurations.
format Text
id pubmed-2367603
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-2367603 2008-05-06 Dealing with missing phase and missing data in phylogeny-based analysis Bardel, Claire Croiseau, Pascal Génin, Emmanuelle BMC Proc Proceedings We recently described a new method to identify disease susceptibility loci, based on the analysis of the evolutionary relationships between haplotypes of cases and controls. However, haplotypes are often unknown and the problem of phase inference is even more crucial when there are missing data. In this work, we suggest using a multiple-imputation algorithm to deal with missing phase and missing data, prior to a phylogeny-based analysis. We used the simulated data of Genetic Analysis Workshop 15 (Problem 3, answer known) to assess the power of the phylogeny-based analysis to detect disease susceptibility loci after reconstruction of haplotypes by a multiple-imputation method. We compare, for various rates of missing data, the performance of the multiple-imputation method with the performance achieved when considering only the most probable haplotypic configurations or the true phase. When only the phase is unknown, all methods perform approximately equally well in identifying disease susceptibility sites. In the presence of missing data, however, the detection of disease susceptibility sites is significantly better when reconstructing haplotypes by multiple imputation than when considering only the best haplotype configurations. BioMed Central 2007-12-18 /pmc/articles/PMC2367603/ /pubmed/18466519 Text en Copyright © 2007 Bardel et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title Dealing with missing phase and missing data in phylogeny-based analysis
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2367603/
https://www.ncbi.nlm.nih.gov/pubmed/18466519