Cargando…
Dealing with missing phase and missing data in phylogeny-based analysis
We recently described a new method to identify disease susceptibility loci, based on the analysis of the evolutionary relationships between haplotypes of cases and controls. However, haplotypes are often unknown and the problem of phase inference is even more crucial when there are missing data. In...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2007
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2367603/ https://www.ncbi.nlm.nih.gov/pubmed/18466519 |
_version_ | 1782154331665989632 |
---|---|
author | Bardel, Claire Croiseau, Pascal Génin, Emmanuelle |
author_facet | Bardel, Claire Croiseau, Pascal Génin, Emmanuelle |
author_sort | Bardel, Claire |
collection | PubMed |
description | We recently described a new method to identify disease susceptibility loci, based on the analysis of the evolutionary relationships between haplotypes of cases and controls. However, haplotypes are often unknown and the problem of phase inference is even more crucial when there are missing data. In this work, we suggest using a multiple imputation algorithm to deal with missing phase and missing data, prior to a phylogeny-based analysis. We used the simulated data of Genetic Analysis Workshop 15 (Problem 3, answer known) to assess the power of the phylogeny-based analysis to detect disease susceptibility loci after reconstruction of haplotypes by a multiple-imputation method. We compare, for various rates of missing data, the performance of the multiple imputation method with the performance achieved when considering only the most probable haplotypic configurations or the true phase. When only the phase is unknown, all methods perform approximately the same to identify disease susceptibility sites. In the presence of missing data however, the detection of disease susceptibility sites is significantly better when reconstructing haplotypes by multiple imputation than when considering only the best haplotype configurations. |
format | Text |
id | pubmed-2367603 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-23676032008-05-06 Dealing with missing phase and missing data in phylogeny-based analysis Bardel, Claire Croiseau, Pascal Génin, Emmanuelle BMC Proc Proceedings We recently described a new method to identify disease susceptibility loci, based on the analysis of the evolutionary relationships between haplotypes of cases and controls. However, haplotypes are often unknown and the problem of phase inference is even more crucial when there are missing data. In this work, we suggest using a multiple imputation algorithm to deal with missing phase and missing data, prior to a phylogeny-based analysis. We used the simulated data of Genetic Analysis Workshop 15 (Problem 3, answer known) to assess the power of the phylogeny-based analysis to detect disease susceptibility loci after reconstruction of haplotypes by a multiple-imputation method. We compare, for various rates of missing data, the performance of the multiple imputation method with the performance achieved when considering only the most probable haplotypic configurations or the true phase. When only the phase is unknown, all methods perform approximately the same to identify disease susceptibility sites. In the presence of missing data however, the detection of disease susceptibility sites is significantly better when reconstructing haplotypes by multiple imputation than when considering only the best haplotype configurations. BioMed Central 2007-12-18 /pmc/articles/PMC2367603/ /pubmed/18466519 Text en Copyright © 2007 Bardel et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Bardel, Claire Croiseau, Pascal Génin, Emmanuelle Dealing with missing phase and missing data in phylogeny-based analysis |
title | Dealing with missing phase and missing data in phylogeny-based analysis |
title_full | Dealing with missing phase and missing data in phylogeny-based analysis |
title_fullStr | Dealing with missing phase and missing data in phylogeny-based analysis |
title_full_unstemmed | Dealing with missing phase and missing data in phylogeny-based analysis |
title_short | Dealing with missing phase and missing data in phylogeny-based analysis |
title_sort | dealing with missing phase and missing data in phylogeny-based analysis |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2367603/ https://www.ncbi.nlm.nih.gov/pubmed/18466519 |
work_keys_str_mv | AT bardelclaire dealingwithmissingphaseandmissingdatainphylogenybasedanalysis AT croiseaupascal dealingwithmissingphaseandmissingdatainphylogenybasedanalysis AT geninemmanuelle dealingwithmissingphaseandmissingdatainphylogenybasedanalysis |