Cargando…

Cladistic analysis of genotype data-application to GAW15 Problem 3

Given the increasing size of modern genetic data sets and, in particular, the move towards genome-wide studies, there is merit in considering analyses that gain computational efficiency by being more heuristic in nature. With this in mind, we present results of cladistic analyses methods on the Gene...

Descripción completa

Detalles Bibliográficos
Autores principales: Jung, Hsuan, Zhao, Keyan, Marjoram, Paul
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2367549/
https://www.ncbi.nlm.nih.gov/pubmed/18466467
_version_ 1782154318427717632
author Jung, Hsuan
Zhao, Keyan
Marjoram, Paul
author_facet Jung, Hsuan
Zhao, Keyan
Marjoram, Paul
author_sort Jung, Hsuan
collection PubMed
description Given the increasing size of modern genetic data sets and, in particular, the move towards genome-wide studies, there is merit in considering analyses that gain computational efficiency by being more heuristic in nature. With this in mind, we present results of cladistic analyses methods on the Genetic Analysis Workshop 15 Problem 3 simulated data (answers known). Our analysis attempts to capture similarities between individuals using a series of trees, and then looks for regions in which mutations on those trees can successfully explain a phenotype of interest. Existing varieties of such algorithms assume haplotypes are known, or have been inferred, an assumption that is often unrealistic for genome-wide data. We therefore present an extension of these methods that can successfully analyze genotype, rather than haplotype, data.
format Text
id pubmed-2367549
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-23675492008-05-06 Cladistic analysis of genotype data-application to GAW15 Problem 3 Jung, Hsuan Zhao, Keyan Marjoram, Paul BMC Proc Proceedings Given the increasing size of modern genetic data sets and, in particular, the move towards genome-wide studies, there is merit in considering analyses that gain computational efficiency by being more heuristic in nature. With this in mind, we present results of cladistic analyses methods on the Genetic Analysis Workshop 15 Problem 3 simulated data (answers known). Our analysis attempts to capture similarities between individuals using a series of trees, and then looks for regions in which mutations on those trees can successfully explain a phenotype of interest. Existing varieties of such algorithms assume haplotypes are known, or have been inferred, an assumption that is often unrealistic for genome-wide data. We therefore present an extension of these methods that can successfully analyze genotype, rather than haplotype, data. BioMed Central 2007-12-18 /pmc/articles/PMC2367549/ /pubmed/18466467 Text en Copyright © 2007 Jung et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Jung, Hsuan
Zhao, Keyan
Marjoram, Paul
Cladistic analysis of genotype data-application to GAW15 Problem 3
title Cladistic analysis of genotype data-application to GAW15 Problem 3
title_full Cladistic analysis of genotype data-application to GAW15 Problem 3
title_fullStr Cladistic analysis of genotype data-application to GAW15 Problem 3
title_full_unstemmed Cladistic analysis of genotype data-application to GAW15 Problem 3
title_short Cladistic analysis of genotype data-application to GAW15 Problem 3
title_sort cladistic analysis of genotype data-application to gaw15 problem 3
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2367549/
https://www.ncbi.nlm.nih.gov/pubmed/18466467
work_keys_str_mv AT junghsuan cladisticanalysisofgenotypedataapplicationtogaw15problem3
AT zhaokeyan cladisticanalysisofgenotypedataapplicationtogaw15problem3
AT marjorampaul cladisticanalysisofgenotypedataapplicationtogaw15problem3