Cargando…

SUP: an extension to SLINK to allow a larger number of marker loci to be simulated in pedigrees conditional on trait values

BACKGROUND: With the recent advances in high-throughput genotyping technologies that allow for large-scale association mapping of human complex traits, promising statistical designs and methods have been emerging. Efficient simulation software are key elements for the evaluation of the properties of...

Descripción completa

Detalles Bibliográficos
Autor principal: Lemire, Mathieu
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1524809/
https://www.ncbi.nlm.nih.gov/pubmed/16803631
http://dx.doi.org/10.1186/1471-2156-7-40
_version_ 1782128851053182976
author Lemire, Mathieu
author_facet Lemire, Mathieu
author_sort Lemire, Mathieu
collection PubMed
description BACKGROUND: With the recent advances in high-throughput genotyping technologies that allow for large-scale association mapping of human complex traits, promising statistical designs and methods have been emerging. Efficient simulation software are key elements for the evaluation of the properties of new statistical tests. SLINK is a flexible simulation tool that has been widely used to generate the segregation and recombination processes of markers linked to, and possibly associated with, a trait locus, conditional on trait values in arbitrary pedigrees. In practice, its most serious limitation is the small number of loci that can be simulated, since the complexity of the algorithm scales exponentially with this number. RESULTS: I describe the implementation of a two-step algorithm to be used in conjunction with SLINK to enable the simulation of a large number of marker loci linked to a trait locus and conditional on trait values in families, with the possibility for the loci to be in linkage disequilibrium. SLINK is used in the first step to simulate genotypes at the trait locus conditional on the observed trait values, and also to generate an indicator of the descent path of the simulated alleles. In the second step, marker alleles or haplotypes are generated in the founders, conditional on the trait locus genotypes simulated in the first step. Then the recombination process between the marker loci takes place conditionally on the descent path and on the trait locus genotypes. This two-step implementation is often computationally faster than other software that are designed to generate marker data linked to, and possibly associated with, a trait locus. CONCLUSION: Because the proposed method uses SLINK to simulate the segregation process, it benefits from its flexibility: the trait may be qualitative with the possibility of defining different liability classes (which allows for the simulation of gene-environment interactions or even the simulation of multi-locus effects between unlinked susceptibility regions) or it may be quantitative and normally distributed. In particular, this implementation is the only one available that can generate a large number of marker loci conditional on the set of observed quantitative trait values in pedigrees.
format Text
id pubmed-1524809
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-15248092006-07-29 SUP: an extension to SLINK to allow a larger number of marker loci to be simulated in pedigrees conditional on trait values Lemire, Mathieu BMC Genet Software BACKGROUND: With the recent advances in high-throughput genotyping technologies that allow for large-scale association mapping of human complex traits, promising statistical designs and methods have been emerging. Efficient simulation software are key elements for the evaluation of the properties of new statistical tests. SLINK is a flexible simulation tool that has been widely used to generate the segregation and recombination processes of markers linked to, and possibly associated with, a trait locus, conditional on trait values in arbitrary pedigrees. In practice, its most serious limitation is the small number of loci that can be simulated, since the complexity of the algorithm scales exponentially with this number. RESULTS: I describe the implementation of a two-step algorithm to be used in conjunction with SLINK to enable the simulation of a large number of marker loci linked to a trait locus and conditional on trait values in families, with the possibility for the loci to be in linkage disequilibrium. SLINK is used in the first step to simulate genotypes at the trait locus conditional on the observed trait values, and also to generate an indicator of the descent path of the simulated alleles. In the second step, marker alleles or haplotypes are generated in the founders, conditional on the trait locus genotypes simulated in the first step. Then the recombination process between the marker loci takes place conditionally on the descent path and on the trait locus genotypes. This two-step implementation is often computationally faster than other software that are designed to generate marker data linked to, and possibly associated with, a trait locus. CONCLUSION: Because the proposed method uses SLINK to simulate the segregation process, it benefits from its flexibility: the trait may be qualitative with the possibility of defining different liability classes (which allows for the simulation of gene-environment interactions or even the simulation of multi-locus effects between unlinked susceptibility regions) or it may be quantitative and normally distributed. In particular, this implementation is the only one available that can generate a large number of marker loci conditional on the set of observed quantitative trait values in pedigrees. BioMed Central 2006-07-03 /pmc/articles/PMC1524809/ /pubmed/16803631 http://dx.doi.org/10.1186/1471-2156-7-40 Text en Copyright © 2006 Lemire; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Lemire, Mathieu
SUP: an extension to SLINK to allow a larger number of marker loci to be simulated in pedigrees conditional on trait values
title SUP: an extension to SLINK to allow a larger number of marker loci to be simulated in pedigrees conditional on trait values
title_full SUP: an extension to SLINK to allow a larger number of marker loci to be simulated in pedigrees conditional on trait values
title_fullStr SUP: an extension to SLINK to allow a larger number of marker loci to be simulated in pedigrees conditional on trait values
title_full_unstemmed SUP: an extension to SLINK to allow a larger number of marker loci to be simulated in pedigrees conditional on trait values
title_short SUP: an extension to SLINK to allow a larger number of marker loci to be simulated in pedigrees conditional on trait values
title_sort sup: an extension to slink to allow a larger number of marker loci to be simulated in pedigrees conditional on trait values
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1524809/
https://www.ncbi.nlm.nih.gov/pubmed/16803631
http://dx.doi.org/10.1186/1471-2156-7-40
work_keys_str_mv AT lemiremathieu supanextensiontoslinktoallowalargernumberofmarkerlocitobesimulatedinpedigreesconditionalontraitvalues