Cargando…
SUP: an extension to SLINK to allow a larger number of marker loci to be simulated in pedigrees conditional on trait values
BACKGROUND: With the recent advances in high-throughput genotyping technologies that allow for large-scale association mapping of human complex traits, promising statistical designs and methods have been emerging. Efficient simulation software are key elements for the evaluation of the properties of...
Autor principal: | |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2006
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1524809/ https://www.ncbi.nlm.nih.gov/pubmed/16803631 http://dx.doi.org/10.1186/1471-2156-7-40 |
_version_ | 1782128851053182976 |
---|---|
author | Lemire, Mathieu |
author_facet | Lemire, Mathieu |
author_sort | Lemire, Mathieu |
collection | PubMed |
description | BACKGROUND: With the recent advances in high-throughput genotyping technologies that allow for large-scale association mapping of human complex traits, promising statistical designs and methods have been emerging. Efficient simulation software are key elements for the evaluation of the properties of new statistical tests. SLINK is a flexible simulation tool that has been widely used to generate the segregation and recombination processes of markers linked to, and possibly associated with, a trait locus, conditional on trait values in arbitrary pedigrees. In practice, its most serious limitation is the small number of loci that can be simulated, since the complexity of the algorithm scales exponentially with this number. RESULTS: I describe the implementation of a two-step algorithm to be used in conjunction with SLINK to enable the simulation of a large number of marker loci linked to a trait locus and conditional on trait values in families, with the possibility for the loci to be in linkage disequilibrium. SLINK is used in the first step to simulate genotypes at the trait locus conditional on the observed trait values, and also to generate an indicator of the descent path of the simulated alleles. In the second step, marker alleles or haplotypes are generated in the founders, conditional on the trait locus genotypes simulated in the first step. Then the recombination process between the marker loci takes place conditionally on the descent path and on the trait locus genotypes. This two-step implementation is often computationally faster than other software that are designed to generate marker data linked to, and possibly associated with, a trait locus. CONCLUSION: Because the proposed method uses SLINK to simulate the segregation process, it benefits from its flexibility: the trait may be qualitative with the possibility of defining different liability classes (which allows for the simulation of gene-environment interactions or even the simulation of multi-locus effects between unlinked susceptibility regions) or it may be quantitative and normally distributed. In particular, this implementation is the only one available that can generate a large number of marker loci conditional on the set of observed quantitative trait values in pedigrees. |
format | Text |
id | pubmed-1524809 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2006 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-15248092006-07-29 SUP: an extension to SLINK to allow a larger number of marker loci to be simulated in pedigrees conditional on trait values Lemire, Mathieu BMC Genet Software BACKGROUND: With the recent advances in high-throughput genotyping technologies that allow for large-scale association mapping of human complex traits, promising statistical designs and methods have been emerging. Efficient simulation software are key elements for the evaluation of the properties of new statistical tests. SLINK is a flexible simulation tool that has been widely used to generate the segregation and recombination processes of markers linked to, and possibly associated with, a trait locus, conditional on trait values in arbitrary pedigrees. In practice, its most serious limitation is the small number of loci that can be simulated, since the complexity of the algorithm scales exponentially with this number. RESULTS: I describe the implementation of a two-step algorithm to be used in conjunction with SLINK to enable the simulation of a large number of marker loci linked to a trait locus and conditional on trait values in families, with the possibility for the loci to be in linkage disequilibrium. SLINK is used in the first step to simulate genotypes at the trait locus conditional on the observed trait values, and also to generate an indicator of the descent path of the simulated alleles. In the second step, marker alleles or haplotypes are generated in the founders, conditional on the trait locus genotypes simulated in the first step. Then the recombination process between the marker loci takes place conditionally on the descent path and on the trait locus genotypes. This two-step implementation is often computationally faster than other software that are designed to generate marker data linked to, and possibly associated with, a trait locus. CONCLUSION: Because the proposed method uses SLINK to simulate the segregation process, it benefits from its flexibility: the trait may be qualitative with the possibility of defining different liability classes (which allows for the simulation of gene-environment interactions or even the simulation of multi-locus effects between unlinked susceptibility regions) or it may be quantitative and normally distributed. In particular, this implementation is the only one available that can generate a large number of marker loci conditional on the set of observed quantitative trait values in pedigrees. BioMed Central 2006-07-03 /pmc/articles/PMC1524809/ /pubmed/16803631 http://dx.doi.org/10.1186/1471-2156-7-40 Text en Copyright © 2006 Lemire; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Software Lemire, Mathieu SUP: an extension to SLINK to allow a larger number of marker loci to be simulated in pedigrees conditional on trait values |
title | SUP: an extension to SLINK to allow a larger number of marker loci to be simulated in pedigrees conditional on trait values |
title_full | SUP: an extension to SLINK to allow a larger number of marker loci to be simulated in pedigrees conditional on trait values |
title_fullStr | SUP: an extension to SLINK to allow a larger number of marker loci to be simulated in pedigrees conditional on trait values |
title_full_unstemmed | SUP: an extension to SLINK to allow a larger number of marker loci to be simulated in pedigrees conditional on trait values |
title_short | SUP: an extension to SLINK to allow a larger number of marker loci to be simulated in pedigrees conditional on trait values |
title_sort | sup: an extension to slink to allow a larger number of marker loci to be simulated in pedigrees conditional on trait values |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1524809/ https://www.ncbi.nlm.nih.gov/pubmed/16803631 http://dx.doi.org/10.1186/1471-2156-7-40 |
work_keys_str_mv | AT lemiremathieu supanextensiontoslinktoallowalargernumberofmarkerlocitobesimulatedinpedigreesconditionalontraitvalues |