Cargando…

SeqSIMLA: a sequence and phenotype simulation tool for complex disease studies

BACKGROUND: Association studies based on next-generation sequencing (NGS) technology have become popular, and statistical association tests for NGS data have been developed rapidly. A flexible tool for simulating sequence data in either unrelated case–control or family samples with different disease...

Descripción completa

Detalles Bibliográficos
Autores principales: Chung, Ren-Hua, Shih, Chung-Chin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3693898/
https://www.ncbi.nlm.nih.gov/pubmed/23782512
http://dx.doi.org/10.1186/1471-2105-14-199
_version_ 1782274768672653312
author Chung, Ren-Hua
Shih, Chung-Chin
author_facet Chung, Ren-Hua
Shih, Chung-Chin
author_sort Chung, Ren-Hua
collection PubMed
description BACKGROUND: Association studies based on next-generation sequencing (NGS) technology have become popular, and statistical association tests for NGS data have been developed rapidly. A flexible tool for simulating sequence data in either unrelated case–control or family samples with different disease and quantitative trait models would be useful for evaluating the statistical power for planning a study design and for comparing power among statistical methods based on NGS data. RESULTS: We developed a simulation tool, SeqSIMLA, which can simulate sequence data with user-specified disease and quantitative trait models. We implemented two disease models, in which the user can flexibly specify the number of disease loci, effect sizes or population attributable risk, disease prevalence, and risk or protective loci. We also implemented a quantitative trait model, in which the user can specify the number of quantitative trait loci (QTL), proportions of variance explained by the QTL, and genetic models. We compiled recombination rates from the HapMap project so that genomic structures similar to the real data can be simulated. CONCLUSIONS: SeqSIMLA can efficiently simulate sequence data with disease or quantitative trait models specified by the user. SeqSIMLA will be very useful for evaluating statistical properties for new study designs and new statistical methods using NGS. SeqSIMLA can be downloaded for free at http://seqsimla.sourceforge.net.
format Online
Article
Text
id pubmed-3693898
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-36938982013-06-28 SeqSIMLA: a sequence and phenotype simulation tool for complex disease studies Chung, Ren-Hua Shih, Chung-Chin BMC Bioinformatics Software BACKGROUND: Association studies based on next-generation sequencing (NGS) technology have become popular, and statistical association tests for NGS data have been developed rapidly. A flexible tool for simulating sequence data in either unrelated case–control or family samples with different disease and quantitative trait models would be useful for evaluating the statistical power for planning a study design and for comparing power among statistical methods based on NGS data. RESULTS: We developed a simulation tool, SeqSIMLA, which can simulate sequence data with user-specified disease and quantitative trait models. We implemented two disease models, in which the user can flexibly specify the number of disease loci, effect sizes or population attributable risk, disease prevalence, and risk or protective loci. We also implemented a quantitative trait model, in which the user can specify the number of quantitative trait loci (QTL), proportions of variance explained by the QTL, and genetic models. We compiled recombination rates from the HapMap project so that genomic structures similar to the real data can be simulated. CONCLUSIONS: SeqSIMLA can efficiently simulate sequence data with disease or quantitative trait models specified by the user. SeqSIMLA will be very useful for evaluating statistical properties for new study designs and new statistical methods using NGS. SeqSIMLA can be downloaded for free at http://seqsimla.sourceforge.net. BioMed Central 2013-06-20 /pmc/articles/PMC3693898/ /pubmed/23782512 http://dx.doi.org/10.1186/1471-2105-14-199 Text en Copyright © 2013 Chung and Shih; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Chung, Ren-Hua
Shih, Chung-Chin
SeqSIMLA: a sequence and phenotype simulation tool for complex disease studies
title SeqSIMLA: a sequence and phenotype simulation tool for complex disease studies
title_full SeqSIMLA: a sequence and phenotype simulation tool for complex disease studies
title_fullStr SeqSIMLA: a sequence and phenotype simulation tool for complex disease studies
title_full_unstemmed SeqSIMLA: a sequence and phenotype simulation tool for complex disease studies
title_short SeqSIMLA: a sequence and phenotype simulation tool for complex disease studies
title_sort seqsimla: a sequence and phenotype simulation tool for complex disease studies
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3693898/
https://www.ncbi.nlm.nih.gov/pubmed/23782512
http://dx.doi.org/10.1186/1471-2105-14-199
work_keys_str_mv AT chungrenhua seqsimlaasequenceandphenotypesimulationtoolforcomplexdiseasestudies
AT shihchungchin seqsimlaasequenceandphenotypesimulationtoolforcomplexdiseasestudies