WISARD: workbench for integrated superfast association studies for related datasets

BACKGROUND: A Mendelian transmission produces phenotypic and genetic relatedness between family members, giving family-based analytical methods an important role in genetic epidemiological studies—from heritability estimations to genetic association analyses. With the advance in genotyping technolog...

Bibliographic Details
Main authors: Lee, Sungyoung, Choi, Sungkyoung, Qiao, Dandi, Cho, Michael, Silverman, Edwin K., Park, Taesung, Won, Sungho
Format: Online Article Text
Language: English
Published: BioMed Central 2018
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5918457/
https://www.ncbi.nlm.nih.gov/pubmed/29697360
http://dx.doi.org/10.1186/s12920-018-0345-y
_version_ 1783317420603080704
author Lee, Sungyoung
Choi, Sungkyoung
Qiao, Dandi
Cho, Michael
Silverman, Edwin K.
Park, Taesung
Won, Sungho
author_facet Lee, Sungyoung
Choi, Sungkyoung
Qiao, Dandi
Cho, Michael
Silverman, Edwin K.
Park, Taesung
Won, Sungho
author_sort Lee, Sungyoung
collection PubMed
description BACKGROUND: Mendelian transmission produces phenotypic and genetic relatedness between family members, giving family-based analytical methods an important role in genetic epidemiological studies, from heritability estimation to genetic association analysis. With advances in genotyping technologies, whole-genome sequence data can be utilized for genetic epidemiological studies, and family-based samples may become more useful for detecting de novo mutations. However, genetic analyses employing family-based samples usually suffer from the complexity of the computational/statistical algorithms, and certain types of family designs, such as those incorporating data from extended families, have rarely been used. RESULTS: We present a Workbench for Integrated Superfast Association studies for Related Data (WISARD) programmed in C/C++. WISARD enables fast and comprehensive analysis of SNP-chip and next-generation sequencing data on extended families, with applications from designing genetic studies to summarizing analysis results. In addition, WISARD can automatically be run in a fully multithreaded manner, and the integration of R software for visualization makes it more accessible to non-experts. CONCLUSIONS: Comparison with existing toolsets showed that WISARD is computationally suitable for integrated analysis of related subjects and that it outperforms existing toolsets. WISARD has also been successfully used to analyze a large-scale sequencing dataset for chronic obstructive pulmonary disease (COPD), in which we identified multiple genes associated with COPD, demonstrating its practical value. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12920-018-0345-y) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5918457
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-59184572018-04-30 WISARD: workbench for integrated superfast association studies for related datasets Lee, Sungyoung Choi, Sungkyoung Qiao, Dandi Cho, Michael Silverman, Edwin K. Park, Taesung Won, Sungho BMC Med Genomics Research BACKGROUND: A Mendelian transmission produces phenotypic and genetic relatedness between family members, giving family-based analytical methods an important role in genetic epidemiological studies—from heritability estimations to genetic association analyses. With the advance in genotyping technologies, whole-genome sequence data can be utilized for genetic epidemiological studies, and family-based samples may become more useful for detecting de novo mutations. However, genetic analyses employing family-based samples usually suffer from the complexity of the computational/statistical algorithms, and certain types of family designs, such as incorporating data from extended families, have rarely been used. RESULTS: We present a Workbench for Integrated Superfast Association studies for Related Data (WISARD) programmed in C/C++. WISARD enables the fast and a comprehensive analysis of SNP-chip and next-generation sequencing data on extended families, with applications from designing genetic studies to summarizing analysis results. In addition, WISARD can automatically be run in a fully multithreaded manner, and the integration of R software for visualization makes it more accessible to non-experts. CONCLUSIONS: Comparison with existing toolsets showed that WISARD is computationally suitable for integrated analysis of related subjects, and demonstrated that WISARD outperforms existing toolsets. WISARD has also been successfully utilized to analyze the large-scale massive sequencing dataset of chronic obstructive pulmonary disease data (COPD), and we identified multiple genes associated with COPD, which demonstrates its practical value. 
ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12920-018-0345-y) contains supplementary material, which is available to authorized users. BioMed Central 2018-04-20 /pmc/articles/PMC5918457/ /pubmed/29697360 http://dx.doi.org/10.1186/s12920-018-0345-y Text en © The Author(s). 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Lee, Sungyoung
Choi, Sungkyoung
Qiao, Dandi
Cho, Michael
Silverman, Edwin K.
Park, Taesung
Won, Sungho
WISARD: workbench for integrated superfast association studies for related datasets
title WISARD: workbench for integrated superfast association studies for related datasets
title_full WISARD: workbench for integrated superfast association studies for related datasets
title_fullStr WISARD: workbench for integrated superfast association studies for related datasets
title_full_unstemmed WISARD: workbench for integrated superfast association studies for related datasets
title_short WISARD: workbench for integrated superfast association studies for related datasets
title_sort wisard: workbench for integrated superfast association studies for related datasets
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5918457/
https://www.ncbi.nlm.nih.gov/pubmed/29697360
http://dx.doi.org/10.1186/s12920-018-0345-y
work_keys_str_mv AT leesungyoung wisardworkbenchforintegratedsuperfastassociationstudiesforrelateddatasets
AT choisungkyoung wisardworkbenchforintegratedsuperfastassociationstudiesforrelateddatasets
AT qiaodandi wisardworkbenchforintegratedsuperfastassociationstudiesforrelateddatasets
AT chomichael wisardworkbenchforintegratedsuperfastassociationstudiesforrelateddatasets
AT silvermanedwink wisardworkbenchforintegratedsuperfastassociationstudiesforrelateddatasets
AT parktaesung wisardworkbenchforintegratedsuperfastassociationstudiesforrelateddatasets
AT wonsungho wisardworkbenchforintegratedsuperfastassociationstudiesforrelateddatasets