Cargando…

Exact association test for small size sequencing data

BACKGROUND: Recent statistical methods for next generation sequencing (NGS) data have been successfully applied to identifying rare genetic variants associated with certain diseases. However, most commonly used methods (e.g., burden tests and variance-component tests) rely on large sample sizes. Not...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Joowon, Lee, Seungyeoun, Jang, Jin-Young, Park, Taesung
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5918458/
https://www.ncbi.nlm.nih.gov/pubmed/29697368
http://dx.doi.org/10.1186/s12920-018-0344-z
_version_ 1783317420841107456
author Lee, Joowon
Lee, Seungyeoun
Jang, Jin-Young
Park, Taesung
author_facet Lee, Joowon
Lee, Seungyeoun
Jang, Jin-Young
Park, Taesung
author_sort Lee, Joowon
collection PubMed
description BACKGROUND: Recent statistical methods for next generation sequencing (NGS) data have been successfully applied to identifying rare genetic variants associated with certain diseases. However, most commonly used methods (e.g., burden tests and variance-component tests) rely on large sample sizes. Notwithstanding, due to its-still high cost, NGS data is generally restricted to small sample sizes, that cannot be analyzed by most existing methods. METHODS: In this work, we propose a new exact association test for sequencing data that does not require a large sample approximation, which is applicable to both common and rare variants. Our method, based on the Generalized Cochran-Mantel-Haenszel (GCMH) statistic, was applied to NGS datasets from intraductal papillary mucinous neoplasm (IPMN) patients. IPMN is a unique pancreatic cancer subtype that can turn into an invasive and hard-to-treat metastatic disease. RESULTS: Application of our method to IPMN data successfully identified susceptible genes associated with progression of IPMN to pancreatic cancer. CONCLUSIONS: Our method is expected to identify disease-associated genetic variants more successfully, and corresponding signal pathways, improving our understanding of specific disease’s etiology and prognosis.
format Online
Article
Text
id pubmed-5918458
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-59184582018-04-30 Exact association test for small size sequencing data Lee, Joowon Lee, Seungyeoun Jang, Jin-Young Park, Taesung BMC Med Genomics Research BACKGROUND: Recent statistical methods for next generation sequencing (NGS) data have been successfully applied to identifying rare genetic variants associated with certain diseases. However, most commonly used methods (e.g., burden tests and variance-component tests) rely on large sample sizes. Notwithstanding, due to its-still high cost, NGS data is generally restricted to small sample sizes, that cannot be analyzed by most existing methods. METHODS: In this work, we propose a new exact association test for sequencing data that does not require a large sample approximation, which is applicable to both common and rare variants. Our method, based on the Generalized Cochran-Mantel-Haenszel (GCMH) statistic, was applied to NGS datasets from intraductal papillary mucinous neoplasm (IPMN) patients. IPMN is a unique pancreatic cancer subtype that can turn into an invasive and hard-to-treat metastatic disease. RESULTS: Application of our method to IPMN data successfully identified susceptible genes associated with progression of IPMN to pancreatic cancer. CONCLUSIONS: Our method is expected to identify disease-associated genetic variants more successfully, and corresponding signal pathways, improving our understanding of specific disease’s etiology and prognosis. BioMed Central 2018-04-20 /pmc/articles/PMC5918458/ /pubmed/29697368 http://dx.doi.org/10.1186/s12920-018-0344-z Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Lee, Joowon
Lee, Seungyeoun
Jang, Jin-Young
Park, Taesung
Exact association test for small size sequencing data
title Exact association test for small size sequencing data
title_full Exact association test for small size sequencing data
title_fullStr Exact association test for small size sequencing data
title_full_unstemmed Exact association test for small size sequencing data
title_short Exact association test for small size sequencing data
title_sort exact association test for small size sequencing data
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5918458/
https://www.ncbi.nlm.nih.gov/pubmed/29697368
http://dx.doi.org/10.1186/s12920-018-0344-z
work_keys_str_mv AT leejoowon exactassociationtestforsmallsizesequencingdata
AT leeseungyeoun exactassociationtestforsmallsizesequencingdata
AT jangjinyoung exactassociationtestforsmallsizesequencingdata
AT parktaesung exactassociationtestforsmallsizesequencingdata