Cargando…
Exact association test for small size sequencing data
BACKGROUND: Recent statistical methods for next generation sequencing (NGS) data have been successfully applied to identifying rare genetic variants associated with certain diseases. However, most commonly used methods (e.g., burden tests and variance-component tests) rely on large sample sizes. Not...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5918458/ https://www.ncbi.nlm.nih.gov/pubmed/29697368 http://dx.doi.org/10.1186/s12920-018-0344-z |
_version_ | 1783317420841107456 |
---|---|
author | Lee, Joowon Lee, Seungyeoun Jang, Jin-Young Park, Taesung |
author_facet | Lee, Joowon Lee, Seungyeoun Jang, Jin-Young Park, Taesung |
author_sort | Lee, Joowon |
collection | PubMed |
description | BACKGROUND: Recent statistical methods for next generation sequencing (NGS) data have been successfully applied to identifying rare genetic variants associated with certain diseases. However, most commonly used methods (e.g., burden tests and variance-component tests) rely on large sample sizes. Notwithstanding, due to its-still high cost, NGS data is generally restricted to small sample sizes, that cannot be analyzed by most existing methods. METHODS: In this work, we propose a new exact association test for sequencing data that does not require a large sample approximation, which is applicable to both common and rare variants. Our method, based on the Generalized Cochran-Mantel-Haenszel (GCMH) statistic, was applied to NGS datasets from intraductal papillary mucinous neoplasm (IPMN) patients. IPMN is a unique pancreatic cancer subtype that can turn into an invasive and hard-to-treat metastatic disease. RESULTS: Application of our method to IPMN data successfully identified susceptible genes associated with progression of IPMN to pancreatic cancer. CONCLUSIONS: Our method is expected to identify disease-associated genetic variants more successfully, and corresponding signal pathways, improving our understanding of specific disease’s etiology and prognosis. |
format | Online Article Text |
id | pubmed-5918458 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-59184582018-04-30 Exact association test for small size sequencing data Lee, Joowon Lee, Seungyeoun Jang, Jin-Young Park, Taesung BMC Med Genomics Research BACKGROUND: Recent statistical methods for next generation sequencing (NGS) data have been successfully applied to identifying rare genetic variants associated with certain diseases. However, most commonly used methods (e.g., burden tests and variance-component tests) rely on large sample sizes. Notwithstanding, due to its-still high cost, NGS data is generally restricted to small sample sizes, that cannot be analyzed by most existing methods. METHODS: In this work, we propose a new exact association test for sequencing data that does not require a large sample approximation, which is applicable to both common and rare variants. Our method, based on the Generalized Cochran-Mantel-Haenszel (GCMH) statistic, was applied to NGS datasets from intraductal papillary mucinous neoplasm (IPMN) patients. IPMN is a unique pancreatic cancer subtype that can turn into an invasive and hard-to-treat metastatic disease. RESULTS: Application of our method to IPMN data successfully identified susceptible genes associated with progression of IPMN to pancreatic cancer. CONCLUSIONS: Our method is expected to identify disease-associated genetic variants more successfully, and corresponding signal pathways, improving our understanding of specific disease’s etiology and prognosis. BioMed Central 2018-04-20 /pmc/articles/PMC5918458/ /pubmed/29697368 http://dx.doi.org/10.1186/s12920-018-0344-z Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Lee, Joowon Lee, Seungyeoun Jang, Jin-Young Park, Taesung Exact association test for small size sequencing data |
title | Exact association test for small size sequencing data |
title_full | Exact association test for small size sequencing data |
title_fullStr | Exact association test for small size sequencing data |
title_full_unstemmed | Exact association test for small size sequencing data |
title_short | Exact association test for small size sequencing data |
title_sort | exact association test for small size sequencing data |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5918458/ https://www.ncbi.nlm.nih.gov/pubmed/29697368 http://dx.doi.org/10.1186/s12920-018-0344-z |
work_keys_str_mv | AT leejoowon exactassociationtestforsmallsizesequencingdata AT leeseungyeoun exactassociationtestforsmallsizesequencingdata AT jangjinyoung exactassociationtestforsmallsizesequencingdata AT parktaesung exactassociationtestforsmallsizesequencingdata |