Cargando…
Identifying influential regions in extremely rare variants using a fixed-bin approach
In this study, we analyze the Genetic Analysis Workshop 17 data to identify regions of single-nucleotide polymorphisms (SNPs) that exhibit a significant influence on response rate (proportion of subjects with an affirmative affected status), called the affected ratio, among rare variants. Under the...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287865/ https://www.ncbi.nlm.nih.gov/pubmed/22373412 http://dx.doi.org/10.1186/1753-6561-5-S9-S3 |
_version_ | 1782224761218138112 |
---|---|
author | Agne, Michael Huang, Chien-Hsun Hu, Inchi Wang, Haitian Zheng, Tian Lo, Shaw-Hwa |
author_facet | Agne, Michael Huang, Chien-Hsun Hu, Inchi Wang, Haitian Zheng, Tian Lo, Shaw-Hwa |
author_sort | Agne, Michael |
collection | PubMed |
description | In this study, we analyze the Genetic Analysis Workshop 17 data to identify regions of single-nucleotide polymorphisms (SNPs) that exhibit a significant influence on response rate (proportion of subjects with an affirmative affected status), called the affected ratio, among rare variants. Under the null hypothesis, the distribution of rare variants is assumed to be uniform over case (affected) and control (unaffected) subjects. We attempt to pinpoint regions where the composition is significantly different between case and control events, specifically where there are unusually high numbers of rare variants among affected subjects. We focus on private variants, which require a degree of “collapsing” to combine information over several SNPs, to obtain meaningful results. Instead of implementing a gene-based approach, where regions would vary in size and sometimes be too small to achieve a strong enough signal, we implement a fixed-bin approach, with a preset number of SNPs per region, relying on the assumption that proximity and similarity go hand in hand. Through application of 100-SNP and 30-SNP fixed bins, we identify several most influential regions, which later are seen to contain some of the causal SNPs. The 100- and 30-SNP approaches detected seven and three causal SNPs among the most significant regions, respectively, with two overlapping SNPs located in the ELAVL4 gene, reported by both procedures. |
format | Online Article Text |
id | pubmed-3287865 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-32878652012-02-28 Identifying influential regions in extremely rare variants using a fixed-bin approach Agne, Michael Huang, Chien-Hsun Hu, Inchi Wang, Haitian Zheng, Tian Lo, Shaw-Hwa BMC Proc Proceedings In this study, we analyze the Genetic Analysis Workshop 17 data to identify regions of single-nucleotide polymorphisms (SNPs) that exhibit a significant influence on response rate (proportion of subjects with an affirmative affected status), called the affected ratio, among rare variants. Under the null hypothesis, the distribution of rare variants is assumed to be uniform over case (affected) and control (unaffected) subjects. We attempt to pinpoint regions where the composition is significantly different between case and control events, specifically where there are unusually high numbers of rare variants among affected subjects. We focus on private variants, which require a degree of “collapsing” to combine information over several SNPs, to obtain meaningful results. Instead of implementing a gene-based approach, where regions would vary in size and sometimes be too small to achieve a strong enough signal, we implement a fixed-bin approach, with a preset number of SNPs per region, relying on the assumption that proximity and similarity go hand in hand. Through application of 100-SNP and 30-SNP fixed bins, we identify several most influential regions, which later are seen to contain some of the causal SNPs. The 100- and 30-SNP approaches detected seven and three causal SNPs among the most significant regions, respectively, with two overlapping SNPs located in the ELAVL4 gene, reported by both procedures. BioMed Central 2011-11-29 /pmc/articles/PMC3287865/ /pubmed/22373412 http://dx.doi.org/10.1186/1753-6561-5-S9-S3 Text en Copyright ©2011 Agne et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Agne, Michael Huang, Chien-Hsun Hu, Inchi Wang, Haitian Zheng, Tian Lo, Shaw-Hwa Identifying influential regions in extremely rare variants using a fixed-bin approach |
title | Identifying influential regions in extremely rare variants using a fixed-bin approach |
title_full | Identifying influential regions in extremely rare variants using a fixed-bin approach |
title_fullStr | Identifying influential regions in extremely rare variants using a fixed-bin approach |
title_full_unstemmed | Identifying influential regions in extremely rare variants using a fixed-bin approach |
title_short | Identifying influential regions in extremely rare variants using a fixed-bin approach |
title_sort | identifying influential regions in extremely rare variants using a fixed-bin approach |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287865/ https://www.ncbi.nlm.nih.gov/pubmed/22373412 http://dx.doi.org/10.1186/1753-6561-5-S9-S3 |
work_keys_str_mv | AT agnemichael identifyinginfluentialregionsinextremelyrarevariantsusingafixedbinapproach AT huangchienhsun identifyinginfluentialregionsinextremelyrarevariantsusingafixedbinapproach AT huinchi identifyinginfluentialregionsinextremelyrarevariantsusingafixedbinapproach AT wanghaitian identifyinginfluentialregionsinextremelyrarevariantsusingafixedbinapproach AT zhengtian identifyinginfluentialregionsinextremelyrarevariantsusingafixedbinapproach AT loshawhwa identifyinginfluentialregionsinextremelyrarevariantsusingafixedbinapproach |