Cargando…

Identifying influential regions in extremely rare variants using a fixed-bin approach

In this study, we analyze the Genetic Analysis Workshop 17 data to identify regions of single-nucleotide polymorphisms (SNPs) that exhibit a significant influence on response rate (proportion of subjects with an affirmative affected status), called the affected ratio, among rare variants. Under the...

Descripción completa

Detalles Bibliográficos
Autores principales: Agne, Michael, Huang, Chien-Hsun, Hu, Inchi, Wang, Haitian, Zheng, Tian, Lo, Shaw-Hwa
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287865/
https://www.ncbi.nlm.nih.gov/pubmed/22373412
http://dx.doi.org/10.1186/1753-6561-5-S9-S3
_version_ 1782224761218138112
author Agne, Michael
Huang, Chien-Hsun
Hu, Inchi
Wang, Haitian
Zheng, Tian
Lo, Shaw-Hwa
author_facet Agne, Michael
Huang, Chien-Hsun
Hu, Inchi
Wang, Haitian
Zheng, Tian
Lo, Shaw-Hwa
author_sort Agne, Michael
collection PubMed
description In this study, we analyze the Genetic Analysis Workshop 17 data to identify regions of single-nucleotide polymorphisms (SNPs) that exhibit a significant influence on response rate (proportion of subjects with an affirmative affected status), called the affected ratio, among rare variants. Under the null hypothesis, the distribution of rare variants is assumed to be uniform over case (affected) and control (unaffected) subjects. We attempt to pinpoint regions where the composition is significantly different between case and control events, specifically where there are unusually high numbers of rare variants among affected subjects. We focus on private variants, which require a degree of “collapsing” to combine information over several SNPs, to obtain meaningful results. Instead of implementing a gene-based approach, where regions would vary in size and sometimes be too small to achieve a strong enough signal, we implement a fixed-bin approach, with a preset number of SNPs per region, relying on the assumption that proximity and similarity go hand in hand. Through application of 100-SNP and 30-SNP fixed bins, we identify several most influential regions, which later are seen to contain some of the causal SNPs. The 100- and 30-SNP approaches detected seven and three causal SNPs among the most significant regions, respectively, with two overlapping SNPs located in the ELAVL4 gene, reported by both procedures.
format Online
Article
Text
id pubmed-3287865
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32878652012-02-28 Identifying influential regions in extremely rare variants using a fixed-bin approach Agne, Michael Huang, Chien-Hsun Hu, Inchi Wang, Haitian Zheng, Tian Lo, Shaw-Hwa BMC Proc Proceedings In this study, we analyze the Genetic Analysis Workshop 17 data to identify regions of single-nucleotide polymorphisms (SNPs) that exhibit a significant influence on response rate (proportion of subjects with an affirmative affected status), called the affected ratio, among rare variants. Under the null hypothesis, the distribution of rare variants is assumed to be uniform over case (affected) and control (unaffected) subjects. We attempt to pinpoint regions where the composition is significantly different between case and control events, specifically where there are unusually high numbers of rare variants among affected subjects. We focus on private variants, which require a degree of “collapsing” to combine information over several SNPs, to obtain meaningful results. Instead of implementing a gene-based approach, where regions would vary in size and sometimes be too small to achieve a strong enough signal, we implement a fixed-bin approach, with a preset number of SNPs per region, relying on the assumption that proximity and similarity go hand in hand. Through application of 100-SNP and 30-SNP fixed bins, we identify several most influential regions, which later are seen to contain some of the causal SNPs. The 100- and 30-SNP approaches detected seven and three causal SNPs among the most significant regions, respectively, with two overlapping SNPs located in the ELAVL4 gene, reported by both procedures. BioMed Central 2011-11-29 /pmc/articles/PMC3287865/ /pubmed/22373412 http://dx.doi.org/10.1186/1753-6561-5-S9-S3 Text en Copyright ©2011 Agne et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Agne, Michael
Huang, Chien-Hsun
Hu, Inchi
Wang, Haitian
Zheng, Tian
Lo, Shaw-Hwa
Identifying influential regions in extremely rare variants using a fixed-bin approach
title Identifying influential regions in extremely rare variants using a fixed-bin approach
title_full Identifying influential regions in extremely rare variants using a fixed-bin approach
title_fullStr Identifying influential regions in extremely rare variants using a fixed-bin approach
title_full_unstemmed Identifying influential regions in extremely rare variants using a fixed-bin approach
title_short Identifying influential regions in extremely rare variants using a fixed-bin approach
title_sort identifying influential regions in extremely rare variants using a fixed-bin approach
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287865/
https://www.ncbi.nlm.nih.gov/pubmed/22373412
http://dx.doi.org/10.1186/1753-6561-5-S9-S3
work_keys_str_mv AT agnemichael identifyinginfluentialregionsinextremelyrarevariantsusingafixedbinapproach
AT huangchienhsun identifyinginfluentialregionsinextremelyrarevariantsusingafixedbinapproach
AT huinchi identifyinginfluentialregionsinextremelyrarevariantsusingafixedbinapproach
AT wanghaitian identifyinginfluentialregionsinextremelyrarevariantsusingafixedbinapproach
AT zhengtian identifyinginfluentialregionsinextremelyrarevariantsusingafixedbinapproach
AT loshawhwa identifyinginfluentialregionsinextremelyrarevariantsusingafixedbinapproach