Cargando…

Using the longest significance run to estimate region-specific p-values in genetic association mapping studies

BACKGROUND: Association testing is a powerful tool for identifying disease susceptibility genes underlying complex diseases. Technological advances have yielded a dramatic increase in the density of available genetic markers, necessitating an increase in the number of association tests required for...

Descripción completa

Detalles Bibliográficos
Autores principales: Lian, Ie-Bin, Lin, Yi-Hsien, Lin, Ying-Chao, Yang, Hsin-Chou, Chang, Chee-Jang, Fann, Cathy SJ
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2430975/
https://www.ncbi.nlm.nih.gov/pubmed/18503718
http://dx.doi.org/10.1186/1471-2105-9-246
_version_ 1782156437791703040
author Lian, Ie-Bin
Lin, Yi-Hsien
Lin, Ying-Chao
Yang, Hsin-Chou
Chang, Chee-Jang
Fann, Cathy SJ
author_facet Lian, Ie-Bin
Lin, Yi-Hsien
Lin, Ying-Chao
Yang, Hsin-Chou
Chang, Chee-Jang
Fann, Cathy SJ
author_sort Lian, Ie-Bin
collection PubMed
description BACKGROUND: Association testing is a powerful tool for identifying disease susceptibility genes underlying complex diseases. Technological advances have yielded a dramatic increase in the density of available genetic markers, necessitating an increase in the number of association tests required for the analysis of disease susceptibility genes. As such, multiple-tests corrections have become a critical issue. However the conventional statistical corrections on locus-specific multiple tests usually result in lower power as the number of markers increases. Alternatively, we propose here the application of the longest significant run (LSR) method to estimate a region-specific p-value to provide an index for the most likely candidate region. RESULTS: An advantage of the LSR method relative to procedures based on genotypic data is that only p-value data are needed and hence can be applied extensively to different study designs. In this study the proposed LSR method was compared with commonly used methods such as Bonferroni's method and FDR controlling method. We found that while all methods provide good control over false positive rate, LSR has much better power and false discovery rate. In the authentic analysis on psoriasis and asthma disease data, the LSR method successfully identified important candidate regions and replicated the results of previous association studies. CONCLUSION: The proposed LSR method provides an efficient exploratory tool for the analysis of sequences of dense genetic markers. Our results show that the LSR method has better power and lower false discovery rate comparing with the locus-specific multiple tests.
format Text
id pubmed-2430975
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-24309752008-06-19 Using the longest significance run to estimate region-specific p-values in genetic association mapping studies Lian, Ie-Bin Lin, Yi-Hsien Lin, Ying-Chao Yang, Hsin-Chou Chang, Chee-Jang Fann, Cathy SJ BMC Bioinformatics Methodology Article BACKGROUND: Association testing is a powerful tool for identifying disease susceptibility genes underlying complex diseases. Technological advances have yielded a dramatic increase in the density of available genetic markers, necessitating an increase in the number of association tests required for the analysis of disease susceptibility genes. As such, multiple-tests corrections have become a critical issue. However the conventional statistical corrections on locus-specific multiple tests usually result in lower power as the number of markers increases. Alternatively, we propose here the application of the longest significant run (LSR) method to estimate a region-specific p-value to provide an index for the most likely candidate region. RESULTS: An advantage of the LSR method relative to procedures based on genotypic data is that only p-value data are needed and hence can be applied extensively to different study designs. In this study the proposed LSR method was compared with commonly used methods such as Bonferroni's method and FDR controlling method. We found that while all methods provide good control over false positive rate, LSR has much better power and false discovery rate. In the authentic analysis on psoriasis and asthma disease data, the LSR method successfully identified important candidate regions and replicated the results of previous association studies. CONCLUSION: The proposed LSR method provides an efficient exploratory tool for the analysis of sequences of dense genetic markers. Our results show that the LSR method has better power and lower false discovery rate comparing with the locus-specific multiple tests. BioMed Central 2008-05-27 /pmc/articles/PMC2430975/ /pubmed/18503718 http://dx.doi.org/10.1186/1471-2105-9-246 Text en Copyright © 2008 Lian et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Lian, Ie-Bin
Lin, Yi-Hsien
Lin, Ying-Chao
Yang, Hsin-Chou
Chang, Chee-Jang
Fann, Cathy SJ
Using the longest significance run to estimate region-specific p-values in genetic association mapping studies
title Using the longest significance run to estimate region-specific p-values in genetic association mapping studies
title_full Using the longest significance run to estimate region-specific p-values in genetic association mapping studies
title_fullStr Using the longest significance run to estimate region-specific p-values in genetic association mapping studies
title_full_unstemmed Using the longest significance run to estimate region-specific p-values in genetic association mapping studies
title_short Using the longest significance run to estimate region-specific p-values in genetic association mapping studies
title_sort using the longest significance run to estimate region-specific p-values in genetic association mapping studies
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2430975/
https://www.ncbi.nlm.nih.gov/pubmed/18503718
http://dx.doi.org/10.1186/1471-2105-9-246
work_keys_str_mv AT lianiebin usingthelongestsignificanceruntoestimateregionspecificpvaluesingeneticassociationmappingstudies
AT linyihsien usingthelongestsignificanceruntoestimateregionspecificpvaluesingeneticassociationmappingstudies
AT linyingchao usingthelongestsignificanceruntoestimateregionspecificpvaluesingeneticassociationmappingstudies
AT yanghsinchou usingthelongestsignificanceruntoestimateregionspecificpvaluesingeneticassociationmappingstudies
AT changcheejang usingthelongestsignificanceruntoestimateregionspecificpvaluesingeneticassociationmappingstudies
AT fanncathysj usingthelongestsignificanceruntoestimateregionspecificpvaluesingeneticassociationmappingstudies