Cargando…

An entropy test for single-locus genetic association analysis

BACKGROUND: The etiology of complex diseases is due to the combination of genetic and environmental factors, usually many of them, and each with a small effect. The identification of these small-effect contributing factors is still a demanding task. Clearly, there is a need for more powerful tests o...

Descripción completa

Detalles Bibliográficos
Autores principales: Ruiz-Marín, Manuel, Matilla-García, Mariano, Cordoba, José Antonio García, Susillo-González, Juan Luis, Romo-Astorga, Alejandro, González-Pérez, Antonio, Ruiz, Agustín, Gayán, Javier
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2860340/
https://www.ncbi.nlm.nih.gov/pubmed/20331859
http://dx.doi.org/10.1186/1471-2156-11-19
_version_ 1782180569999736832
author Ruiz-Marín, Manuel
Matilla-García, Mariano
Cordoba, José Antonio García
Susillo-González, Juan Luis
Romo-Astorga, Alejandro
González-Pérez, Antonio
Ruiz, Agustín
Gayán, Javier
author_facet Ruiz-Marín, Manuel
Matilla-García, Mariano
Cordoba, José Antonio García
Susillo-González, Juan Luis
Romo-Astorga, Alejandro
González-Pérez, Antonio
Ruiz, Agustín
Gayán, Javier
author_sort Ruiz-Marín, Manuel
collection PubMed
description BACKGROUND: The etiology of complex diseases is due to the combination of genetic and environmental factors, usually many of them, and each with a small effect. The identification of these small-effect contributing factors is still a demanding task. Clearly, there is a need for more powerful tests of genetic association, and especially for the identification of rare effects RESULTS: We introduce a new genetic association test based on symbolic dynamics and symbolic entropy. Using a freely available software, we have applied this entropy test, and a conventional test, to simulated and real datasets, to illustrate the method and estimate type I error and power. We have also compared this new entropy test to the Fisher exact test for assessment of association with low-frequency SNPs. The entropy test is generally more powerful than the conventional test, and can be significantly more powerful when the genotypic test is applied to low allele-frequency markers. We have also shown that both the Fisher and Entropy methods are optimal to test for association with low-frequency SNPs (MAF around 1-5%), and both are conservative for very rare SNPs (MAF<1%) CONCLUSIONS: We have developed a new, simple, consistent and powerful test to detect genetic association of biallelic/SNP markers in case-control data, by using symbolic dynamics and symbolic entropy as a measure of gene dependence. We also provide a standard asymptotic distribution of this test statistic. Given that the test is based on entropy measures, it avoids smoothed nonparametric estimation. The entropy test is generally as good or even more powerful than the conventional and Fisher tests. Furthermore, the entropy test is more computationally efficient than the Fisher's Exact test, especially for large number of markers. Therefore, this entropy-based test has the advantage of being optimal for most SNPs, regardless of their allele frequency (Minor Allele Frequency (MAF) between 1-50%). This property is quite beneficial, since many researchers tend to discard low allele-frequency SNPs from their analysis. Now they can apply the same statistical test of association to all SNPs in a single analysis., which can be especially helpful to detect rare effects.
format Text
id pubmed-2860340
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28603402010-04-28 An entropy test for single-locus genetic association analysis Ruiz-Marín, Manuel Matilla-García, Mariano Cordoba, José Antonio García Susillo-González, Juan Luis Romo-Astorga, Alejandro González-Pérez, Antonio Ruiz, Agustín Gayán, Javier BMC Genet Methodology article BACKGROUND: The etiology of complex diseases is due to the combination of genetic and environmental factors, usually many of them, and each with a small effect. The identification of these small-effect contributing factors is still a demanding task. Clearly, there is a need for more powerful tests of genetic association, and especially for the identification of rare effects RESULTS: We introduce a new genetic association test based on symbolic dynamics and symbolic entropy. Using a freely available software, we have applied this entropy test, and a conventional test, to simulated and real datasets, to illustrate the method and estimate type I error and power. We have also compared this new entropy test to the Fisher exact test for assessment of association with low-frequency SNPs. The entropy test is generally more powerful than the conventional test, and can be significantly more powerful when the genotypic test is applied to low allele-frequency markers. We have also shown that both the Fisher and Entropy methods are optimal to test for association with low-frequency SNPs (MAF around 1-5%), and both are conservative for very rare SNPs (MAF<1%) CONCLUSIONS: We have developed a new, simple, consistent and powerful test to detect genetic association of biallelic/SNP markers in case-control data, by using symbolic dynamics and symbolic entropy as a measure of gene dependence. We also provide a standard asymptotic distribution of this test statistic. Given that the test is based on entropy measures, it avoids smoothed nonparametric estimation. The entropy test is generally as good or even more powerful than the conventional and Fisher tests. Furthermore, the entropy test is more computationally efficient than the Fisher's Exact test, especially for large number of markers. Therefore, this entropy-based test has the advantage of being optimal for most SNPs, regardless of their allele frequency (Minor Allele Frequency (MAF) between 1-50%). This property is quite beneficial, since many researchers tend to discard low allele-frequency SNPs from their analysis. Now they can apply the same statistical test of association to all SNPs in a single analysis., which can be especially helpful to detect rare effects. BioMed Central 2010-03-23 /pmc/articles/PMC2860340/ /pubmed/20331859 http://dx.doi.org/10.1186/1471-2156-11-19 Text en Copyright ©2010 Ruiz-Marín et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology article
Ruiz-Marín, Manuel
Matilla-García, Mariano
Cordoba, José Antonio García
Susillo-González, Juan Luis
Romo-Astorga, Alejandro
González-Pérez, Antonio
Ruiz, Agustín
Gayán, Javier
An entropy test for single-locus genetic association analysis
title An entropy test for single-locus genetic association analysis
title_full An entropy test for single-locus genetic association analysis
title_fullStr An entropy test for single-locus genetic association analysis
title_full_unstemmed An entropy test for single-locus genetic association analysis
title_short An entropy test for single-locus genetic association analysis
title_sort entropy test for single-locus genetic association analysis
topic Methodology article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2860340/
https://www.ncbi.nlm.nih.gov/pubmed/20331859
http://dx.doi.org/10.1186/1471-2156-11-19
work_keys_str_mv AT ruizmarinmanuel anentropytestforsinglelocusgeneticassociationanalysis
AT matillagarciamariano anentropytestforsinglelocusgeneticassociationanalysis
AT cordobajoseantoniogarcia anentropytestforsinglelocusgeneticassociationanalysis
AT susillogonzalezjuanluis anentropytestforsinglelocusgeneticassociationanalysis
AT romoastorgaalejandro anentropytestforsinglelocusgeneticassociationanalysis
AT gonzalezperezantonio anentropytestforsinglelocusgeneticassociationanalysis
AT ruizagustin anentropytestforsinglelocusgeneticassociationanalysis
AT gayanjavier anentropytestforsinglelocusgeneticassociationanalysis
AT ruizmarinmanuel entropytestforsinglelocusgeneticassociationanalysis
AT matillagarciamariano entropytestforsinglelocusgeneticassociationanalysis
AT cordobajoseantoniogarcia entropytestforsinglelocusgeneticassociationanalysis
AT susillogonzalezjuanluis entropytestforsinglelocusgeneticassociationanalysis
AT romoastorgaalejandro entropytestforsinglelocusgeneticassociationanalysis
AT gonzalezperezantonio entropytestforsinglelocusgeneticassociationanalysis
AT ruizagustin entropytestforsinglelocusgeneticassociationanalysis
AT gayanjavier entropytestforsinglelocusgeneticassociationanalysis