Cargando…

A Scale-Corrected Comparison of Linkage Disequilibrium Levels between Genic and Non-Genic Regions

The understanding of non-random association between loci, termed linkage disequilibrium (LD), plays a central role in genomic research. Since causal mutations are generally not included in genomic marker data, LD between those and available markers is essential for capturing the effects of causal lo...

Descripción completa

Detalles Bibliográficos
Autores principales: Berger, Swetlana, Schlather, Martin, de los Campos, Gustavo, Weigend, Steffen, Preisinger, Rudolf, Erbe, Malena, Simianer, Henner
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4627745/
https://www.ncbi.nlm.nih.gov/pubmed/26517830
http://dx.doi.org/10.1371/journal.pone.0141216
_version_ 1782398323988103168
author Berger, Swetlana
Schlather, Martin
de los Campos, Gustavo
Weigend, Steffen
Preisinger, Rudolf
Erbe, Malena
Simianer, Henner
author_facet Berger, Swetlana
Schlather, Martin
de los Campos, Gustavo
Weigend, Steffen
Preisinger, Rudolf
Erbe, Malena
Simianer, Henner
author_sort Berger, Swetlana
collection PubMed
description The understanding of non-random association between loci, termed linkage disequilibrium (LD), plays a central role in genomic research. Since causal mutations are generally not included in genomic marker data, LD between those and available markers is essential for capturing the effects of causal loci on localizing genes responsible for traits. Thus, the interpretation of association studies requires a detailed knowledge of LD patterns. It is well known that most LD measures depend on minor allele frequencies (MAF) of the considered loci and the magnitude of LD is influenced by the physical distances between loci. In the present study, a procedure to compare the LD structure between genomic regions comprising several markers each is suggested. The approach accounts for different scaling factors, namely the distribution of MAF, the distribution of pair-wise differences in MAF, and the physical extent of compared regions, reflected by the distribution of pair-wise physical distances. In the first step, genomic regions are matched based on similarity in these scaling factors. In the second step, chromosome- and genome-wide significance tests for differences in medians of LD measures in each pair are performed. The proposed framework was applied to test the hypothesis that the average LD is different in genic and non-genic regions. This was tested with a genome-wide approach with data sets for humans (Homo sapiens), a highly selected chicken line (Gallus gallus domesticus) and the model plant Arabidopsis thaliana. In all three data sets we found a significantly higher level of LD in genic regions compared to non-genic regions. About 31% more LD was detected genome-wide in genic compared to non-genic regions in Arabidopsis thaliana, followed by 13.6% in human and 6% chicken. Chromosome-wide comparison discovered significant differences on all 5 chromosomes in Arabidopsis thaliana and on one third of the human and of the chicken chromosomes.
format Online
Article
Text
id pubmed-4627745
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-46277452015-11-06 A Scale-Corrected Comparison of Linkage Disequilibrium Levels between Genic and Non-Genic Regions Berger, Swetlana Schlather, Martin de los Campos, Gustavo Weigend, Steffen Preisinger, Rudolf Erbe, Malena Simianer, Henner PLoS One Research Article The understanding of non-random association between loci, termed linkage disequilibrium (LD), plays a central role in genomic research. Since causal mutations are generally not included in genomic marker data, LD between those and available markers is essential for capturing the effects of causal loci on localizing genes responsible for traits. Thus, the interpretation of association studies requires a detailed knowledge of LD patterns. It is well known that most LD measures depend on minor allele frequencies (MAF) of the considered loci and the magnitude of LD is influenced by the physical distances between loci. In the present study, a procedure to compare the LD structure between genomic regions comprising several markers each is suggested. The approach accounts for different scaling factors, namely the distribution of MAF, the distribution of pair-wise differences in MAF, and the physical extent of compared regions, reflected by the distribution of pair-wise physical distances. In the first step, genomic regions are matched based on similarity in these scaling factors. In the second step, chromosome- and genome-wide significance tests for differences in medians of LD measures in each pair are performed. The proposed framework was applied to test the hypothesis that the average LD is different in genic and non-genic regions. This was tested with a genome-wide approach with data sets for humans (Homo sapiens), a highly selected chicken line (Gallus gallus domesticus) and the model plant Arabidopsis thaliana. In all three data sets we found a significantly higher level of LD in genic regions compared to non-genic regions. About 31% more LD was detected genome-wide in genic compared to non-genic regions in Arabidopsis thaliana, followed by 13.6% in human and 6% chicken. Chromosome-wide comparison discovered significant differences on all 5 chromosomes in Arabidopsis thaliana and on one third of the human and of the chicken chromosomes. Public Library of Science 2015-10-30 /pmc/articles/PMC4627745/ /pubmed/26517830 http://dx.doi.org/10.1371/journal.pone.0141216 Text en © 2015 Berger et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Berger, Swetlana
Schlather, Martin
de los Campos, Gustavo
Weigend, Steffen
Preisinger, Rudolf
Erbe, Malena
Simianer, Henner
A Scale-Corrected Comparison of Linkage Disequilibrium Levels between Genic and Non-Genic Regions
title A Scale-Corrected Comparison of Linkage Disequilibrium Levels between Genic and Non-Genic Regions
title_full A Scale-Corrected Comparison of Linkage Disequilibrium Levels between Genic and Non-Genic Regions
title_fullStr A Scale-Corrected Comparison of Linkage Disequilibrium Levels between Genic and Non-Genic Regions
title_full_unstemmed A Scale-Corrected Comparison of Linkage Disequilibrium Levels between Genic and Non-Genic Regions
title_short A Scale-Corrected Comparison of Linkage Disequilibrium Levels between Genic and Non-Genic Regions
title_sort scale-corrected comparison of linkage disequilibrium levels between genic and non-genic regions
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4627745/
https://www.ncbi.nlm.nih.gov/pubmed/26517830
http://dx.doi.org/10.1371/journal.pone.0141216
work_keys_str_mv AT bergerswetlana ascalecorrectedcomparisonoflinkagedisequilibriumlevelsbetweengenicandnongenicregions
AT schlathermartin ascalecorrectedcomparisonoflinkagedisequilibriumlevelsbetweengenicandnongenicregions
AT deloscamposgustavo ascalecorrectedcomparisonoflinkagedisequilibriumlevelsbetweengenicandnongenicregions
AT weigendsteffen ascalecorrectedcomparisonoflinkagedisequilibriumlevelsbetweengenicandnongenicregions
AT preisingerrudolf ascalecorrectedcomparisonoflinkagedisequilibriumlevelsbetweengenicandnongenicregions
AT erbemalena ascalecorrectedcomparisonoflinkagedisequilibriumlevelsbetweengenicandnongenicregions
AT simianerhenner ascalecorrectedcomparisonoflinkagedisequilibriumlevelsbetweengenicandnongenicregions
AT bergerswetlana scalecorrectedcomparisonoflinkagedisequilibriumlevelsbetweengenicandnongenicregions
AT schlathermartin scalecorrectedcomparisonoflinkagedisequilibriumlevelsbetweengenicandnongenicregions
AT deloscamposgustavo scalecorrectedcomparisonoflinkagedisequilibriumlevelsbetweengenicandnongenicregions
AT weigendsteffen scalecorrectedcomparisonoflinkagedisequilibriumlevelsbetweengenicandnongenicregions
AT preisingerrudolf scalecorrectedcomparisonoflinkagedisequilibriumlevelsbetweengenicandnongenicregions
AT erbemalena scalecorrectedcomparisonoflinkagedisequilibriumlevelsbetweengenicandnongenicregions
AT simianerhenner scalecorrectedcomparisonoflinkagedisequilibriumlevelsbetweengenicandnongenicregions