Cargando…
A Proximity-Based Method to Identify Genomic Regions Correlated with a Continuously Varying Environmental Variable
Knowledge of markers in the human genome which show spatial patterns and display extreme correlation with different environmental determinants play an important role in understanding the factors which affect the biological evolution of our species. We used the genotype data of more than half a milli...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Libertas Academica
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3565544/ https://www.ncbi.nlm.nih.gov/pubmed/23423242 http://dx.doi.org/10.4137/EBO.S10211 |
_version_ | 1782258460000256000 |
---|---|
author | Di Gaetano, Cornelia Matullo, Giuseppe Piazza, Alberto Ursino, Moreno Gasparini, Mauro |
author_facet | Di Gaetano, Cornelia Matullo, Giuseppe Piazza, Alberto Ursino, Moreno Gasparini, Mauro |
author_sort | Di Gaetano, Cornelia |
collection | PubMed |
description | Knowledge of markers in the human genome which show spatial patterns and display extreme correlation with different environmental determinants play an important role in understanding the factors which affect the biological evolution of our species. We used the genotype data of more than half a million single nucleotide polymorphisms (SNPs) from the data set Human Genome Diversity Panel (HGDP-CEPH -CEPH) and we calculated Spearman’s correlation between absolute latitude and one of the two allele frequencies of each SNP. We selected SNPs with a correlation coefficient within the upper 1% tail of the distribution. We then used a criterion of proximity between significant variants to focus on DNA regions showing a continuous signal over a portion of the genome. Based on external information and genome annotations, we demonstrated that most regions with the strongest signals also have biological relevance. We believe this proximity requirement adds an edge to our novel method compared to the existing literature, highlighting several genes (for example DTNB, DOT1L, TPCN2, RELN, MSRA, NRG3) related to body size or shape, human height, hair color, and schizophrenia. Our approach can be applied generally to any measure of association between polymorphic frequencies and continuously varying environmental variables. |
format | Online Article Text |
id | pubmed-3565544 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Libertas Academica |
record_format | MEDLINE/PubMed |
spelling | pubmed-35655442013-02-19 A Proximity-Based Method to Identify Genomic Regions Correlated with a Continuously Varying Environmental Variable Di Gaetano, Cornelia Matullo, Giuseppe Piazza, Alberto Ursino, Moreno Gasparini, Mauro Evol Bioinform Online Methodology Knowledge of markers in the human genome which show spatial patterns and display extreme correlation with different environmental determinants play an important role in understanding the factors which affect the biological evolution of our species. We used the genotype data of more than half a million single nucleotide polymorphisms (SNPs) from the data set Human Genome Diversity Panel (HGDP-CEPH -CEPH) and we calculated Spearman’s correlation between absolute latitude and one of the two allele frequencies of each SNP. We selected SNPs with a correlation coefficient within the upper 1% tail of the distribution. We then used a criterion of proximity between significant variants to focus on DNA regions showing a continuous signal over a portion of the genome. Based on external information and genome annotations, we demonstrated that most regions with the strongest signals also have biological relevance. We believe this proximity requirement adds an edge to our novel method compared to the existing literature, highlighting several genes (for example DTNB, DOT1L, TPCN2, RELN, MSRA, NRG3) related to body size or shape, human height, hair color, and schizophrenia. Our approach can be applied generally to any measure of association between polymorphic frequencies and continuously varying environmental variables. Libertas Academica 2013-01-29 /pmc/articles/PMC3565544/ /pubmed/23423242 http://dx.doi.org/10.4137/EBO.S10211 Text en © 2013 the author(s), publisher and licensee Libertas Academica Ltd. This is an open access article. Unrestricted non-commercial use is permitted provided the original work is properly cited. |
spellingShingle | Methodology Di Gaetano, Cornelia Matullo, Giuseppe Piazza, Alberto Ursino, Moreno Gasparini, Mauro A Proximity-Based Method to Identify Genomic Regions Correlated with a Continuously Varying Environmental Variable |
title | A Proximity-Based Method to Identify Genomic Regions Correlated with a Continuously Varying Environmental Variable |
title_full | A Proximity-Based Method to Identify Genomic Regions Correlated with a Continuously Varying Environmental Variable |
title_fullStr | A Proximity-Based Method to Identify Genomic Regions Correlated with a Continuously Varying Environmental Variable |
title_full_unstemmed | A Proximity-Based Method to Identify Genomic Regions Correlated with a Continuously Varying Environmental Variable |
title_short | A Proximity-Based Method to Identify Genomic Regions Correlated with a Continuously Varying Environmental Variable |
title_sort | proximity-based method to identify genomic regions correlated with a continuously varying environmental variable |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3565544/ https://www.ncbi.nlm.nih.gov/pubmed/23423242 http://dx.doi.org/10.4137/EBO.S10211 |
work_keys_str_mv | AT digaetanocornelia aproximitybasedmethodtoidentifygenomicregionscorrelatedwithacontinuouslyvaryingenvironmentalvariable AT matullogiuseppe aproximitybasedmethodtoidentifygenomicregionscorrelatedwithacontinuouslyvaryingenvironmentalvariable AT piazzaalberto aproximitybasedmethodtoidentifygenomicregionscorrelatedwithacontinuouslyvaryingenvironmentalvariable AT ursinomoreno aproximitybasedmethodtoidentifygenomicregionscorrelatedwithacontinuouslyvaryingenvironmentalvariable AT gasparinimauro aproximitybasedmethodtoidentifygenomicregionscorrelatedwithacontinuouslyvaryingenvironmentalvariable AT digaetanocornelia proximitybasedmethodtoidentifygenomicregionscorrelatedwithacontinuouslyvaryingenvironmentalvariable AT matullogiuseppe proximitybasedmethodtoidentifygenomicregionscorrelatedwithacontinuouslyvaryingenvironmentalvariable AT piazzaalberto proximitybasedmethodtoidentifygenomicregionscorrelatedwithacontinuouslyvaryingenvironmentalvariable AT ursinomoreno proximitybasedmethodtoidentifygenomicregionscorrelatedwithacontinuouslyvaryingenvironmentalvariable AT gasparinimauro proximitybasedmethodtoidentifygenomicregionscorrelatedwithacontinuouslyvaryingenvironmentalvariable |