Cargando…

A Proximity-Based Method to Identify Genomic Regions Correlated with a Continuously Varying Environmental Variable

Knowledge of markers in the human genome which show spatial patterns and display extreme correlation with different environmental determinants play an important role in understanding the factors which affect the biological evolution of our species. We used the genotype data of more than half a milli...

Descripción completa

Detalles Bibliográficos
Autores principales: Di Gaetano, Cornelia, Matullo, Giuseppe, Piazza, Alberto, Ursino, Moreno, Gasparini, Mauro
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Libertas Academica 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3565544/
https://www.ncbi.nlm.nih.gov/pubmed/23423242
http://dx.doi.org/10.4137/EBO.S10211
_version_ 1782258460000256000
author Di Gaetano, Cornelia
Matullo, Giuseppe
Piazza, Alberto
Ursino, Moreno
Gasparini, Mauro
author_facet Di Gaetano, Cornelia
Matullo, Giuseppe
Piazza, Alberto
Ursino, Moreno
Gasparini, Mauro
author_sort Di Gaetano, Cornelia
collection PubMed
description Knowledge of markers in the human genome which show spatial patterns and display extreme correlation with different environmental determinants play an important role in understanding the factors which affect the biological evolution of our species. We used the genotype data of more than half a million single nucleotide polymorphisms (SNPs) from the data set Human Genome Diversity Panel (HGDP-CEPH -CEPH) and we calculated Spearman’s correlation between absolute latitude and one of the two allele frequencies of each SNP. We selected SNPs with a correlation coefficient within the upper 1% tail of the distribution. We then used a criterion of proximity between significant variants to focus on DNA regions showing a continuous signal over a portion of the genome. Based on external information and genome annotations, we demonstrated that most regions with the strongest signals also have biological relevance. We believe this proximity requirement adds an edge to our novel method compared to the existing literature, highlighting several genes (for example DTNB, DOT1L, TPCN2, RELN, MSRA, NRG3) related to body size or shape, human height, hair color, and schizophrenia. Our approach can be applied generally to any measure of association between polymorphic frequencies and continuously varying environmental variables.
format Online
Article
Text
id pubmed-3565544
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-35655442013-02-19 A Proximity-Based Method to Identify Genomic Regions Correlated with a Continuously Varying Environmental Variable Di Gaetano, Cornelia Matullo, Giuseppe Piazza, Alberto Ursino, Moreno Gasparini, Mauro Evol Bioinform Online Methodology Knowledge of markers in the human genome which show spatial patterns and display extreme correlation with different environmental determinants play an important role in understanding the factors which affect the biological evolution of our species. We used the genotype data of more than half a million single nucleotide polymorphisms (SNPs) from the data set Human Genome Diversity Panel (HGDP-CEPH -CEPH) and we calculated Spearman’s correlation between absolute latitude and one of the two allele frequencies of each SNP. We selected SNPs with a correlation coefficient within the upper 1% tail of the distribution. We then used a criterion of proximity between significant variants to focus on DNA regions showing a continuous signal over a portion of the genome. Based on external information and genome annotations, we demonstrated that most regions with the strongest signals also have biological relevance. We believe this proximity requirement adds an edge to our novel method compared to the existing literature, highlighting several genes (for example DTNB, DOT1L, TPCN2, RELN, MSRA, NRG3) related to body size or shape, human height, hair color, and schizophrenia. Our approach can be applied generally to any measure of association between polymorphic frequencies and continuously varying environmental variables. Libertas Academica 2013-01-29 /pmc/articles/PMC3565544/ /pubmed/23423242 http://dx.doi.org/10.4137/EBO.S10211 Text en © 2013 the author(s), publisher and licensee Libertas Academica Ltd. This is an open access article. Unrestricted non-commercial use is permitted provided the original work is properly cited.
spellingShingle Methodology
Di Gaetano, Cornelia
Matullo, Giuseppe
Piazza, Alberto
Ursino, Moreno
Gasparini, Mauro
A Proximity-Based Method to Identify Genomic Regions Correlated with a Continuously Varying Environmental Variable
title A Proximity-Based Method to Identify Genomic Regions Correlated with a Continuously Varying Environmental Variable
title_full A Proximity-Based Method to Identify Genomic Regions Correlated with a Continuously Varying Environmental Variable
title_fullStr A Proximity-Based Method to Identify Genomic Regions Correlated with a Continuously Varying Environmental Variable
title_full_unstemmed A Proximity-Based Method to Identify Genomic Regions Correlated with a Continuously Varying Environmental Variable
title_short A Proximity-Based Method to Identify Genomic Regions Correlated with a Continuously Varying Environmental Variable
title_sort proximity-based method to identify genomic regions correlated with a continuously varying environmental variable
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3565544/
https://www.ncbi.nlm.nih.gov/pubmed/23423242
http://dx.doi.org/10.4137/EBO.S10211
work_keys_str_mv AT digaetanocornelia aproximitybasedmethodtoidentifygenomicregionscorrelatedwithacontinuouslyvaryingenvironmentalvariable
AT matullogiuseppe aproximitybasedmethodtoidentifygenomicregionscorrelatedwithacontinuouslyvaryingenvironmentalvariable
AT piazzaalberto aproximitybasedmethodtoidentifygenomicregionscorrelatedwithacontinuouslyvaryingenvironmentalvariable
AT ursinomoreno aproximitybasedmethodtoidentifygenomicregionscorrelatedwithacontinuouslyvaryingenvironmentalvariable
AT gasparinimauro aproximitybasedmethodtoidentifygenomicregionscorrelatedwithacontinuouslyvaryingenvironmentalvariable
AT digaetanocornelia proximitybasedmethodtoidentifygenomicregionscorrelatedwithacontinuouslyvaryingenvironmentalvariable
AT matullogiuseppe proximitybasedmethodtoidentifygenomicregionscorrelatedwithacontinuouslyvaryingenvironmentalvariable
AT piazzaalberto proximitybasedmethodtoidentifygenomicregionscorrelatedwithacontinuouslyvaryingenvironmentalvariable
AT ursinomoreno proximitybasedmethodtoidentifygenomicregionscorrelatedwithacontinuouslyvaryingenvironmentalvariable
AT gasparinimauro proximitybasedmethodtoidentifygenomicregionscorrelatedwithacontinuouslyvaryingenvironmentalvariable