Cargando…

Identifying Copy Number Variants under Selection in Geographically Structured Populations Based on F-statistics

Large-scale copy number variants (CNVs) in the human provide the raw material for delineating population differences, as natural selection may have affected at least some of the CNVs thus far discovered. Although the examination of relatively large numbers of specific ethnic groups has recently star...

Descripción completa

Detalles Bibliográficos
Autores principales: Song, Hae-Hiang, Hu, Hae-Jin, Seok, In-Hae, Chung, Yeun-Jun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Korea Genome Organization 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3480682/
https://www.ncbi.nlm.nih.gov/pubmed/23105934
http://dx.doi.org/10.5808/GI.2012.10.2.81
_version_ 1782247597587562496
author Song, Hae-Hiang
Hu, Hae-Jin
Seok, In-Hae
Chung, Yeun-Jun
author_facet Song, Hae-Hiang
Hu, Hae-Jin
Seok, In-Hae
Chung, Yeun-Jun
author_sort Song, Hae-Hiang
collection PubMed
description Large-scale copy number variants (CNVs) in the human provide the raw material for delineating population differences, as natural selection may have affected at least some of the CNVs thus far discovered. Although the examination of relatively large numbers of specific ethnic groups has recently started in regard to inter-ethnic group differences in CNVs, identifying and understanding particular instances of natural selection have not been performed. The traditional F(ST) measure, obtained from differences in allele frequencies between populations, has been used to identify CNVs loci subject to geographically varying selection. Here, we review advances and the application of multinomial-Dirichlet likelihood methods of inference for identifying genome regions that have been subject to natural selection with the F(ST) estimates. The contents of presentation are not new; however, this review clarifies how the application of the methods to CNV data, which remains largely unexplored, is possible. A hierarchical Bayesian method, which is implemented via Markov Chain Monte Carlo, estimates locus-specific F(ST) and can identify outlying CNVs loci with large values of F(ST). By applying this Bayesian method to the publicly available CNV data, we identified the CNV loci that show signals of natural selection, which may elucidate the genetic basis of human disease and diversity.
format Online
Article
Text
id pubmed-3480682
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Korea Genome Organization
record_format MEDLINE/PubMed
spelling pubmed-34806822012-10-26 Identifying Copy Number Variants under Selection in Geographically Structured Populations Based on F-statistics Song, Hae-Hiang Hu, Hae-Jin Seok, In-Hae Chung, Yeun-Jun Genomics Inf Review Large-scale copy number variants (CNVs) in the human provide the raw material for delineating population differences, as natural selection may have affected at least some of the CNVs thus far discovered. Although the examination of relatively large numbers of specific ethnic groups has recently started in regard to inter-ethnic group differences in CNVs, identifying and understanding particular instances of natural selection have not been performed. The traditional F(ST) measure, obtained from differences in allele frequencies between populations, has been used to identify CNVs loci subject to geographically varying selection. Here, we review advances and the application of multinomial-Dirichlet likelihood methods of inference for identifying genome regions that have been subject to natural selection with the F(ST) estimates. The contents of presentation are not new; however, this review clarifies how the application of the methods to CNV data, which remains largely unexplored, is possible. A hierarchical Bayesian method, which is implemented via Markov Chain Monte Carlo, estimates locus-specific F(ST) and can identify outlying CNVs loci with large values of F(ST). By applying this Bayesian method to the publicly available CNV data, we identified the CNV loci that show signals of natural selection, which may elucidate the genetic basis of human disease and diversity. Korea Genome Organization 2012-06 2012-06-30 /pmc/articles/PMC3480682/ /pubmed/23105934 http://dx.doi.org/10.5808/GI.2012.10.2.81 Text en Copyright © 2012 by The Korea Genome Organization http://creativecommons.org/licenses/by-nc/3.0 It is identical to the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/).
spellingShingle Review
Song, Hae-Hiang
Hu, Hae-Jin
Seok, In-Hae
Chung, Yeun-Jun
Identifying Copy Number Variants under Selection in Geographically Structured Populations Based on F-statistics
title Identifying Copy Number Variants under Selection in Geographically Structured Populations Based on F-statistics
title_full Identifying Copy Number Variants under Selection in Geographically Structured Populations Based on F-statistics
title_fullStr Identifying Copy Number Variants under Selection in Geographically Structured Populations Based on F-statistics
title_full_unstemmed Identifying Copy Number Variants under Selection in Geographically Structured Populations Based on F-statistics
title_short Identifying Copy Number Variants under Selection in Geographically Structured Populations Based on F-statistics
title_sort identifying copy number variants under selection in geographically structured populations based on f-statistics
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3480682/
https://www.ncbi.nlm.nih.gov/pubmed/23105934
http://dx.doi.org/10.5808/GI.2012.10.2.81
work_keys_str_mv AT songhaehiang identifyingcopynumbervariantsunderselectioningeographicallystructuredpopulationsbasedonfstatistics
AT huhaejin identifyingcopynumbervariantsunderselectioningeographicallystructuredpopulationsbasedonfstatistics
AT seokinhae identifyingcopynumbervariantsunderselectioningeographicallystructuredpopulationsbasedonfstatistics
AT chungyeunjun identifyingcopynumbervariantsunderselectioningeographicallystructuredpopulationsbasedonfstatistics