Cargando…

Investigation of Inversion Polymorphisms in the Human Genome Using Principal Components Analysis

Despite the significant advances made over the last few years in mapping inversions with the advent of paired-end sequencing approaches, our understanding of the prevalence and spectrum of inversions in the human genome has lagged behind other types of structural variants, mainly due to the lack of...

Descripción completa

Detalles Bibliográficos
Autores principales: Ma, Jianzhong, Amos, Christopher I.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3392271/
https://www.ncbi.nlm.nih.gov/pubmed/22808122
http://dx.doi.org/10.1371/journal.pone.0040224
_version_ 1782237613910917120
author Ma, Jianzhong
Amos, Christopher I.
author_facet Ma, Jianzhong
Amos, Christopher I.
author_sort Ma, Jianzhong
collection PubMed
description Despite the significant advances made over the last few years in mapping inversions with the advent of paired-end sequencing approaches, our understanding of the prevalence and spectrum of inversions in the human genome has lagged behind other types of structural variants, mainly due to the lack of a cost-efficient method applicable to large-scale samples. We propose a novel method based on principal components analysis (PCA) to characterize inversion polymorphisms using high-density SNP genotype data. Our method applies to non-recurrent inversions for which recombination between the inverted and non-inverted segments in inversion heterozygotes is suppressed due to the loss of unbalanced gametes. Inside such an inversion region, an effect similar to population substructure is thus created: two distinct “populations” of inversion homozygotes of different orientations and their 1∶1 admixture, namely the inversion heterozygotes. This kind of substructure can be readily detected by performing PCA locally in the inversion regions. Using simulations, we demonstrated that the proposed method can be used to detect and genotype inversion polymorphisms using unphased genotype data. We applied our method to the phase III HapMap data and inferred the inversion genotypes of known inversion polymorphisms at 8p23.1 and 17q21.31. These inversion genotypes were validated by comparing with literature results and by checking Mendelian consistency using the family data whenever available. Based on the PCA-approach, we also performed a preliminary genome-wide scan for inversions using the HapMap data, which resulted in 2040 candidate inversions, 169 of which overlapped with previously reported inversions. Our method can be readily applied to the abundant SNP data, and is expected to play an important role in developing human genome maps of inversions and exploring associations between inversions and susceptibility of diseases.
format Online
Article
Text
id pubmed-3392271
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-33922712012-07-17 Investigation of Inversion Polymorphisms in the Human Genome Using Principal Components Analysis Ma, Jianzhong Amos, Christopher I. PLoS One Research Article Despite the significant advances made over the last few years in mapping inversions with the advent of paired-end sequencing approaches, our understanding of the prevalence and spectrum of inversions in the human genome has lagged behind other types of structural variants, mainly due to the lack of a cost-efficient method applicable to large-scale samples. We propose a novel method based on principal components analysis (PCA) to characterize inversion polymorphisms using high-density SNP genotype data. Our method applies to non-recurrent inversions for which recombination between the inverted and non-inverted segments in inversion heterozygotes is suppressed due to the loss of unbalanced gametes. Inside such an inversion region, an effect similar to population substructure is thus created: two distinct “populations” of inversion homozygotes of different orientations and their 1∶1 admixture, namely the inversion heterozygotes. This kind of substructure can be readily detected by performing PCA locally in the inversion regions. Using simulations, we demonstrated that the proposed method can be used to detect and genotype inversion polymorphisms using unphased genotype data. We applied our method to the phase III HapMap data and inferred the inversion genotypes of known inversion polymorphisms at 8p23.1 and 17q21.31. These inversion genotypes were validated by comparing with literature results and by checking Mendelian consistency using the family data whenever available. Based on the PCA-approach, we also performed a preliminary genome-wide scan for inversions using the HapMap data, which resulted in 2040 candidate inversions, 169 of which overlapped with previously reported inversions. Our method can be readily applied to the abundant SNP data, and is expected to play an important role in developing human genome maps of inversions and exploring associations between inversions and susceptibility of diseases. Public Library of Science 2012-07-09 /pmc/articles/PMC3392271/ /pubmed/22808122 http://dx.doi.org/10.1371/journal.pone.0040224 Text en Ma, Amos. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Ma, Jianzhong
Amos, Christopher I.
Investigation of Inversion Polymorphisms in the Human Genome Using Principal Components Analysis
title Investigation of Inversion Polymorphisms in the Human Genome Using Principal Components Analysis
title_full Investigation of Inversion Polymorphisms in the Human Genome Using Principal Components Analysis
title_fullStr Investigation of Inversion Polymorphisms in the Human Genome Using Principal Components Analysis
title_full_unstemmed Investigation of Inversion Polymorphisms in the Human Genome Using Principal Components Analysis
title_short Investigation of Inversion Polymorphisms in the Human Genome Using Principal Components Analysis
title_sort investigation of inversion polymorphisms in the human genome using principal components analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3392271/
https://www.ncbi.nlm.nih.gov/pubmed/22808122
http://dx.doi.org/10.1371/journal.pone.0040224
work_keys_str_mv AT majianzhong investigationofinversionpolymorphismsinthehumangenomeusingprincipalcomponentsanalysis
AT amoschristopheri investigationofinversionpolymorphismsinthehumangenomeusingprincipalcomponentsanalysis