Cargando…

Complete Genome Phasing of Family Quartet by Combination of Genetic, Physical and Population-Based Phasing Analysis

Phased genome maps are important to understand genetic and epigenetic regulation and disease mechanisms, particularly parental imprinting defects. Phasing is also critical to assess the functional consequences of genetic variants, and to allow precise definition of haplotype blocks which is useful t...

Descripción completa

Detalles Bibliográficos
Autores principales: Lajugie, Julien, Mukhopadhyay, Rituparna, Schizas, Michael, Lailler, Nathalie, Fourel, Nicolas, Bouhassira, Eric E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3669306/
https://www.ncbi.nlm.nih.gov/pubmed/23741343
http://dx.doi.org/10.1371/journal.pone.0064571
_version_ 1782271728999727104
author Lajugie, Julien
Mukhopadhyay, Rituparna
Schizas, Michael
Lailler, Nathalie
Fourel, Nicolas
Bouhassira, Eric E.
author_facet Lajugie, Julien
Mukhopadhyay, Rituparna
Schizas, Michael
Lailler, Nathalie
Fourel, Nicolas
Bouhassira, Eric E.
author_sort Lajugie, Julien
collection PubMed
description Phased genome maps are important to understand genetic and epigenetic regulation and disease mechanisms, particularly parental imprinting defects. Phasing is also critical to assess the functional consequences of genetic variants, and to allow precise definition of haplotype blocks which is useful to understand gene-flow and genotype-phenotype association at the population level. Transmission phasing by analysis of a family quartet allows the phasing of 95% of all variants as the uniformly heterozygous positions cannot be phased. Here, we report a phasing method based on a combination of transmission analysis, physical phasing by pair-end sequencing of libraries of staggered sizes and population-based analysis. Sequencing of a healthy Caucasians quartet at 120x coverage and combination of physical and transmission phasing yielded the phased genotypes of about 99.8% of the SNPs, indels and structural variants present in the quartet, a phasing rate significantly higher than what can be achieved using any single phasing method. A false positive SNP error rate below 10*E-7 per genome and per base was obtained using a combination of filters. We provide a complete list of SNPs, indels and structural variants, an analysis of haplotype block sizes, and an analysis of the false positive and negative variant calling error rates. Improved genome phasing and family sequencing will increase the power of genome-wide sequencing as a clinical diagnosis tool and has myriad basic science applications.
format Online
Article
Text
id pubmed-3669306
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-36693062013-06-05 Complete Genome Phasing of Family Quartet by Combination of Genetic, Physical and Population-Based Phasing Analysis Lajugie, Julien Mukhopadhyay, Rituparna Schizas, Michael Lailler, Nathalie Fourel, Nicolas Bouhassira, Eric E. PLoS One Research Article Phased genome maps are important to understand genetic and epigenetic regulation and disease mechanisms, particularly parental imprinting defects. Phasing is also critical to assess the functional consequences of genetic variants, and to allow precise definition of haplotype blocks which is useful to understand gene-flow and genotype-phenotype association at the population level. Transmission phasing by analysis of a family quartet allows the phasing of 95% of all variants as the uniformly heterozygous positions cannot be phased. Here, we report a phasing method based on a combination of transmission analysis, physical phasing by pair-end sequencing of libraries of staggered sizes and population-based analysis. Sequencing of a healthy Caucasians quartet at 120x coverage and combination of physical and transmission phasing yielded the phased genotypes of about 99.8% of the SNPs, indels and structural variants present in the quartet, a phasing rate significantly higher than what can be achieved using any single phasing method. A false positive SNP error rate below 10*E-7 per genome and per base was obtained using a combination of filters. We provide a complete list of SNPs, indels and structural variants, an analysis of haplotype block sizes, and an analysis of the false positive and negative variant calling error rates. Improved genome phasing and family sequencing will increase the power of genome-wide sequencing as a clinical diagnosis tool and has myriad basic science applications. Public Library of Science 2013-05-31 /pmc/articles/PMC3669306/ /pubmed/23741343 http://dx.doi.org/10.1371/journal.pone.0064571 Text en © 2013 Lajugie et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Lajugie, Julien
Mukhopadhyay, Rituparna
Schizas, Michael
Lailler, Nathalie
Fourel, Nicolas
Bouhassira, Eric E.
Complete Genome Phasing of Family Quartet by Combination of Genetic, Physical and Population-Based Phasing Analysis
title Complete Genome Phasing of Family Quartet by Combination of Genetic, Physical and Population-Based Phasing Analysis
title_full Complete Genome Phasing of Family Quartet by Combination of Genetic, Physical and Population-Based Phasing Analysis
title_fullStr Complete Genome Phasing of Family Quartet by Combination of Genetic, Physical and Population-Based Phasing Analysis
title_full_unstemmed Complete Genome Phasing of Family Quartet by Combination of Genetic, Physical and Population-Based Phasing Analysis
title_short Complete Genome Phasing of Family Quartet by Combination of Genetic, Physical and Population-Based Phasing Analysis
title_sort complete genome phasing of family quartet by combination of genetic, physical and population-based phasing analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3669306/
https://www.ncbi.nlm.nih.gov/pubmed/23741343
http://dx.doi.org/10.1371/journal.pone.0064571
work_keys_str_mv AT lajugiejulien completegenomephasingoffamilyquartetbycombinationofgeneticphysicalandpopulationbasedphasinganalysis
AT mukhopadhyayrituparna completegenomephasingoffamilyquartetbycombinationofgeneticphysicalandpopulationbasedphasinganalysis
AT schizasmichael completegenomephasingoffamilyquartetbycombinationofgeneticphysicalandpopulationbasedphasinganalysis
AT laillernathalie completegenomephasingoffamilyquartetbycombinationofgeneticphysicalandpopulationbasedphasinganalysis
AT fourelnicolas completegenomephasingoffamilyquartetbycombinationofgeneticphysicalandpopulationbasedphasinganalysis
AT bouhassiraerice completegenomephasingoffamilyquartetbycombinationofgeneticphysicalandpopulationbasedphasinganalysis