Cargando…

An accurate clone-based haplotyping method by overlapping pool sequencing

Chromosome-long haplotyping of human genomes is important to identify genetic variants with differing gene expression, in human evolution studies, clinical diagnosis, and other biological and medical fields. Although several methods have realized haplotyping based on sequencing technologies or popul...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Cheng, Cao, Changchang, Tu, Jing, Sun, Xiao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4937318/
https://www.ncbi.nlm.nih.gov/pubmed/27095193
http://dx.doi.org/10.1093/nar/gkw284
_version_ 1782441690378797056
author Li, Cheng
Cao, Changchang
Tu, Jing
Sun, Xiao
author_facet Li, Cheng
Cao, Changchang
Tu, Jing
Sun, Xiao
author_sort Li, Cheng
collection PubMed
description Chromosome-long haplotyping of human genomes is important to identify genetic variants with differing gene expression, in human evolution studies, clinical diagnosis, and other biological and medical fields. Although several methods have realized haplotyping based on sequencing technologies or population statistics, accuracy and cost are factors that prohibit their wide use. Borrowing ideas from group testing theories, we proposed a clone-based haplotyping method by overlapping pool sequencing. The clones from a single individual were pooled combinatorially and then sequenced. According to the distinct pooling pattern for each clone in the overlapping pool sequencing, alleles for the recovered variants could be assigned to their original clones precisely. Subsequently, the clone sequences could be reconstructed by linking these alleles accordingly and assembling them into haplotypes with high accuracy. To verify the utility of our method, we constructed 130 110 clones in silico for the individual NA12878 and simulated the pooling and sequencing process. Ultimately, 99.9% of variants on chromosome 1 that were covered by clones from both parental chromosomes were recovered correctly, and 112 haplotype contigs were assembled with an N50 length of 3.4 Mb and no switch errors. A comparison with current clone-based haplotyping methods indicated our method was more accurate.
format Online
Article
Text
id pubmed-4937318
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-49373182016-07-11 An accurate clone-based haplotyping method by overlapping pool sequencing Li, Cheng Cao, Changchang Tu, Jing Sun, Xiao Nucleic Acids Res Methods Online Chromosome-long haplotyping of human genomes is important to identify genetic variants with differing gene expression, in human evolution studies, clinical diagnosis, and other biological and medical fields. Although several methods have realized haplotyping based on sequencing technologies or population statistics, accuracy and cost are factors that prohibit their wide use. Borrowing ideas from group testing theories, we proposed a clone-based haplotyping method by overlapping pool sequencing. The clones from a single individual were pooled combinatorially and then sequenced. According to the distinct pooling pattern for each clone in the overlapping pool sequencing, alleles for the recovered variants could be assigned to their original clones precisely. Subsequently, the clone sequences could be reconstructed by linking these alleles accordingly and assembling them into haplotypes with high accuracy. To verify the utility of our method, we constructed 130 110 clones in silico for the individual NA12878 and simulated the pooling and sequencing process. Ultimately, 99.9% of variants on chromosome 1 that were covered by clones from both parental chromosomes were recovered correctly, and 112 haplotype contigs were assembled with an N50 length of 3.4 Mb and no switch errors. A comparison with current clone-based haplotyping methods indicated our method was more accurate. Oxford University Press 2016-07-08 2016-04-19 /pmc/articles/PMC4937318/ /pubmed/27095193 http://dx.doi.org/10.1093/nar/gkw284 Text en © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Online
Li, Cheng
Cao, Changchang
Tu, Jing
Sun, Xiao
An accurate clone-based haplotyping method by overlapping pool sequencing
title An accurate clone-based haplotyping method by overlapping pool sequencing
title_full An accurate clone-based haplotyping method by overlapping pool sequencing
title_fullStr An accurate clone-based haplotyping method by overlapping pool sequencing
title_full_unstemmed An accurate clone-based haplotyping method by overlapping pool sequencing
title_short An accurate clone-based haplotyping method by overlapping pool sequencing
title_sort accurate clone-based haplotyping method by overlapping pool sequencing
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4937318/
https://www.ncbi.nlm.nih.gov/pubmed/27095193
http://dx.doi.org/10.1093/nar/gkw284
work_keys_str_mv AT licheng anaccurateclonebasedhaplotypingmethodbyoverlappingpoolsequencing
AT caochangchang anaccurateclonebasedhaplotypingmethodbyoverlappingpoolsequencing
AT tujing anaccurateclonebasedhaplotypingmethodbyoverlappingpoolsequencing
AT sunxiao anaccurateclonebasedhaplotypingmethodbyoverlappingpoolsequencing
AT licheng accurateclonebasedhaplotypingmethodbyoverlappingpoolsequencing
AT caochangchang accurateclonebasedhaplotypingmethodbyoverlappingpoolsequencing
AT tujing accurateclonebasedhaplotypingmethodbyoverlappingpoolsequencing
AT sunxiao accurateclonebasedhaplotypingmethodbyoverlappingpoolsequencing