Cargando…

A Comparison between Hi-C and 10X Genomics Linked Read Sequencing for Whole Genome Phasing in Hanwoo Cattle

Until recently, genome-scale phasing was limited due to the short read sizes of sequence data. Though the use of long-read sequencing can overcome this limitation, they require extensive error correction. The emergence of technologies such as 10X genomics linked read sequencing and Hi-C which uses s...

Descripción completa

Detalles Bibliográficos
Autores principales: Srikanth, Krishnamoorthy, Park, Jong-Eun, Lim, Dajeong, Cha, Jihye, Cho, Sang-Rae, Cho, In-Cheol, Park, Woncheoul
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7140831/
https://www.ncbi.nlm.nih.gov/pubmed/32245072
http://dx.doi.org/10.3390/genes11030332
_version_ 1783519078919438336
author Srikanth, Krishnamoorthy
Park, Jong-Eun
Lim, Dajeong
Cha, Jihye
Cho, Sang-Rae
Cho, In-Cheol
Park, Woncheoul
author_facet Srikanth, Krishnamoorthy
Park, Jong-Eun
Lim, Dajeong
Cha, Jihye
Cho, Sang-Rae
Cho, In-Cheol
Park, Woncheoul
author_sort Srikanth, Krishnamoorthy
collection PubMed
description Until recently, genome-scale phasing was limited due to the short read sizes of sequence data. Though the use of long-read sequencing can overcome this limitation, they require extensive error correction. The emergence of technologies such as 10X genomics linked read sequencing and Hi-C which uses short-read sequencers along with library preparation protocols that facilitates long-read assemblies have greatly reduced the complexities of genome scale phasing. Moreover, it is possible to accurately assemble phased genome of individual samples using these methods. Therefore, in this study, we compared three phasing strategies which included two sample preparation methods along with the Long Ranger pipeline of 10X genomics and HapCut2 software, namely 10X-LG, 10X-HapCut2, and HiC-HapCut2 and assessed their performance and accuracy. We found that the 10X-LG had the best phasing performance amongst the method analyzed. They had the highest phasing rate (89.6%), longest adjusted N50 (1.24 Mb), and lowest switch error rate (0.07%). Moreover, the phasing accuracy and yield of the 10X-LG stayed over 90% for distances up to 4 Mb and 550 Kb respectively, which were considerably higher than 10X-HapCut2 and Hi-C Hapcut2. The results of this study will serve as a good reference for future benchmarking studies and also for reference-based imputation in Hanwoo.
format Online
Article
Text
id pubmed-7140831
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-71408312020-04-10 A Comparison between Hi-C and 10X Genomics Linked Read Sequencing for Whole Genome Phasing in Hanwoo Cattle Srikanth, Krishnamoorthy Park, Jong-Eun Lim, Dajeong Cha, Jihye Cho, Sang-Rae Cho, In-Cheol Park, Woncheoul Genes (Basel) Communication Until recently, genome-scale phasing was limited due to the short read sizes of sequence data. Though the use of long-read sequencing can overcome this limitation, they require extensive error correction. The emergence of technologies such as 10X genomics linked read sequencing and Hi-C which uses short-read sequencers along with library preparation protocols that facilitates long-read assemblies have greatly reduced the complexities of genome scale phasing. Moreover, it is possible to accurately assemble phased genome of individual samples using these methods. Therefore, in this study, we compared three phasing strategies which included two sample preparation methods along with the Long Ranger pipeline of 10X genomics and HapCut2 software, namely 10X-LG, 10X-HapCut2, and HiC-HapCut2 and assessed their performance and accuracy. We found that the 10X-LG had the best phasing performance amongst the method analyzed. They had the highest phasing rate (89.6%), longest adjusted N50 (1.24 Mb), and lowest switch error rate (0.07%). Moreover, the phasing accuracy and yield of the 10X-LG stayed over 90% for distances up to 4 Mb and 550 Kb respectively, which were considerably higher than 10X-HapCut2 and Hi-C Hapcut2. The results of this study will serve as a good reference for future benchmarking studies and also for reference-based imputation in Hanwoo. MDPI 2020-03-20 /pmc/articles/PMC7140831/ /pubmed/32245072 http://dx.doi.org/10.3390/genes11030332 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Communication
Srikanth, Krishnamoorthy
Park, Jong-Eun
Lim, Dajeong
Cha, Jihye
Cho, Sang-Rae
Cho, In-Cheol
Park, Woncheoul
A Comparison between Hi-C and 10X Genomics Linked Read Sequencing for Whole Genome Phasing in Hanwoo Cattle
title A Comparison between Hi-C and 10X Genomics Linked Read Sequencing for Whole Genome Phasing in Hanwoo Cattle
title_full A Comparison between Hi-C and 10X Genomics Linked Read Sequencing for Whole Genome Phasing in Hanwoo Cattle
title_fullStr A Comparison between Hi-C and 10X Genomics Linked Read Sequencing for Whole Genome Phasing in Hanwoo Cattle
title_full_unstemmed A Comparison between Hi-C and 10X Genomics Linked Read Sequencing for Whole Genome Phasing in Hanwoo Cattle
title_short A Comparison between Hi-C and 10X Genomics Linked Read Sequencing for Whole Genome Phasing in Hanwoo Cattle
title_sort comparison between hi-c and 10x genomics linked read sequencing for whole genome phasing in hanwoo cattle
topic Communication
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7140831/
https://www.ncbi.nlm.nih.gov/pubmed/32245072
http://dx.doi.org/10.3390/genes11030332
work_keys_str_mv AT srikanthkrishnamoorthy acomparisonbetweenhicand10xgenomicslinkedreadsequencingforwholegenomephasinginhanwoocattle
AT parkjongeun acomparisonbetweenhicand10xgenomicslinkedreadsequencingforwholegenomephasinginhanwoocattle
AT limdajeong acomparisonbetweenhicand10xgenomicslinkedreadsequencingforwholegenomephasinginhanwoocattle
AT chajihye acomparisonbetweenhicand10xgenomicslinkedreadsequencingforwholegenomephasinginhanwoocattle
AT chosangrae acomparisonbetweenhicand10xgenomicslinkedreadsequencingforwholegenomephasinginhanwoocattle
AT choincheol acomparisonbetweenhicand10xgenomicslinkedreadsequencingforwholegenomephasinginhanwoocattle
AT parkwoncheoul acomparisonbetweenhicand10xgenomicslinkedreadsequencingforwholegenomephasinginhanwoocattle
AT srikanthkrishnamoorthy comparisonbetweenhicand10xgenomicslinkedreadsequencingforwholegenomephasinginhanwoocattle
AT parkjongeun comparisonbetweenhicand10xgenomicslinkedreadsequencingforwholegenomephasinginhanwoocattle
AT limdajeong comparisonbetweenhicand10xgenomicslinkedreadsequencingforwholegenomephasinginhanwoocattle
AT chajihye comparisonbetweenhicand10xgenomicslinkedreadsequencingforwholegenomephasinginhanwoocattle
AT chosangrae comparisonbetweenhicand10xgenomicslinkedreadsequencingforwholegenomephasinginhanwoocattle
AT choincheol comparisonbetweenhicand10xgenomicslinkedreadsequencingforwholegenomephasinginhanwoocattle
AT parkwoncheoul comparisonbetweenhicand10xgenomicslinkedreadsequencingforwholegenomephasinginhanwoocattle