Cargando…
A Comparison between Hi-C and 10X Genomics Linked Read Sequencing for Whole Genome Phasing in Hanwoo Cattle
Until recently, genome-scale phasing was limited due to the short read sizes of sequence data. Though the use of long-read sequencing can overcome this limitation, they require extensive error correction. The emergence of technologies such as 10X genomics linked read sequencing and Hi-C which uses s...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7140831/ https://www.ncbi.nlm.nih.gov/pubmed/32245072 http://dx.doi.org/10.3390/genes11030332 |
_version_ | 1783519078919438336 |
---|---|
author | Srikanth, Krishnamoorthy Park, Jong-Eun Lim, Dajeong Cha, Jihye Cho, Sang-Rae Cho, In-Cheol Park, Woncheoul |
author_facet | Srikanth, Krishnamoorthy Park, Jong-Eun Lim, Dajeong Cha, Jihye Cho, Sang-Rae Cho, In-Cheol Park, Woncheoul |
author_sort | Srikanth, Krishnamoorthy |
collection | PubMed |
description | Until recently, genome-scale phasing was limited due to the short read sizes of sequence data. Though the use of long-read sequencing can overcome this limitation, they require extensive error correction. The emergence of technologies such as 10X genomics linked read sequencing and Hi-C which uses short-read sequencers along with library preparation protocols that facilitates long-read assemblies have greatly reduced the complexities of genome scale phasing. Moreover, it is possible to accurately assemble phased genome of individual samples using these methods. Therefore, in this study, we compared three phasing strategies which included two sample preparation methods along with the Long Ranger pipeline of 10X genomics and HapCut2 software, namely 10X-LG, 10X-HapCut2, and HiC-HapCut2 and assessed their performance and accuracy. We found that the 10X-LG had the best phasing performance amongst the method analyzed. They had the highest phasing rate (89.6%), longest adjusted N50 (1.24 Mb), and lowest switch error rate (0.07%). Moreover, the phasing accuracy and yield of the 10X-LG stayed over 90% for distances up to 4 Mb and 550 Kb respectively, which were considerably higher than 10X-HapCut2 and Hi-C Hapcut2. The results of this study will serve as a good reference for future benchmarking studies and also for reference-based imputation in Hanwoo. |
format | Online Article Text |
id | pubmed-7140831 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-71408312020-04-10 A Comparison between Hi-C and 10X Genomics Linked Read Sequencing for Whole Genome Phasing in Hanwoo Cattle Srikanth, Krishnamoorthy Park, Jong-Eun Lim, Dajeong Cha, Jihye Cho, Sang-Rae Cho, In-Cheol Park, Woncheoul Genes (Basel) Communication Until recently, genome-scale phasing was limited due to the short read sizes of sequence data. Though the use of long-read sequencing can overcome this limitation, they require extensive error correction. The emergence of technologies such as 10X genomics linked read sequencing and Hi-C which uses short-read sequencers along with library preparation protocols that facilitates long-read assemblies have greatly reduced the complexities of genome scale phasing. Moreover, it is possible to accurately assemble phased genome of individual samples using these methods. Therefore, in this study, we compared three phasing strategies which included two sample preparation methods along with the Long Ranger pipeline of 10X genomics and HapCut2 software, namely 10X-LG, 10X-HapCut2, and HiC-HapCut2 and assessed their performance and accuracy. We found that the 10X-LG had the best phasing performance amongst the method analyzed. They had the highest phasing rate (89.6%), longest adjusted N50 (1.24 Mb), and lowest switch error rate (0.07%). Moreover, the phasing accuracy and yield of the 10X-LG stayed over 90% for distances up to 4 Mb and 550 Kb respectively, which were considerably higher than 10X-HapCut2 and Hi-C Hapcut2. The results of this study will serve as a good reference for future benchmarking studies and also for reference-based imputation in Hanwoo. MDPI 2020-03-20 /pmc/articles/PMC7140831/ /pubmed/32245072 http://dx.doi.org/10.3390/genes11030332 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Communication Srikanth, Krishnamoorthy Park, Jong-Eun Lim, Dajeong Cha, Jihye Cho, Sang-Rae Cho, In-Cheol Park, Woncheoul A Comparison between Hi-C and 10X Genomics Linked Read Sequencing for Whole Genome Phasing in Hanwoo Cattle |
title | A Comparison between Hi-C and 10X Genomics Linked Read Sequencing for Whole Genome Phasing in Hanwoo Cattle |
title_full | A Comparison between Hi-C and 10X Genomics Linked Read Sequencing for Whole Genome Phasing in Hanwoo Cattle |
title_fullStr | A Comparison between Hi-C and 10X Genomics Linked Read Sequencing for Whole Genome Phasing in Hanwoo Cattle |
title_full_unstemmed | A Comparison between Hi-C and 10X Genomics Linked Read Sequencing for Whole Genome Phasing in Hanwoo Cattle |
title_short | A Comparison between Hi-C and 10X Genomics Linked Read Sequencing for Whole Genome Phasing in Hanwoo Cattle |
title_sort | comparison between hi-c and 10x genomics linked read sequencing for whole genome phasing in hanwoo cattle |
topic | Communication |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7140831/ https://www.ncbi.nlm.nih.gov/pubmed/32245072 http://dx.doi.org/10.3390/genes11030332 |
work_keys_str_mv | AT srikanthkrishnamoorthy acomparisonbetweenhicand10xgenomicslinkedreadsequencingforwholegenomephasinginhanwoocattle AT parkjongeun acomparisonbetweenhicand10xgenomicslinkedreadsequencingforwholegenomephasinginhanwoocattle AT limdajeong acomparisonbetweenhicand10xgenomicslinkedreadsequencingforwholegenomephasinginhanwoocattle AT chajihye acomparisonbetweenhicand10xgenomicslinkedreadsequencingforwholegenomephasinginhanwoocattle AT chosangrae acomparisonbetweenhicand10xgenomicslinkedreadsequencingforwholegenomephasinginhanwoocattle AT choincheol acomparisonbetweenhicand10xgenomicslinkedreadsequencingforwholegenomephasinginhanwoocattle AT parkwoncheoul acomparisonbetweenhicand10xgenomicslinkedreadsequencingforwholegenomephasinginhanwoocattle AT srikanthkrishnamoorthy comparisonbetweenhicand10xgenomicslinkedreadsequencingforwholegenomephasinginhanwoocattle AT parkjongeun comparisonbetweenhicand10xgenomicslinkedreadsequencingforwholegenomephasinginhanwoocattle AT limdajeong comparisonbetweenhicand10xgenomicslinkedreadsequencingforwholegenomephasinginhanwoocattle AT chajihye comparisonbetweenhicand10xgenomicslinkedreadsequencingforwholegenomephasinginhanwoocattle AT chosangrae comparisonbetweenhicand10xgenomicslinkedreadsequencingforwholegenomephasinginhanwoocattle AT choincheol comparisonbetweenhicand10xgenomicslinkedreadsequencingforwholegenomephasinginhanwoocattle AT parkwoncheoul comparisonbetweenhicand10xgenomicslinkedreadsequencingforwholegenomephasinginhanwoocattle |