Cargando…

Initial Analysis of Structural Variation Detections in Cattle Using Long-Read Sequencing Methods

Structural variations (SVs), as a great source of genetic variation, are widely distributed in the genome. SVs involve longer genomic sequences and potentially have stronger effects than SNPs, but they are not well captured by short-read sequencing owing to their size and relevance to repeats. Impro...

Descripción completa

Detalles Bibliográficos
Autores principales: Gao, Yahui, Ma, Li, Liu, George E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9142105/
https://www.ncbi.nlm.nih.gov/pubmed/35627213
http://dx.doi.org/10.3390/genes13050828
_version_ 1784715501549649920
author Gao, Yahui
Ma, Li
Liu, George E.
author_facet Gao, Yahui
Ma, Li
Liu, George E.
author_sort Gao, Yahui
collection PubMed
description Structural variations (SVs), as a great source of genetic variation, are widely distributed in the genome. SVs involve longer genomic sequences and potentially have stronger effects than SNPs, but they are not well captured by short-read sequencing owing to their size and relevance to repeats. Improved characterization of SVs can provide more advanced insight into complex traits. With the availability of long-read sequencing, it has become feasible to uncover the full range of SVs. Here, we sequenced one cattle individual using 10× Genomics (10 × G) linked read, Pacific Biosciences (PacBio) continuous long reads (CLR) and circular consensus sequencing (CCS), as well as Oxford Nanopore Technologies (ONT) PromethION. We evaluated the ability of various methods for SV detection. We identified 21,164 SVs, which amount to 186 Mb covering 7.07% of the whole genome. The number of SVs inferred from long-read-based inferences was greater than that from short reads. The PacBio CLR identified the most of large SVs and covered the most genomes. SVs called with PacBio CCS and ONT data showed high uniformity. The one with the most overlap with the results obtained by short-read data was PB CCS. Together, we found that long reads outperformed short reads in terms of SV detections.
format Online
Article
Text
id pubmed-9142105
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-91421052022-05-28 Initial Analysis of Structural Variation Detections in Cattle Using Long-Read Sequencing Methods Gao, Yahui Ma, Li Liu, George E. Genes (Basel) Article Structural variations (SVs), as a great source of genetic variation, are widely distributed in the genome. SVs involve longer genomic sequences and potentially have stronger effects than SNPs, but they are not well captured by short-read sequencing owing to their size and relevance to repeats. Improved characterization of SVs can provide more advanced insight into complex traits. With the availability of long-read sequencing, it has become feasible to uncover the full range of SVs. Here, we sequenced one cattle individual using 10× Genomics (10 × G) linked read, Pacific Biosciences (PacBio) continuous long reads (CLR) and circular consensus sequencing (CCS), as well as Oxford Nanopore Technologies (ONT) PromethION. We evaluated the ability of various methods for SV detection. We identified 21,164 SVs, which amount to 186 Mb covering 7.07% of the whole genome. The number of SVs inferred from long-read-based inferences was greater than that from short reads. The PacBio CLR identified the most of large SVs and covered the most genomes. SVs called with PacBio CCS and ONT data showed high uniformity. The one with the most overlap with the results obtained by short-read data was PB CCS. Together, we found that long reads outperformed short reads in terms of SV detections. MDPI 2022-05-06 /pmc/articles/PMC9142105/ /pubmed/35627213 http://dx.doi.org/10.3390/genes13050828 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Gao, Yahui
Ma, Li
Liu, George E.
Initial Analysis of Structural Variation Detections in Cattle Using Long-Read Sequencing Methods
title Initial Analysis of Structural Variation Detections in Cattle Using Long-Read Sequencing Methods
title_full Initial Analysis of Structural Variation Detections in Cattle Using Long-Read Sequencing Methods
title_fullStr Initial Analysis of Structural Variation Detections in Cattle Using Long-Read Sequencing Methods
title_full_unstemmed Initial Analysis of Structural Variation Detections in Cattle Using Long-Read Sequencing Methods
title_short Initial Analysis of Structural Variation Detections in Cattle Using Long-Read Sequencing Methods
title_sort initial analysis of structural variation detections in cattle using long-read sequencing methods
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9142105/
https://www.ncbi.nlm.nih.gov/pubmed/35627213
http://dx.doi.org/10.3390/genes13050828
work_keys_str_mv AT gaoyahui initialanalysisofstructuralvariationdetectionsincattleusinglongreadsequencingmethods
AT mali initialanalysisofstructuralvariationdetectionsincattleusinglongreadsequencingmethods
AT liugeorgee initialanalysisofstructuralvariationdetectionsincattleusinglongreadsequencingmethods