Cargando…

Evaluation of Germline Structural Variant Calling Methods for Nanopore Sequencing Data

Structural variants (SVs) are genomic rearrangements that involve at least 50 nucleotides and are known to have a serious impact on human health. While prior short-read sequencing technologies have often proved inadequate for a comprehensive assessment of structural variation, more recent long reads...

Descripción completa

Detalles Bibliográficos
Autores principales: Bolognini , Davide, Magi , Alberto
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8637281/
https://www.ncbi.nlm.nih.gov/pubmed/34868242
http://dx.doi.org/10.3389/fgene.2021.761791
_version_ 1784608709845975040
author Bolognini , Davide
Magi , Alberto
author_facet Bolognini , Davide
Magi , Alberto
author_sort Bolognini , Davide
collection PubMed
description Structural variants (SVs) are genomic rearrangements that involve at least 50 nucleotides and are known to have a serious impact on human health. While prior short-read sequencing technologies have often proved inadequate for a comprehensive assessment of structural variation, more recent long reads from Oxford Nanopore Technologies have already been proven invaluable for the discovery of large SVs and hold the potential to facilitate the resolution of the full SV spectrum. With many long-read sequencing studies to follow, it is crucial to assess factors affecting current SV calling pipelines for nanopore sequencing data. In this brief research report, we evaluate and compare the performances of five long-read SV callers across four long-read aligners using both real and synthetic nanopore datasets. In particular, we focus on the effects of read alignment, sequencing coverage, and variant allele depth on the detection and genotyping of SVs of different types and size ranges and provide insights into precision and recall of SV callsets generated by integrating the various long-read aligners and SV callers. The computational pipeline we propose is publicly available at https://github.com/davidebolo1993/EViNCe and can be adjusted to further evaluate future nanopore sequencing datasets.
format Online
Article
Text
id pubmed-8637281
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-86372812021-12-03 Evaluation of Germline Structural Variant Calling Methods for Nanopore Sequencing Data Bolognini , Davide Magi , Alberto Front Genet Genetics Structural variants (SVs) are genomic rearrangements that involve at least 50 nucleotides and are known to have a serious impact on human health. While prior short-read sequencing technologies have often proved inadequate for a comprehensive assessment of structural variation, more recent long reads from Oxford Nanopore Technologies have already been proven invaluable for the discovery of large SVs and hold the potential to facilitate the resolution of the full SV spectrum. With many long-read sequencing studies to follow, it is crucial to assess factors affecting current SV calling pipelines for nanopore sequencing data. In this brief research report, we evaluate and compare the performances of five long-read SV callers across four long-read aligners using both real and synthetic nanopore datasets. In particular, we focus on the effects of read alignment, sequencing coverage, and variant allele depth on the detection and genotyping of SVs of different types and size ranges and provide insights into precision and recall of SV callsets generated by integrating the various long-read aligners and SV callers. The computational pipeline we propose is publicly available at https://github.com/davidebolo1993/EViNCe and can be adjusted to further evaluate future nanopore sequencing datasets. Frontiers Media S.A. 2021-11-18 /pmc/articles/PMC8637281/ /pubmed/34868242 http://dx.doi.org/10.3389/fgene.2021.761791 Text en Copyright © 2021 Bolognini  and Magi . https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Bolognini , Davide
Magi , Alberto
Evaluation of Germline Structural Variant Calling Methods for Nanopore Sequencing Data
title Evaluation of Germline Structural Variant Calling Methods for Nanopore Sequencing Data
title_full Evaluation of Germline Structural Variant Calling Methods for Nanopore Sequencing Data
title_fullStr Evaluation of Germline Structural Variant Calling Methods for Nanopore Sequencing Data
title_full_unstemmed Evaluation of Germline Structural Variant Calling Methods for Nanopore Sequencing Data
title_short Evaluation of Germline Structural Variant Calling Methods for Nanopore Sequencing Data
title_sort evaluation of germline structural variant calling methods for nanopore sequencing data
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8637281/
https://www.ncbi.nlm.nih.gov/pubmed/34868242
http://dx.doi.org/10.3389/fgene.2021.761791
work_keys_str_mv AT bologninidavide evaluationofgermlinestructuralvariantcallingmethodsfornanoporesequencingdata
AT magialberto evaluationofgermlinestructuralvariantcallingmethodsfornanoporesequencingdata