Cargando…
Evaluation of Germline Structural Variant Calling Methods for Nanopore Sequencing Data
Structural variants (SVs) are genomic rearrangements that involve at least 50 nucleotides and are known to have a serious impact on human health. While prior short-read sequencing technologies have often proved inadequate for a comprehensive assessment of structural variation, more recent long reads...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8637281/ https://www.ncbi.nlm.nih.gov/pubmed/34868242 http://dx.doi.org/10.3389/fgene.2021.761791 |
_version_ | 1784608709845975040 |
---|---|
author | Bolognini , Davide Magi , Alberto |
author_facet | Bolognini , Davide Magi , Alberto |
author_sort | Bolognini , Davide |
collection | PubMed |
description | Structural variants (SVs) are genomic rearrangements that involve at least 50 nucleotides and are known to have a serious impact on human health. While prior short-read sequencing technologies have often proved inadequate for a comprehensive assessment of structural variation, more recent long reads from Oxford Nanopore Technologies have already been proven invaluable for the discovery of large SVs and hold the potential to facilitate the resolution of the full SV spectrum. With many long-read sequencing studies to follow, it is crucial to assess factors affecting current SV calling pipelines for nanopore sequencing data. In this brief research report, we evaluate and compare the performances of five long-read SV callers across four long-read aligners using both real and synthetic nanopore datasets. In particular, we focus on the effects of read alignment, sequencing coverage, and variant allele depth on the detection and genotyping of SVs of different types and size ranges and provide insights into precision and recall of SV callsets generated by integrating the various long-read aligners and SV callers. The computational pipeline we propose is publicly available at https://github.com/davidebolo1993/EViNCe and can be adjusted to further evaluate future nanopore sequencing datasets. |
format | Online Article Text |
id | pubmed-8637281 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-86372812021-12-03 Evaluation of Germline Structural Variant Calling Methods for Nanopore Sequencing Data Bolognini , Davide Magi , Alberto Front Genet Genetics Structural variants (SVs) are genomic rearrangements that involve at least 50 nucleotides and are known to have a serious impact on human health. While prior short-read sequencing technologies have often proved inadequate for a comprehensive assessment of structural variation, more recent long reads from Oxford Nanopore Technologies have already been proven invaluable for the discovery of large SVs and hold the potential to facilitate the resolution of the full SV spectrum. With many long-read sequencing studies to follow, it is crucial to assess factors affecting current SV calling pipelines for nanopore sequencing data. In this brief research report, we evaluate and compare the performances of five long-read SV callers across four long-read aligners using both real and synthetic nanopore datasets. In particular, we focus on the effects of read alignment, sequencing coverage, and variant allele depth on the detection and genotyping of SVs of different types and size ranges and provide insights into precision and recall of SV callsets generated by integrating the various long-read aligners and SV callers. The computational pipeline we propose is publicly available at https://github.com/davidebolo1993/EViNCe and can be adjusted to further evaluate future nanopore sequencing datasets. Frontiers Media S.A. 2021-11-18 /pmc/articles/PMC8637281/ /pubmed/34868242 http://dx.doi.org/10.3389/fgene.2021.761791 Text en Copyright © 2021 Bolognini and Magi . https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Bolognini , Davide Magi , Alberto Evaluation of Germline Structural Variant Calling Methods for Nanopore Sequencing Data |
title | Evaluation of Germline Structural Variant Calling Methods for Nanopore Sequencing Data |
title_full | Evaluation of Germline Structural Variant Calling Methods for Nanopore Sequencing Data |
title_fullStr | Evaluation of Germline Structural Variant Calling Methods for Nanopore Sequencing Data |
title_full_unstemmed | Evaluation of Germline Structural Variant Calling Methods for Nanopore Sequencing Data |
title_short | Evaluation of Germline Structural Variant Calling Methods for Nanopore Sequencing Data |
title_sort | evaluation of germline structural variant calling methods for nanopore sequencing data |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8637281/ https://www.ncbi.nlm.nih.gov/pubmed/34868242 http://dx.doi.org/10.3389/fgene.2021.761791 |
work_keys_str_mv | AT bologninidavide evaluationofgermlinestructuralvariantcallingmethodsfornanoporesequencingdata AT magialberto evaluationofgermlinestructuralvariantcallingmethodsfornanoporesequencingdata |