Cargando…

Comparison of multiple algorithms to reliably detect structural variants in pears

BACKGROUND: Structural variations (SVs) have been reported to play an important role in genetic diversity and trait regulation. Many computer algorithms detecting SVs have recently been developed, but the use of multiple algorithms to detect high-confidence SVs has not been studied. The most suitabl...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Yueyuan, Zhang, Mingyue, Sun, Jieying, Chang, Wenjing, Sun, Manyi, Zhang, Shaoling, Wu, Jun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6972009/
https://www.ncbi.nlm.nih.gov/pubmed/31959124
http://dx.doi.org/10.1186/s12864-020-6455-x
_version_ 1783489833856925696
author Liu, Yueyuan
Zhang, Mingyue
Sun, Jieying
Chang, Wenjing
Sun, Manyi
Zhang, Shaoling
Wu, Jun
author_facet Liu, Yueyuan
Zhang, Mingyue
Sun, Jieying
Chang, Wenjing
Sun, Manyi
Zhang, Shaoling
Wu, Jun
author_sort Liu, Yueyuan
collection PubMed
description BACKGROUND: Structural variations (SVs) have been reported to play an important role in genetic diversity and trait regulation. Many computer algorithms detecting SVs have recently been developed, but the use of multiple algorithms to detect high-confidence SVs has not been studied. The most suitable sequencing depth for detecting SVs in pear is also not known. RESULTS: In this study, a pipeline to detect SVs using next-generation and long-read sequencing data was constructed. The performances of seven types of SV detection software using next-generation sequencing (NGS) data and two types of software using long-read sequencing data (SVIM and Sniffles), which are based on different algorithms, were compared. Of the nine software packages evaluated, SVIM identified the most SVs, and Sniffles detected SVs with the highest accuracy (> 90%). When the results from multiple SV detection tools were combined, the SVs identified by both MetaSV and IMR/DENOM, which use NGS data, were more accurate than those identified by both SVIM and Sniffles, with mean accuracies of 98.7 and 96.5%, respectively. The software packages using long-read sequencing data required fewer CPU cores and less memory and ran faster than those using NGS data. In addition, according to the performances of assembly-based algorithms using NGS data, we found that a sequencing depth of 50× is appropriate for detecting SVs in the pear genome. CONCLUSION: This study provides strong evidence that more than one SV detection software package, each based on a different algorithm, should be used to detect SVs with higher confidence, and that long-read sequencing data are better than NGS data for SV detection. The SV detection pipeline that we have established will facilitate the study of diversity in other crops.
format Online
Article
Text
id pubmed-6972009
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-69720092020-01-27 Comparison of multiple algorithms to reliably detect structural variants in pears Liu, Yueyuan Zhang, Mingyue Sun, Jieying Chang, Wenjing Sun, Manyi Zhang, Shaoling Wu, Jun BMC Genomics Research Article BACKGROUND: Structural variations (SVs) have been reported to play an important role in genetic diversity and trait regulation. Many computer algorithms detecting SVs have recently been developed, but the use of multiple algorithms to detect high-confidence SVs has not been studied. The most suitable sequencing depth for detecting SVs in pear is also not known. RESULTS: In this study, a pipeline to detect SVs using next-generation and long-read sequencing data was constructed. The performances of seven types of SV detection software using next-generation sequencing (NGS) data and two types of software using long-read sequencing data (SVIM and Sniffles), which are based on different algorithms, were compared. Of the nine software packages evaluated, SVIM identified the most SVs, and Sniffles detected SVs with the highest accuracy (> 90%). When the results from multiple SV detection tools were combined, the SVs identified by both MetaSV and IMR/DENOM, which use NGS data, were more accurate than those identified by both SVIM and Sniffles, with mean accuracies of 98.7 and 96.5%, respectively. The software packages using long-read sequencing data required fewer CPU cores and less memory and ran faster than those using NGS data. In addition, according to the performances of assembly-based algorithms using NGS data, we found that a sequencing depth of 50× is appropriate for detecting SVs in the pear genome. CONCLUSION: This study provides strong evidence that more than one SV detection software package, each based on a different algorithm, should be used to detect SVs with higher confidence, and that long-read sequencing data are better than NGS data for SV detection. The SV detection pipeline that we have established will facilitate the study of diversity in other crops. BioMed Central 2020-01-20 /pmc/articles/PMC6972009/ /pubmed/31959124 http://dx.doi.org/10.1186/s12864-020-6455-x Text en © The Author(s). 2020 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Liu, Yueyuan
Zhang, Mingyue
Sun, Jieying
Chang, Wenjing
Sun, Manyi
Zhang, Shaoling
Wu, Jun
Comparison of multiple algorithms to reliably detect structural variants in pears
title Comparison of multiple algorithms to reliably detect structural variants in pears
title_full Comparison of multiple algorithms to reliably detect structural variants in pears
title_fullStr Comparison of multiple algorithms to reliably detect structural variants in pears
title_full_unstemmed Comparison of multiple algorithms to reliably detect structural variants in pears
title_short Comparison of multiple algorithms to reliably detect structural variants in pears
title_sort comparison of multiple algorithms to reliably detect structural variants in pears
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6972009/
https://www.ncbi.nlm.nih.gov/pubmed/31959124
http://dx.doi.org/10.1186/s12864-020-6455-x
work_keys_str_mv AT liuyueyuan comparisonofmultiplealgorithmstoreliablydetectstructuralvariantsinpears
AT zhangmingyue comparisonofmultiplealgorithmstoreliablydetectstructuralvariantsinpears
AT sunjieying comparisonofmultiplealgorithmstoreliablydetectstructuralvariantsinpears
AT changwenjing comparisonofmultiplealgorithmstoreliablydetectstructuralvariantsinpears
AT sunmanyi comparisonofmultiplealgorithmstoreliablydetectstructuralvariantsinpears
AT zhangshaoling comparisonofmultiplealgorithmstoreliablydetectstructuralvariantsinpears
AT wujun comparisonofmultiplealgorithmstoreliablydetectstructuralvariantsinpears