Cargando…
Comparison of multiple algorithms to reliably detect structural variants in pears
BACKGROUND: Structural variations (SVs) have been reported to play an important role in genetic diversity and trait regulation. Many computer algorithms detecting SVs have recently been developed, but the use of multiple algorithms to detect high-confidence SVs has not been studied. The most suitabl...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6972009/ https://www.ncbi.nlm.nih.gov/pubmed/31959124 http://dx.doi.org/10.1186/s12864-020-6455-x |
_version_ | 1783489833856925696 |
---|---|
author | Liu, Yueyuan Zhang, Mingyue Sun, Jieying Chang, Wenjing Sun, Manyi Zhang, Shaoling Wu, Jun |
author_facet | Liu, Yueyuan Zhang, Mingyue Sun, Jieying Chang, Wenjing Sun, Manyi Zhang, Shaoling Wu, Jun |
author_sort | Liu, Yueyuan |
collection | PubMed |
description | BACKGROUND: Structural variations (SVs) have been reported to play an important role in genetic diversity and trait regulation. Many computer algorithms detecting SVs have recently been developed, but the use of multiple algorithms to detect high-confidence SVs has not been studied. The most suitable sequencing depth for detecting SVs in pear is also not known. RESULTS: In this study, a pipeline to detect SVs using next-generation and long-read sequencing data was constructed. The performances of seven types of SV detection software using next-generation sequencing (NGS) data and two types of software using long-read sequencing data (SVIM and Sniffles), which are based on different algorithms, were compared. Of the nine software packages evaluated, SVIM identified the most SVs, and Sniffles detected SVs with the highest accuracy (> 90%). When the results from multiple SV detection tools were combined, the SVs identified by both MetaSV and IMR/DENOM, which use NGS data, were more accurate than those identified by both SVIM and Sniffles, with mean accuracies of 98.7 and 96.5%, respectively. The software packages using long-read sequencing data required fewer CPU cores and less memory and ran faster than those using NGS data. In addition, according to the performances of assembly-based algorithms using NGS data, we found that a sequencing depth of 50× is appropriate for detecting SVs in the pear genome. CONCLUSION: This study provides strong evidence that more than one SV detection software package, each based on a different algorithm, should be used to detect SVs with higher confidence, and that long-read sequencing data are better than NGS data for SV detection. The SV detection pipeline that we have established will facilitate the study of diversity in other crops. |
format | Online Article Text |
id | pubmed-6972009 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-69720092020-01-27 Comparison of multiple algorithms to reliably detect structural variants in pears Liu, Yueyuan Zhang, Mingyue Sun, Jieying Chang, Wenjing Sun, Manyi Zhang, Shaoling Wu, Jun BMC Genomics Research Article BACKGROUND: Structural variations (SVs) have been reported to play an important role in genetic diversity and trait regulation. Many computer algorithms detecting SVs have recently been developed, but the use of multiple algorithms to detect high-confidence SVs has not been studied. The most suitable sequencing depth for detecting SVs in pear is also not known. RESULTS: In this study, a pipeline to detect SVs using next-generation and long-read sequencing data was constructed. The performances of seven types of SV detection software using next-generation sequencing (NGS) data and two types of software using long-read sequencing data (SVIM and Sniffles), which are based on different algorithms, were compared. Of the nine software packages evaluated, SVIM identified the most SVs, and Sniffles detected SVs with the highest accuracy (> 90%). When the results from multiple SV detection tools were combined, the SVs identified by both MetaSV and IMR/DENOM, which use NGS data, were more accurate than those identified by both SVIM and Sniffles, with mean accuracies of 98.7 and 96.5%, respectively. The software packages using long-read sequencing data required fewer CPU cores and less memory and ran faster than those using NGS data. In addition, according to the performances of assembly-based algorithms using NGS data, we found that a sequencing depth of 50× is appropriate for detecting SVs in the pear genome. CONCLUSION: This study provides strong evidence that more than one SV detection software package, each based on a different algorithm, should be used to detect SVs with higher confidence, and that long-read sequencing data are better than NGS data for SV detection. The SV detection pipeline that we have established will facilitate the study of diversity in other crops. BioMed Central 2020-01-20 /pmc/articles/PMC6972009/ /pubmed/31959124 http://dx.doi.org/10.1186/s12864-020-6455-x Text en © The Author(s). 2020 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Liu, Yueyuan Zhang, Mingyue Sun, Jieying Chang, Wenjing Sun, Manyi Zhang, Shaoling Wu, Jun Comparison of multiple algorithms to reliably detect structural variants in pears |
title | Comparison of multiple algorithms to reliably detect structural variants in pears |
title_full | Comparison of multiple algorithms to reliably detect structural variants in pears |
title_fullStr | Comparison of multiple algorithms to reliably detect structural variants in pears |
title_full_unstemmed | Comparison of multiple algorithms to reliably detect structural variants in pears |
title_short | Comparison of multiple algorithms to reliably detect structural variants in pears |
title_sort | comparison of multiple algorithms to reliably detect structural variants in pears |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6972009/ https://www.ncbi.nlm.nih.gov/pubmed/31959124 http://dx.doi.org/10.1186/s12864-020-6455-x |
work_keys_str_mv | AT liuyueyuan comparisonofmultiplealgorithmstoreliablydetectstructuralvariantsinpears AT zhangmingyue comparisonofmultiplealgorithmstoreliablydetectstructuralvariantsinpears AT sunjieying comparisonofmultiplealgorithmstoreliablydetectstructuralvariantsinpears AT changwenjing comparisonofmultiplealgorithmstoreliablydetectstructuralvariantsinpears AT sunmanyi comparisonofmultiplealgorithmstoreliablydetectstructuralvariantsinpears AT zhangshaoling comparisonofmultiplealgorithmstoreliablydetectstructuralvariantsinpears AT wujun comparisonofmultiplealgorithmstoreliablydetectstructuralvariantsinpears |