Cargando…

Comparison of error correction algorithms for Ion Torrent PGM data: application to hepatitis B virus

Ion Torrent Personal Genome Machine (PGM) technology is a mid-length read, low-cost and high-speed next-generation sequencing platform with a relatively high insertion and deletion (indel) error rate. A full systematic assessment of the effectiveness of various error correction algorithms in PGM vir...

Descripción completa

Detalles Bibliográficos
Autores principales: Song, Liting, Huang, Wenxun, Kang, Juan, Huang, Yuan, Ren, Hong, Ding, Keyue
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5556038/
https://www.ncbi.nlm.nih.gov/pubmed/28808243
http://dx.doi.org/10.1038/s41598-017-08139-y
_version_ 1783256990583095296
author Song, Liting
Huang, Wenxun
Kang, Juan
Huang, Yuan
Ren, Hong
Ding, Keyue
author_facet Song, Liting
Huang, Wenxun
Kang, Juan
Huang, Yuan
Ren, Hong
Ding, Keyue
author_sort Song, Liting
collection PubMed
description Ion Torrent Personal Genome Machine (PGM) technology is a mid-length read, low-cost and high-speed next-generation sequencing platform with a relatively high insertion and deletion (indel) error rate. A full systematic assessment of the effectiveness of various error correction algorithms in PGM viral datasets (e.g., hepatitis B virus (HBV)) has not been performed. We examined 19 quality-trimmed PGM datasets for the HBV reverse transcriptase (RT) region and found a total error rate of 0.48% ± 0.12%. Deletion errors were clearly present at the ends of homopolymer runs. Tests using both real and simulated data showed that the algorithms differed in their abilities to detect and correct errors and that the error rate and sequencing depth significantly affected the performance. Of the algorithms tested, Pollux showed a better overall performance but tended to over-correct ‘genuine’ substitution variants, whereas Fiona proved to be better at distinguishing these variants from sequencing errors. We found that the combined use of Pollux and Fiona gave the best results when error-correcting Ion Torrent PGM viral data.
format Online
Article
Text
id pubmed-5556038
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-55560382017-08-16 Comparison of error correction algorithms for Ion Torrent PGM data: application to hepatitis B virus Song, Liting Huang, Wenxun Kang, Juan Huang, Yuan Ren, Hong Ding, Keyue Sci Rep Article Ion Torrent Personal Genome Machine (PGM) technology is a mid-length read, low-cost and high-speed next-generation sequencing platform with a relatively high insertion and deletion (indel) error rate. A full systematic assessment of the effectiveness of various error correction algorithms in PGM viral datasets (e.g., hepatitis B virus (HBV)) has not been performed. We examined 19 quality-trimmed PGM datasets for the HBV reverse transcriptase (RT) region and found a total error rate of 0.48% ± 0.12%. Deletion errors were clearly present at the ends of homopolymer runs. Tests using both real and simulated data showed that the algorithms differed in their abilities to detect and correct errors and that the error rate and sequencing depth significantly affected the performance. Of the algorithms tested, Pollux showed a better overall performance but tended to over-correct ‘genuine’ substitution variants, whereas Fiona proved to be better at distinguishing these variants from sequencing errors. We found that the combined use of Pollux and Fiona gave the best results when error-correcting Ion Torrent PGM viral data. Nature Publishing Group UK 2017-08-14 /pmc/articles/PMC5556038/ /pubmed/28808243 http://dx.doi.org/10.1038/s41598-017-08139-y Text en © The Author(s) 2017 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Song, Liting
Huang, Wenxun
Kang, Juan
Huang, Yuan
Ren, Hong
Ding, Keyue
Comparison of error correction algorithms for Ion Torrent PGM data: application to hepatitis B virus
title Comparison of error correction algorithms for Ion Torrent PGM data: application to hepatitis B virus
title_full Comparison of error correction algorithms for Ion Torrent PGM data: application to hepatitis B virus
title_fullStr Comparison of error correction algorithms for Ion Torrent PGM data: application to hepatitis B virus
title_full_unstemmed Comparison of error correction algorithms for Ion Torrent PGM data: application to hepatitis B virus
title_short Comparison of error correction algorithms for Ion Torrent PGM data: application to hepatitis B virus
title_sort comparison of error correction algorithms for ion torrent pgm data: application to hepatitis b virus
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5556038/
https://www.ncbi.nlm.nih.gov/pubmed/28808243
http://dx.doi.org/10.1038/s41598-017-08139-y
work_keys_str_mv AT songliting comparisonoferrorcorrectionalgorithmsforiontorrentpgmdataapplicationtohepatitisbvirus
AT huangwenxun comparisonoferrorcorrectionalgorithmsforiontorrentpgmdataapplicationtohepatitisbvirus
AT kangjuan comparisonoferrorcorrectionalgorithmsforiontorrentpgmdataapplicationtohepatitisbvirus
AT huangyuan comparisonoferrorcorrectionalgorithmsforiontorrentpgmdataapplicationtohepatitisbvirus
AT renhong comparisonoferrorcorrectionalgorithmsforiontorrentpgmdataapplicationtohepatitisbvirus
AT dingkeyue comparisonoferrorcorrectionalgorithmsforiontorrentpgmdataapplicationtohepatitisbvirus