Cargando…

Polypolish: Short-read polishing of long-read bacterial genome assemblies

Long-read-only bacterial genome assemblies usually contain residual errors, most commonly homopolymer-length errors. Short-read polishing tools can use short reads to fix these errors, but most rely on short-read alignment which is unreliable in repeat regions. Errors in such regions are therefore c...

Descripción completa

Detalles Bibliográficos
Autores principales: Wick, Ryan R., Holt, Kathryn E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8812927/
https://www.ncbi.nlm.nih.gov/pubmed/35073327
http://dx.doi.org/10.1371/journal.pcbi.1009802
_version_ 1784644761003491328
author Wick, Ryan R.
Holt, Kathryn E.
author_facet Wick, Ryan R.
Holt, Kathryn E.
author_sort Wick, Ryan R.
collection PubMed
description Long-read-only bacterial genome assemblies usually contain residual errors, most commonly homopolymer-length errors. Short-read polishing tools can use short reads to fix these errors, but most rely on short-read alignment which is unreliable in repeat regions. Errors in such regions are therefore challenging to fix and often remain after short-read polishing. Here we introduce Polypolish, a new short-read polisher which uses all-per-read alignments to repair errors in repeat sequences that other polishers cannot. Polypolish performed well in benchmarking tests using both simulated and real reads, and it almost never introduced errors during polishing. The best results were achieved by using Polypolish in combination with other short-read polishers.
format Online
Article
Text
id pubmed-8812927
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-88129272022-02-04 Polypolish: Short-read polishing of long-read bacterial genome assemblies Wick, Ryan R. Holt, Kathryn E. PLoS Comput Biol Research Article Long-read-only bacterial genome assemblies usually contain residual errors, most commonly homopolymer-length errors. Short-read polishing tools can use short reads to fix these errors, but most rely on short-read alignment which is unreliable in repeat regions. Errors in such regions are therefore challenging to fix and often remain after short-read polishing. Here we introduce Polypolish, a new short-read polisher which uses all-per-read alignments to repair errors in repeat sequences that other polishers cannot. Polypolish performed well in benchmarking tests using both simulated and real reads, and it almost never introduced errors during polishing. The best results were achieved by using Polypolish in combination with other short-read polishers. Public Library of Science 2022-01-24 /pmc/articles/PMC8812927/ /pubmed/35073327 http://dx.doi.org/10.1371/journal.pcbi.1009802 Text en © 2022 Wick, Holt https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Wick, Ryan R.
Holt, Kathryn E.
Polypolish: Short-read polishing of long-read bacterial genome assemblies
title Polypolish: Short-read polishing of long-read bacterial genome assemblies
title_full Polypolish: Short-read polishing of long-read bacterial genome assemblies
title_fullStr Polypolish: Short-read polishing of long-read bacterial genome assemblies
title_full_unstemmed Polypolish: Short-read polishing of long-read bacterial genome assemblies
title_short Polypolish: Short-read polishing of long-read bacterial genome assemblies
title_sort polypolish: short-read polishing of long-read bacterial genome assemblies
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8812927/
https://www.ncbi.nlm.nih.gov/pubmed/35073327
http://dx.doi.org/10.1371/journal.pcbi.1009802
work_keys_str_mv AT wickryanr polypolishshortreadpolishingoflongreadbacterialgenomeassemblies
AT holtkathryne polypolishshortreadpolishingoflongreadbacterialgenomeassemblies