Cargando…

Rapid multi-locus sequence typing direct from uncorrected long reads using Krocus

Genome sequencing is rapidly being adopted in reference labs and hospitals for bacterial outbreak investigation and diagnostics where time is critical. Seven gene multi-locus sequence typing is a standard tool for broadly classifying samples into sequence types (STs), allowing, in many cases, to rul...

Descripción completa

Detalles Bibliográficos
Autores principales:	Page, Andrew J., Keane, Jacqueline A.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	PeerJ Inc. 2018
Materias:	Bioinformatics
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6074768/ https://www.ncbi.nlm.nih.gov/pubmed/30083440 http://dx.doi.org/10.7717/peerj.5233

_version_	1783344483498196992
author	Page, Andrew J. Keane, Jacqueline A.
author_facet	Page, Andrew J. Keane, Jacqueline A.
author_sort	Page, Andrew J.
collection	PubMed
description	Genome sequencing is rapidly being adopted in reference labs and hospitals for bacterial outbreak investigation and diagnostics where time is critical. Seven gene multi-locus sequence typing is a standard tool for broadly classifying samples into sequence types (STs), allowing, in many cases, to rule a sample out of an outbreak, or allowing for general characteristics about a bacterial strain to be inferred. Long-read sequencing technologies, such as from Oxford Nanopore, can produce read data within minutes of an experiment starting, unlike short-read sequencing technologies which require many hours/days. However, the error rates of raw uncorrected long read data are very high. We present Krocus which can predict a ST directly from uncorrected long reads, and which was designed to consume read data as it is produced, providing results in minutes. It is the only tool which can do this from uncorrected long reads. We tested Krocus on over 700 isolates sequenced using long-read sequencing technologies from Pacific Biosciences and Oxford Nanopore. It provides STs for isolates on average within 90 s, with a sensitivity of 94% and specificity of 97% on real sample data, directly from uncorrected raw sequence reads. The software is written in Python and is available under the open source license GNU GPL version 3.
format	Online Article Text
id	pubmed-6074768
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	PeerJ Inc.
record_format	MEDLINE/PubMed
spelling	pubmed-60747682018-08-06 Rapid multi-locus sequence typing direct from uncorrected long reads using Krocus Page, Andrew J. Keane, Jacqueline A. PeerJ Bioinformatics Genome sequencing is rapidly being adopted in reference labs and hospitals for bacterial outbreak investigation and diagnostics where time is critical. Seven gene multi-locus sequence typing is a standard tool for broadly classifying samples into sequence types (STs), allowing, in many cases, to rule a sample out of an outbreak, or allowing for general characteristics about a bacterial strain to be inferred. Long-read sequencing technologies, such as from Oxford Nanopore, can produce read data within minutes of an experiment starting, unlike short-read sequencing technologies which require many hours/days. However, the error rates of raw uncorrected long read data are very high. We present Krocus which can predict a ST directly from uncorrected long reads, and which was designed to consume read data as it is produced, providing results in minutes. It is the only tool which can do this from uncorrected long reads. We tested Krocus on over 700 isolates sequenced using long-read sequencing technologies from Pacific Biosciences and Oxford Nanopore. It provides STs for isolates on average within 90 s, with a sensitivity of 94% and specificity of 97% on real sample data, directly from uncorrected raw sequence reads. The software is written in Python and is available under the open source license GNU GPL version 3. PeerJ Inc. 2018-07-31 /pmc/articles/PMC6074768/ /pubmed/30083440 http://dx.doi.org/10.7717/peerj.5233 Text en © 2018 Page and Keane http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle	Bioinformatics Page, Andrew J. Keane, Jacqueline A. Rapid multi-locus sequence typing direct from uncorrected long reads using Krocus
title	Rapid multi-locus sequence typing direct from uncorrected long reads using Krocus
title_full	Rapid multi-locus sequence typing direct from uncorrected long reads using Krocus
title_fullStr	Rapid multi-locus sequence typing direct from uncorrected long reads using Krocus
title_full_unstemmed	Rapid multi-locus sequence typing direct from uncorrected long reads using Krocus
title_short	Rapid multi-locus sequence typing direct from uncorrected long reads using Krocus
title_sort	rapid multi-locus sequence typing direct from uncorrected long reads using krocus
topic	Bioinformatics
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6074768/ https://www.ncbi.nlm.nih.gov/pubmed/30083440 http://dx.doi.org/10.7717/peerj.5233
work_keys_str_mv	AT pageandrewj rapidmultilocussequencetypingdirectfromuncorrectedlongreadsusingkrocus AT keanejacquelinea rapidmultilocussequencetypingdirectfromuncorrectedlongreadsusingkrocus

Rapid multi-locus sequence typing direct from uncorrected long reads using Krocus

Ejemplares similares