Cargando…

CarrierSeq: a sequence analysis workflow for low-input nanopore sequencing

BACKGROUND: Long-read nanopore sequencing technology is of particular significance for taxonomic identification at or below the species level. For many environmental samples, the total extractable DNA is far below the current input requirements of nanopore sequencing, preventing “sample to sequence”...

Descripción completa

Detalles Bibliográficos
Autores principales: Mojarro, Angel, Hachey, Julie, Ruvkun, Gary, Zuber, Maria T., Carr, Christopher E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5872496/
https://www.ncbi.nlm.nih.gov/pubmed/29587645
http://dx.doi.org/10.1186/s12859-018-2124-3
_version_ 1783309848736169984
author Mojarro, Angel
Hachey, Julie
Ruvkun, Gary
Zuber, Maria T.
Carr, Christopher E.
author_facet Mojarro, Angel
Hachey, Julie
Ruvkun, Gary
Zuber, Maria T.
Carr, Christopher E.
author_sort Mojarro, Angel
collection PubMed
description BACKGROUND: Long-read nanopore sequencing technology is of particular significance for taxonomic identification at or below the species level. For many environmental samples, the total extractable DNA is far below the current input requirements of nanopore sequencing, preventing “sample to sequence” metagenomics from low-biomass or recalcitrant samples. RESULTS: Here we address this problem by employing carrier sequencing, a method to sequence low-input DNA by preparing the target DNA with a genomic carrier to achieve ideal library preparation and sequencing stoichiometry without amplification. We then use CarrierSeq, a sequence analysis workflow to identify the low-input target reads from the genomic carrier. We tested CarrierSeq experimentally by sequencing from a combination of 0.2 ng Bacillus subtilis ATCC 6633 DNA in a background of 1000 ng Enterobacteria phage λ DNA. After filtering of carrier, low quality, and low complexity reads, we detected target reads (B. subtilis), contamination reads, and “high quality noise reads” (HQNRs) not mapping to the carrier, target or known lab contaminants. These reads appear to be artifacts of the nanopore sequencing process as they are associated with specific channels (pores). CONCLUSION: By treating sequencing as a Poisson arrival process, we implement a statistical test to reject data from channels dominated by HQNRs while retaining low-input target reads.
format Online
Article
Text
id pubmed-5872496
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-58724962018-04-02 CarrierSeq: a sequence analysis workflow for low-input nanopore sequencing Mojarro, Angel Hachey, Julie Ruvkun, Gary Zuber, Maria T. Carr, Christopher E. BMC Bioinformatics Software BACKGROUND: Long-read nanopore sequencing technology is of particular significance for taxonomic identification at or below the species level. For many environmental samples, the total extractable DNA is far below the current input requirements of nanopore sequencing, preventing “sample to sequence” metagenomics from low-biomass or recalcitrant samples. RESULTS: Here we address this problem by employing carrier sequencing, a method to sequence low-input DNA by preparing the target DNA with a genomic carrier to achieve ideal library preparation and sequencing stoichiometry without amplification. We then use CarrierSeq, a sequence analysis workflow to identify the low-input target reads from the genomic carrier. We tested CarrierSeq experimentally by sequencing from a combination of 0.2 ng Bacillus subtilis ATCC 6633 DNA in a background of 1000 ng Enterobacteria phage λ DNA. After filtering of carrier, low quality, and low complexity reads, we detected target reads (B. subtilis), contamination reads, and “high quality noise reads” (HQNRs) not mapping to the carrier, target or known lab contaminants. These reads appear to be artifacts of the nanopore sequencing process as they are associated with specific channels (pores). CONCLUSION: By treating sequencing as a Poisson arrival process, we implement a statistical test to reject data from channels dominated by HQNRs while retaining low-input target reads. BioMed Central 2018-03-27 /pmc/articles/PMC5872496/ /pubmed/29587645 http://dx.doi.org/10.1186/s12859-018-2124-3 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Mojarro, Angel
Hachey, Julie
Ruvkun, Gary
Zuber, Maria T.
Carr, Christopher E.
CarrierSeq: a sequence analysis workflow for low-input nanopore sequencing
title CarrierSeq: a sequence analysis workflow for low-input nanopore sequencing
title_full CarrierSeq: a sequence analysis workflow for low-input nanopore sequencing
title_fullStr CarrierSeq: a sequence analysis workflow for low-input nanopore sequencing
title_full_unstemmed CarrierSeq: a sequence analysis workflow for low-input nanopore sequencing
title_short CarrierSeq: a sequence analysis workflow for low-input nanopore sequencing
title_sort carrierseq: a sequence analysis workflow for low-input nanopore sequencing
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5872496/
https://www.ncbi.nlm.nih.gov/pubmed/29587645
http://dx.doi.org/10.1186/s12859-018-2124-3
work_keys_str_mv AT mojarroangel carrierseqasequenceanalysisworkflowforlowinputnanoporesequencing
AT hacheyjulie carrierseqasequenceanalysisworkflowforlowinputnanoporesequencing
AT ruvkungary carrierseqasequenceanalysisworkflowforlowinputnanoporesequencing
AT zubermariat carrierseqasequenceanalysisworkflowforlowinputnanoporesequencing
AT carrchristophere carrierseqasequenceanalysisworkflowforlowinputnanoporesequencing