Cargando…

Characterization and remediation of sample index swaps by non-redundant dual indexing on massively parallel sequencing platforms

BACKGROUND: Here we present an in-depth characterization of the mechanism of sequencer-induced sample contamination due to the phenomenon of index swapping that impacts Illumina sequencers employing patterned flow cells with Exclusion Amplification (ExAmp) chemistry (HiSeqX, HiSeq4000, and NovaSeq)....

Descripción completa

Detalles Bibliográficos
Autores principales: Costello, Maura, Fleharty, Mark, Abreu, Justin, Farjoun, Yossi, Ferriera, Steven, Holmes, Laurie, Granger, Brian, Green, Lisa, Howd, Tom, Mason, Tamara, Vicente, Gina, Dasilva, Michael, Brodeur, Wendy, DeSmet, Timothy, Dodge, Sheila, Lennon, Niall J., Gabriel, Stacey
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5941783/
https://www.ncbi.nlm.nih.gov/pubmed/29739332
http://dx.doi.org/10.1186/s12864-018-4703-0
_version_ 1783321354691411968
author Costello, Maura
Fleharty, Mark
Abreu, Justin
Farjoun, Yossi
Ferriera, Steven
Holmes, Laurie
Granger, Brian
Green, Lisa
Howd, Tom
Mason, Tamara
Vicente, Gina
Dasilva, Michael
Brodeur, Wendy
DeSmet, Timothy
Dodge, Sheila
Lennon, Niall J.
Gabriel, Stacey
author_facet Costello, Maura
Fleharty, Mark
Abreu, Justin
Farjoun, Yossi
Ferriera, Steven
Holmes, Laurie
Granger, Brian
Green, Lisa
Howd, Tom
Mason, Tamara
Vicente, Gina
Dasilva, Michael
Brodeur, Wendy
DeSmet, Timothy
Dodge, Sheila
Lennon, Niall J.
Gabriel, Stacey
author_sort Costello, Maura
collection PubMed
description BACKGROUND: Here we present an in-depth characterization of the mechanism of sequencer-induced sample contamination due to the phenomenon of index swapping that impacts Illumina sequencers employing patterned flow cells with Exclusion Amplification (ExAmp) chemistry (HiSeqX, HiSeq4000, and NovaSeq). We also present a remediation method that minimizes the impact of such swaps. RESULTS: Leveraging data collected over a two-year period, we demonstrate the widespread prevalence of index swapping in patterned flow cell data. We calculate mean swap rates across multiple sample preparation methods and sequencer models, demonstrating that different library methods can have vastly different swapping rates and that even non-ExAmp chemistry instruments display trace levels of index swapping. We provide methods for eliminating sample data cross contamination by utilizing non-redundant dual indexing for complete filtering of index swapped reads, and share the sequences for 96 non-combinatorial dual indexes we have validated across various library preparation methods and sequencer models. Finally, using computational methods we provide a greater insight into the mechanism of index swapping. CONCLUSIONS: Index swapping in pooled libraries is a prevalent phenomenon that we observe at a rate of 0.2 to 6% in all sequencing runs on HiSeqX, HiSeq 4000/3000, and NovaSeq. Utilizing non-redundant dual indexing allows for the removal (flagging/filtering) of these swapped reads and eliminates swapping induced sample contamination, which is critical for sensitive applications such as RNA-seq, single cell, blood biopsy using circulating tumor DNA, or clinical sequencing. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-018-4703-0) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5941783
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-59417832018-05-14 Characterization and remediation of sample index swaps by non-redundant dual indexing on massively parallel sequencing platforms Costello, Maura Fleharty, Mark Abreu, Justin Farjoun, Yossi Ferriera, Steven Holmes, Laurie Granger, Brian Green, Lisa Howd, Tom Mason, Tamara Vicente, Gina Dasilva, Michael Brodeur, Wendy DeSmet, Timothy Dodge, Sheila Lennon, Niall J. Gabriel, Stacey BMC Genomics Research Article BACKGROUND: Here we present an in-depth characterization of the mechanism of sequencer-induced sample contamination due to the phenomenon of index swapping that impacts Illumina sequencers employing patterned flow cells with Exclusion Amplification (ExAmp) chemistry (HiSeqX, HiSeq4000, and NovaSeq). We also present a remediation method that minimizes the impact of such swaps. RESULTS: Leveraging data collected over a two-year period, we demonstrate the widespread prevalence of index swapping in patterned flow cell data. We calculate mean swap rates across multiple sample preparation methods and sequencer models, demonstrating that different library methods can have vastly different swapping rates and that even non-ExAmp chemistry instruments display trace levels of index swapping. We provide methods for eliminating sample data cross contamination by utilizing non-redundant dual indexing for complete filtering of index swapped reads, and share the sequences for 96 non-combinatorial dual indexes we have validated across various library preparation methods and sequencer models. Finally, using computational methods we provide a greater insight into the mechanism of index swapping. CONCLUSIONS: Index swapping in pooled libraries is a prevalent phenomenon that we observe at a rate of 0.2 to 6% in all sequencing runs on HiSeqX, HiSeq 4000/3000, and NovaSeq. Utilizing non-redundant dual indexing allows for the removal (flagging/filtering) of these swapped reads and eliminates swapping induced sample contamination, which is critical for sensitive applications such as RNA-seq, single cell, blood biopsy using circulating tumor DNA, or clinical sequencing. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-018-4703-0) contains supplementary material, which is available to authorized users. BioMed Central 2018-05-08 /pmc/articles/PMC5941783/ /pubmed/29739332 http://dx.doi.org/10.1186/s12864-018-4703-0 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Costello, Maura
Fleharty, Mark
Abreu, Justin
Farjoun, Yossi
Ferriera, Steven
Holmes, Laurie
Granger, Brian
Green, Lisa
Howd, Tom
Mason, Tamara
Vicente, Gina
Dasilva, Michael
Brodeur, Wendy
DeSmet, Timothy
Dodge, Sheila
Lennon, Niall J.
Gabriel, Stacey
Characterization and remediation of sample index swaps by non-redundant dual indexing on massively parallel sequencing platforms
title Characterization and remediation of sample index swaps by non-redundant dual indexing on massively parallel sequencing platforms
title_full Characterization and remediation of sample index swaps by non-redundant dual indexing on massively parallel sequencing platforms
title_fullStr Characterization and remediation of sample index swaps by non-redundant dual indexing on massively parallel sequencing platforms
title_full_unstemmed Characterization and remediation of sample index swaps by non-redundant dual indexing on massively parallel sequencing platforms
title_short Characterization and remediation of sample index swaps by non-redundant dual indexing on massively parallel sequencing platforms
title_sort characterization and remediation of sample index swaps by non-redundant dual indexing on massively parallel sequencing platforms
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5941783/
https://www.ncbi.nlm.nih.gov/pubmed/29739332
http://dx.doi.org/10.1186/s12864-018-4703-0
work_keys_str_mv AT costellomaura characterizationandremediationofsampleindexswapsbynonredundantdualindexingonmassivelyparallelsequencingplatforms
AT flehartymark characterizationandremediationofsampleindexswapsbynonredundantdualindexingonmassivelyparallelsequencingplatforms
AT abreujustin characterizationandremediationofsampleindexswapsbynonredundantdualindexingonmassivelyparallelsequencingplatforms
AT farjounyossi characterizationandremediationofsampleindexswapsbynonredundantdualindexingonmassivelyparallelsequencingplatforms
AT ferrierasteven characterizationandremediationofsampleindexswapsbynonredundantdualindexingonmassivelyparallelsequencingplatforms
AT holmeslaurie characterizationandremediationofsampleindexswapsbynonredundantdualindexingonmassivelyparallelsequencingplatforms
AT grangerbrian characterizationandremediationofsampleindexswapsbynonredundantdualindexingonmassivelyparallelsequencingplatforms
AT greenlisa characterizationandremediationofsampleindexswapsbynonredundantdualindexingonmassivelyparallelsequencingplatforms
AT howdtom characterizationandremediationofsampleindexswapsbynonredundantdualindexingonmassivelyparallelsequencingplatforms
AT masontamara characterizationandremediationofsampleindexswapsbynonredundantdualindexingonmassivelyparallelsequencingplatforms
AT vicentegina characterizationandremediationofsampleindexswapsbynonredundantdualindexingonmassivelyparallelsequencingplatforms
AT dasilvamichael characterizationandremediationofsampleindexswapsbynonredundantdualindexingonmassivelyparallelsequencingplatforms
AT brodeurwendy characterizationandremediationofsampleindexswapsbynonredundantdualindexingonmassivelyparallelsequencingplatforms
AT desmettimothy characterizationandremediationofsampleindexswapsbynonredundantdualindexingonmassivelyparallelsequencingplatforms
AT dodgesheila characterizationandremediationofsampleindexswapsbynonredundantdualindexingonmassivelyparallelsequencingplatforms
AT lennonniallj characterizationandremediationofsampleindexswapsbynonredundantdualindexingonmassivelyparallelsequencingplatforms
AT gabrielstacey characterizationandremediationofsampleindexswapsbynonredundantdualindexingonmassivelyparallelsequencingplatforms