Cargando…

Quality filtering of Illumina index reads mitigates sample cross-talk

BACKGROUND: Multiplexing multiple samples during Illumina sequencing is a common practice and is rapidly growing in importance as the throughput of the platform increases. Misassignments during de-multiplexing, where sequences are associated with the wrong sample, are an overlooked error mode on the...

Descripción completa

Detalles Bibliográficos
Autores principales: Wright, Erik Scott, Vetsigian, Kalin Horen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5097354/
https://www.ncbi.nlm.nih.gov/pubmed/27814679
http://dx.doi.org/10.1186/s12864-016-3217-x
_version_ 1782465583159181312
author Wright, Erik Scott
Vetsigian, Kalin Horen
author_facet Wright, Erik Scott
Vetsigian, Kalin Horen
author_sort Wright, Erik Scott
collection PubMed
description BACKGROUND: Multiplexing multiple samples during Illumina sequencing is a common practice and is rapidly growing in importance as the throughput of the platform increases. Misassignments during de-multiplexing, where sequences are associated with the wrong sample, are an overlooked error mode on the Illumina sequencing platform. This results in a low rate of cross-talk among multiplexed samples and can cause detrimental effects in studies requiring the detection of rare variants or when multiplexing a large number of samples. RESULTS: We observed rates of cross-talk averaging 0.24 % when multiplexing 14 different samples with unique i5 and i7 index sequences. This cross-talk rate corresponded to 254,632 misassigned reads on a single lane of the Illumina HiSeq 2500. Notably, all types of misassignment occur at similar rates: incorrect i5, incorrect i7, and incorrect sequence reads. We demonstrate that misassignments can be nearly eliminated by quality filtering of index reads while preserving about 90 % of the original sequences. CONCLUSIONS: Cross-talk among multiplexed samples is a significant error mode on the Illumina platform, especially if samples are only separated by a single unique index. Quality filtering of index sequences offers an effective solution to minimizing cross-talk among samples. Furthermore, we propose a straightforward method for verifying the extent of cross-talk between samples and optimizing quality score thresholds that does not require additional control samples and can even be performed post hoc on previous runs. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-3217-x) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5097354
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-50973542016-11-07 Quality filtering of Illumina index reads mitigates sample cross-talk Wright, Erik Scott Vetsigian, Kalin Horen BMC Genomics Methodology Article BACKGROUND: Multiplexing multiple samples during Illumina sequencing is a common practice and is rapidly growing in importance as the throughput of the platform increases. Misassignments during de-multiplexing, where sequences are associated with the wrong sample, are an overlooked error mode on the Illumina sequencing platform. This results in a low rate of cross-talk among multiplexed samples and can cause detrimental effects in studies requiring the detection of rare variants or when multiplexing a large number of samples. RESULTS: We observed rates of cross-talk averaging 0.24 % when multiplexing 14 different samples with unique i5 and i7 index sequences. This cross-talk rate corresponded to 254,632 misassigned reads on a single lane of the Illumina HiSeq 2500. Notably, all types of misassignment occur at similar rates: incorrect i5, incorrect i7, and incorrect sequence reads. We demonstrate that misassignments can be nearly eliminated by quality filtering of index reads while preserving about 90 % of the original sequences. CONCLUSIONS: Cross-talk among multiplexed samples is a significant error mode on the Illumina platform, especially if samples are only separated by a single unique index. Quality filtering of index sequences offers an effective solution to minimizing cross-talk among samples. Furthermore, we propose a straightforward method for verifying the extent of cross-talk between samples and optimizing quality score thresholds that does not require additional control samples and can even be performed post hoc on previous runs. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-3217-x) contains supplementary material, which is available to authorized users. BioMed Central 2016-11-04 /pmc/articles/PMC5097354/ /pubmed/27814679 http://dx.doi.org/10.1186/s12864-016-3217-x Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Wright, Erik Scott
Vetsigian, Kalin Horen
Quality filtering of Illumina index reads mitigates sample cross-talk
title Quality filtering of Illumina index reads mitigates sample cross-talk
title_full Quality filtering of Illumina index reads mitigates sample cross-talk
title_fullStr Quality filtering of Illumina index reads mitigates sample cross-talk
title_full_unstemmed Quality filtering of Illumina index reads mitigates sample cross-talk
title_short Quality filtering of Illumina index reads mitigates sample cross-talk
title_sort quality filtering of illumina index reads mitigates sample cross-talk
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5097354/
https://www.ncbi.nlm.nih.gov/pubmed/27814679
http://dx.doi.org/10.1186/s12864-016-3217-x
work_keys_str_mv AT wrighterikscott qualityfilteringofilluminaindexreadsmitigatessamplecrosstalk
AT vetsigiankalinhoren qualityfilteringofilluminaindexreadsmitigatessamplecrosstalk