Cargando…

The efficiency of Nextera XT tagmentation depends on G and C bases in the binding motif leading to uneven coverage in bacterial species with low and neutral GC-content

Whole-genome sequencing (WGS) is becoming the new standard for bacterial high-resolution typing and the performance of laboratories is being evaluated in interlaboratory comparisons. The use of the Illumina Nextera XT library preparation kit has been found to be associated with poorer performance du...

Descripción completa

Detalles Bibliográficos
Autores principales: Segerman, Bo, Ástvaldsson, Ásgeir, Mustafa, Linda, Skarin, Joakim, Skarin, Hanna
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9330563/
https://www.ncbi.nlm.nih.gov/pubmed/35910628
http://dx.doi.org/10.3389/fmicb.2022.944770
_version_ 1784758191972679680
author Segerman, Bo
Ástvaldsson, Ásgeir
Mustafa, Linda
Skarin, Joakim
Skarin, Hanna
author_facet Segerman, Bo
Ástvaldsson, Ásgeir
Mustafa, Linda
Skarin, Joakim
Skarin, Hanna
author_sort Segerman, Bo
collection PubMed
description Whole-genome sequencing (WGS) is becoming the new standard for bacterial high-resolution typing and the performance of laboratories is being evaluated in interlaboratory comparisons. The use of the Illumina Nextera XT library preparation kit has been found to be associated with poorer performance due to a GC-content-dependent coverage bias. The bias is especially strong when sequencing low GC-content species. Here, we have made an in-depth analysis of the Nextera XT coverage bias problem using data from a proficiency test of the low GC-content species Campylobacter jejuni. We have compared Nextera XT with Nextera Flex/DNA Prep and examined the consequences on downstream WGS analysis when using different quantities of raw data. We have also analyzed how the coverage bias relates to differential usage of tagmentation cleavage sites. We found that the tagmentation site was characterized by a symmetrical motif with a central AT-rich region surrounded by Gs and Cs. The Gs and Cs appeared to be the main determinant for cleavage efficiency and the genomic regions that were associated with low coverage only contained low-efficiency cleavage sites. This explains why low GC-content genomes and regions are more subjected to coverage bias. We furthermore extended our analysis to other datasets representing other bacterial species. We visualized how the coverage bias was large in low GC-content species such as C. jejuni, C. coli, Staphylococcus aureus, and Listeria monocytogenes, whereas species with neutral GC-content such as Salmonella enterica and Escherichia coli were only affected in certain regions. Species with high GC-content such as Mycobacterium tuberculosis and Pseudomonas aeruginosa were hardly affected at all. The coverage bias associated with Nextera XT was not found when Nextera Flex/DNA Prep had been used.
format Online
Article
Text
id pubmed-9330563
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-93305632022-07-29 The efficiency of Nextera XT tagmentation depends on G and C bases in the binding motif leading to uneven coverage in bacterial species with low and neutral GC-content Segerman, Bo Ástvaldsson, Ásgeir Mustafa, Linda Skarin, Joakim Skarin, Hanna Front Microbiol Microbiology Whole-genome sequencing (WGS) is becoming the new standard for bacterial high-resolution typing and the performance of laboratories is being evaluated in interlaboratory comparisons. The use of the Illumina Nextera XT library preparation kit has been found to be associated with poorer performance due to a GC-content-dependent coverage bias. The bias is especially strong when sequencing low GC-content species. Here, we have made an in-depth analysis of the Nextera XT coverage bias problem using data from a proficiency test of the low GC-content species Campylobacter jejuni. We have compared Nextera XT with Nextera Flex/DNA Prep and examined the consequences on downstream WGS analysis when using different quantities of raw data. We have also analyzed how the coverage bias relates to differential usage of tagmentation cleavage sites. We found that the tagmentation site was characterized by a symmetrical motif with a central AT-rich region surrounded by Gs and Cs. The Gs and Cs appeared to be the main determinant for cleavage efficiency and the genomic regions that were associated with low coverage only contained low-efficiency cleavage sites. This explains why low GC-content genomes and regions are more subjected to coverage bias. We furthermore extended our analysis to other datasets representing other bacterial species. We visualized how the coverage bias was large in low GC-content species such as C. jejuni, C. coli, Staphylococcus aureus, and Listeria monocytogenes, whereas species with neutral GC-content such as Salmonella enterica and Escherichia coli were only affected in certain regions. Species with high GC-content such as Mycobacterium tuberculosis and Pseudomonas aeruginosa were hardly affected at all. The coverage bias associated with Nextera XT was not found when Nextera Flex/DNA Prep had been used. Frontiers Media S.A. 2022-07-14 /pmc/articles/PMC9330563/ /pubmed/35910628 http://dx.doi.org/10.3389/fmicb.2022.944770 Text en Copyright © 2022 Segerman, Ástvaldsson, Mustafa, Skarin and Skarin. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Microbiology
Segerman, Bo
Ástvaldsson, Ásgeir
Mustafa, Linda
Skarin, Joakim
Skarin, Hanna
The efficiency of Nextera XT tagmentation depends on G and C bases in the binding motif leading to uneven coverage in bacterial species with low and neutral GC-content
title The efficiency of Nextera XT tagmentation depends on G and C bases in the binding motif leading to uneven coverage in bacterial species with low and neutral GC-content
title_full The efficiency of Nextera XT tagmentation depends on G and C bases in the binding motif leading to uneven coverage in bacterial species with low and neutral GC-content
title_fullStr The efficiency of Nextera XT tagmentation depends on G and C bases in the binding motif leading to uneven coverage in bacterial species with low and neutral GC-content
title_full_unstemmed The efficiency of Nextera XT tagmentation depends on G and C bases in the binding motif leading to uneven coverage in bacterial species with low and neutral GC-content
title_short The efficiency of Nextera XT tagmentation depends on G and C bases in the binding motif leading to uneven coverage in bacterial species with low and neutral GC-content
title_sort efficiency of nextera xt tagmentation depends on g and c bases in the binding motif leading to uneven coverage in bacterial species with low and neutral gc-content
topic Microbiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9330563/
https://www.ncbi.nlm.nih.gov/pubmed/35910628
http://dx.doi.org/10.3389/fmicb.2022.944770
work_keys_str_mv AT segermanbo theefficiencyofnexteraxttagmentationdependsongandcbasesinthebindingmotifleadingtounevencoverageinbacterialspecieswithlowandneutralgccontent
AT astvaldssonasgeir theefficiencyofnexteraxttagmentationdependsongandcbasesinthebindingmotifleadingtounevencoverageinbacterialspecieswithlowandneutralgccontent
AT mustafalinda theefficiencyofnexteraxttagmentationdependsongandcbasesinthebindingmotifleadingtounevencoverageinbacterialspecieswithlowandneutralgccontent
AT skarinjoakim theefficiencyofnexteraxttagmentationdependsongandcbasesinthebindingmotifleadingtounevencoverageinbacterialspecieswithlowandneutralgccontent
AT skarinhanna theefficiencyofnexteraxttagmentationdependsongandcbasesinthebindingmotifleadingtounevencoverageinbacterialspecieswithlowandneutralgccontent
AT segermanbo efficiencyofnexteraxttagmentationdependsongandcbasesinthebindingmotifleadingtounevencoverageinbacterialspecieswithlowandneutralgccontent
AT astvaldssonasgeir efficiencyofnexteraxttagmentationdependsongandcbasesinthebindingmotifleadingtounevencoverageinbacterialspecieswithlowandneutralgccontent
AT mustafalinda efficiencyofnexteraxttagmentationdependsongandcbasesinthebindingmotifleadingtounevencoverageinbacterialspecieswithlowandneutralgccontent
AT skarinjoakim efficiencyofnexteraxttagmentationdependsongandcbasesinthebindingmotifleadingtounevencoverageinbacterialspecieswithlowandneutralgccontent
AT skarinhanna efficiencyofnexteraxttagmentationdependsongandcbasesinthebindingmotifleadingtounevencoverageinbacterialspecieswithlowandneutralgccontent