Cargando…

Sequencing artifacts derived from a library preparation method using enzymatic fragmentation

DNA fragmentation is a fundamental step during library preparation in hybridization capture-based, short-read sequencing. Ultra-sonication has been used thus far to prepare DNA of an appropriate size, but this method is associated with a considerable loss of DNA sample. More recently, studies have e...

Descripción completa

Detalles Bibliográficos
Autores principales: Tanaka, Norio, Takahara, Akihisa, Hagio, Taichi, Nishiko, Rika, Kanayama, Junko, Gotoh, Osamu, Mori, Seiichi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6941819/
https://www.ncbi.nlm.nih.gov/pubmed/31899787
http://dx.doi.org/10.1371/journal.pone.0227427
_version_ 1783484602733559808
author Tanaka, Norio
Takahara, Akihisa
Hagio, Taichi
Nishiko, Rika
Kanayama, Junko
Gotoh, Osamu
Mori, Seiichi
author_facet Tanaka, Norio
Takahara, Akihisa
Hagio, Taichi
Nishiko, Rika
Kanayama, Junko
Gotoh, Osamu
Mori, Seiichi
author_sort Tanaka, Norio
collection PubMed
description DNA fragmentation is a fundamental step during library preparation in hybridization capture-based, short-read sequencing. Ultra-sonication has been used thus far to prepare DNA of an appropriate size, but this method is associated with a considerable loss of DNA sample. More recently, studies have employed library preparation methods that rely on enzymatic fragmentation with DNA endonucleases to minimize DNA loss, particularly in nano-quantity samples. Yet, despite their wide use, the effect of enzymatic fragmentation on the resultant sequences has not been carefully assessed. Here, we used pairwise comparisons of somatic variants of the same tumor DNA samples prepared using ultrasonic and enzymatic fragmentation methods. Our analysis revealed a substantially larger number of recurrent artifactual SNVs/indels in endonuclease-treated libraries as compared with those created through ultrasonication. These artifacts were marked by palindromic structure in the genomic context, positional bias in sequenced reads, and multi-nucleotide substitutions. Taking advantage of these distinctive features, we developed a filtering algorithm to distinguish genuine somatic mutations from artifactual noise with high specificity and sensitivity. Noise cancelling recovered the composition of the mutational signatures in the tumor samples. Thus, we provide an informatics algorithm as a solution to the sequencing errors produced as a consequence of endonuclease-mediated fragmentation, highlighted for the first time in this study.
format Online
Article
Text
id pubmed-6941819
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-69418192020-01-10 Sequencing artifacts derived from a library preparation method using enzymatic fragmentation Tanaka, Norio Takahara, Akihisa Hagio, Taichi Nishiko, Rika Kanayama, Junko Gotoh, Osamu Mori, Seiichi PLoS One Research Article DNA fragmentation is a fundamental step during library preparation in hybridization capture-based, short-read sequencing. Ultra-sonication has been used thus far to prepare DNA of an appropriate size, but this method is associated with a considerable loss of DNA sample. More recently, studies have employed library preparation methods that rely on enzymatic fragmentation with DNA endonucleases to minimize DNA loss, particularly in nano-quantity samples. Yet, despite their wide use, the effect of enzymatic fragmentation on the resultant sequences has not been carefully assessed. Here, we used pairwise comparisons of somatic variants of the same tumor DNA samples prepared using ultrasonic and enzymatic fragmentation methods. Our analysis revealed a substantially larger number of recurrent artifactual SNVs/indels in endonuclease-treated libraries as compared with those created through ultrasonication. These artifacts were marked by palindromic structure in the genomic context, positional bias in sequenced reads, and multi-nucleotide substitutions. Taking advantage of these distinctive features, we developed a filtering algorithm to distinguish genuine somatic mutations from artifactual noise with high specificity and sensitivity. Noise cancelling recovered the composition of the mutational signatures in the tumor samples. Thus, we provide an informatics algorithm as a solution to the sequencing errors produced as a consequence of endonuclease-mediated fragmentation, highlighted for the first time in this study. Public Library of Science 2020-01-03 /pmc/articles/PMC6941819/ /pubmed/31899787 http://dx.doi.org/10.1371/journal.pone.0227427 Text en © 2020 Tanaka et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Tanaka, Norio
Takahara, Akihisa
Hagio, Taichi
Nishiko, Rika
Kanayama, Junko
Gotoh, Osamu
Mori, Seiichi
Sequencing artifacts derived from a library preparation method using enzymatic fragmentation
title Sequencing artifacts derived from a library preparation method using enzymatic fragmentation
title_full Sequencing artifacts derived from a library preparation method using enzymatic fragmentation
title_fullStr Sequencing artifacts derived from a library preparation method using enzymatic fragmentation
title_full_unstemmed Sequencing artifacts derived from a library preparation method using enzymatic fragmentation
title_short Sequencing artifacts derived from a library preparation method using enzymatic fragmentation
title_sort sequencing artifacts derived from a library preparation method using enzymatic fragmentation
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6941819/
https://www.ncbi.nlm.nih.gov/pubmed/31899787
http://dx.doi.org/10.1371/journal.pone.0227427
work_keys_str_mv AT tanakanorio sequencingartifactsderivedfromalibrarypreparationmethodusingenzymaticfragmentation
AT takaharaakihisa sequencingartifactsderivedfromalibrarypreparationmethodusingenzymaticfragmentation
AT hagiotaichi sequencingartifactsderivedfromalibrarypreparationmethodusingenzymaticfragmentation
AT nishikorika sequencingartifactsderivedfromalibrarypreparationmethodusingenzymaticfragmentation
AT kanayamajunko sequencingartifactsderivedfromalibrarypreparationmethodusingenzymaticfragmentation
AT gotohosamu sequencingartifactsderivedfromalibrarypreparationmethodusingenzymaticfragmentation
AT moriseiichi sequencingartifactsderivedfromalibrarypreparationmethodusingenzymaticfragmentation