Cargando…

GCparagon: evaluating and correcting GC biases in cell-free DNA at the fragment level

Analyses of cell-free DNA (cfDNA) are increasingly being employed for various diagnostic and research applications. Many technologies aim to increase resolution, e.g. for detecting early-stage cancer or minimal residual disease. However, these efforts may be confounded by inherent base composition b...

Descripción completa

Detalles Bibliográficos
Autores principales: Spiegl, Benjamin, Kapidzic, Faruk, Röner, Sebastian, Kircher, Martin, Speicher, Michael R
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10657415/
https://www.ncbi.nlm.nih.gov/pubmed/38025047
http://dx.doi.org/10.1093/nargab/lqad102
_version_ 1785148146458820608
author Spiegl, Benjamin
Kapidzic, Faruk
Röner, Sebastian
Kircher, Martin
Speicher, Michael R
author_facet Spiegl, Benjamin
Kapidzic, Faruk
Röner, Sebastian
Kircher, Martin
Speicher, Michael R
author_sort Spiegl, Benjamin
collection PubMed
description Analyses of cell-free DNA (cfDNA) are increasingly being employed for various diagnostic and research applications. Many technologies aim to increase resolution, e.g. for detecting early-stage cancer or minimal residual disease. However, these efforts may be confounded by inherent base composition biases of cfDNA, specifically the over - and underrepresentation of guanine (G) and cytosine (C) sequences. Currently, there is no universally applicable tool to correct these effects on sequencing read-level data. Here, we present GCparagon, a two-stage algorithm for computing and correcting GC biases in cfDNA samples. In the initial step, length and GC base count parameters are determined. Here, our algorithm minimizes the inclusion of known problematic genomic regions, such as low-mappability regions, in its calculations. In the second step, GCparagon computes weights counterbalancing the distortion of cfDNA attributes (correction matrix). These fragment weights are added to a binary alignment map (BAM) file as alignment tags for individual reads. The GC correction matrix or the tagged BAM file can be used for downstream analyses. Parallel computing allows for a GC bias estimation below 1 min. We demonstrate that GCparagon vastly improves the analysis of regulatory regions, which frequently show specific GC composition patterns and will contribute to standardized cfDNA applications.
format Online
Article
Text
id pubmed-10657415
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-106574152023-11-18 GCparagon: evaluating and correcting GC biases in cell-free DNA at the fragment level Spiegl, Benjamin Kapidzic, Faruk Röner, Sebastian Kircher, Martin Speicher, Michael R NAR Genom Bioinform Application Notes Analyses of cell-free DNA (cfDNA) are increasingly being employed for various diagnostic and research applications. Many technologies aim to increase resolution, e.g. for detecting early-stage cancer or minimal residual disease. However, these efforts may be confounded by inherent base composition biases of cfDNA, specifically the over - and underrepresentation of guanine (G) and cytosine (C) sequences. Currently, there is no universally applicable tool to correct these effects on sequencing read-level data. Here, we present GCparagon, a two-stage algorithm for computing and correcting GC biases in cfDNA samples. In the initial step, length and GC base count parameters are determined. Here, our algorithm minimizes the inclusion of known problematic genomic regions, such as low-mappability regions, in its calculations. In the second step, GCparagon computes weights counterbalancing the distortion of cfDNA attributes (correction matrix). These fragment weights are added to a binary alignment map (BAM) file as alignment tags for individual reads. The GC correction matrix or the tagged BAM file can be used for downstream analyses. Parallel computing allows for a GC bias estimation below 1 min. We demonstrate that GCparagon vastly improves the analysis of regulatory regions, which frequently show specific GC composition patterns and will contribute to standardized cfDNA applications. Oxford University Press 2023-11-18 /pmc/articles/PMC10657415/ /pubmed/38025047 http://dx.doi.org/10.1093/nargab/lqad102 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Application Notes
Spiegl, Benjamin
Kapidzic, Faruk
Röner, Sebastian
Kircher, Martin
Speicher, Michael R
GCparagon: evaluating and correcting GC biases in cell-free DNA at the fragment level
title GCparagon: evaluating and correcting GC biases in cell-free DNA at the fragment level
title_full GCparagon: evaluating and correcting GC biases in cell-free DNA at the fragment level
title_fullStr GCparagon: evaluating and correcting GC biases in cell-free DNA at the fragment level
title_full_unstemmed GCparagon: evaluating and correcting GC biases in cell-free DNA at the fragment level
title_short GCparagon: evaluating and correcting GC biases in cell-free DNA at the fragment level
title_sort gcparagon: evaluating and correcting gc biases in cell-free dna at the fragment level
topic Application Notes
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10657415/
https://www.ncbi.nlm.nih.gov/pubmed/38025047
http://dx.doi.org/10.1093/nargab/lqad102
work_keys_str_mv AT spieglbenjamin gcparagonevaluatingandcorrectinggcbiasesincellfreednaatthefragmentlevel
AT kapidzicfaruk gcparagonevaluatingandcorrectinggcbiasesincellfreednaatthefragmentlevel
AT ronersebastian gcparagonevaluatingandcorrectinggcbiasesincellfreednaatthefragmentlevel
AT kirchermartin gcparagonevaluatingandcorrectinggcbiasesincellfreednaatthefragmentlevel
AT speichermichaelr gcparagonevaluatingandcorrectinggcbiasesincellfreednaatthefragmentlevel