Cargando…
GCparagon: evaluating and correcting GC biases in cell-free DNA at the fragment level
Analyses of cell-free DNA (cfDNA) are increasingly being employed for various diagnostic and research applications. Many technologies aim to increase resolution, e.g. for detecting early-stage cancer or minimal residual disease. However, these efforts may be confounded by inherent base composition b...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10657415/ https://www.ncbi.nlm.nih.gov/pubmed/38025047 http://dx.doi.org/10.1093/nargab/lqad102 |
_version_ | 1785148146458820608 |
---|---|
author | Spiegl, Benjamin Kapidzic, Faruk Röner, Sebastian Kircher, Martin Speicher, Michael R |
author_facet | Spiegl, Benjamin Kapidzic, Faruk Röner, Sebastian Kircher, Martin Speicher, Michael R |
author_sort | Spiegl, Benjamin |
collection | PubMed |
description | Analyses of cell-free DNA (cfDNA) are increasingly being employed for various diagnostic and research applications. Many technologies aim to increase resolution, e.g. for detecting early-stage cancer or minimal residual disease. However, these efforts may be confounded by inherent base composition biases of cfDNA, specifically the over - and underrepresentation of guanine (G) and cytosine (C) sequences. Currently, there is no universally applicable tool to correct these effects on sequencing read-level data. Here, we present GCparagon, a two-stage algorithm for computing and correcting GC biases in cfDNA samples. In the initial step, length and GC base count parameters are determined. Here, our algorithm minimizes the inclusion of known problematic genomic regions, such as low-mappability regions, in its calculations. In the second step, GCparagon computes weights counterbalancing the distortion of cfDNA attributes (correction matrix). These fragment weights are added to a binary alignment map (BAM) file as alignment tags for individual reads. The GC correction matrix or the tagged BAM file can be used for downstream analyses. Parallel computing allows for a GC bias estimation below 1 min. We demonstrate that GCparagon vastly improves the analysis of regulatory regions, which frequently show specific GC composition patterns and will contribute to standardized cfDNA applications. |
format | Online Article Text |
id | pubmed-10657415 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-106574152023-11-18 GCparagon: evaluating and correcting GC biases in cell-free DNA at the fragment level Spiegl, Benjamin Kapidzic, Faruk Röner, Sebastian Kircher, Martin Speicher, Michael R NAR Genom Bioinform Application Notes Analyses of cell-free DNA (cfDNA) are increasingly being employed for various diagnostic and research applications. Many technologies aim to increase resolution, e.g. for detecting early-stage cancer or minimal residual disease. However, these efforts may be confounded by inherent base composition biases of cfDNA, specifically the over - and underrepresentation of guanine (G) and cytosine (C) sequences. Currently, there is no universally applicable tool to correct these effects on sequencing read-level data. Here, we present GCparagon, a two-stage algorithm for computing and correcting GC biases in cfDNA samples. In the initial step, length and GC base count parameters are determined. Here, our algorithm minimizes the inclusion of known problematic genomic regions, such as low-mappability regions, in its calculations. In the second step, GCparagon computes weights counterbalancing the distortion of cfDNA attributes (correction matrix). These fragment weights are added to a binary alignment map (BAM) file as alignment tags for individual reads. The GC correction matrix or the tagged BAM file can be used for downstream analyses. Parallel computing allows for a GC bias estimation below 1 min. We demonstrate that GCparagon vastly improves the analysis of regulatory regions, which frequently show specific GC composition patterns and will contribute to standardized cfDNA applications. Oxford University Press 2023-11-18 /pmc/articles/PMC10657415/ /pubmed/38025047 http://dx.doi.org/10.1093/nargab/lqad102 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Application Notes Spiegl, Benjamin Kapidzic, Faruk Röner, Sebastian Kircher, Martin Speicher, Michael R GCparagon: evaluating and correcting GC biases in cell-free DNA at the fragment level |
title | GCparagon: evaluating and correcting GC biases in cell-free DNA at the fragment level |
title_full | GCparagon: evaluating and correcting GC biases in cell-free DNA at the fragment level |
title_fullStr | GCparagon: evaluating and correcting GC biases in cell-free DNA at the fragment level |
title_full_unstemmed | GCparagon: evaluating and correcting GC biases in cell-free DNA at the fragment level |
title_short | GCparagon: evaluating and correcting GC biases in cell-free DNA at the fragment level |
title_sort | gcparagon: evaluating and correcting gc biases in cell-free dna at the fragment level |
topic | Application Notes |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10657415/ https://www.ncbi.nlm.nih.gov/pubmed/38025047 http://dx.doi.org/10.1093/nargab/lqad102 |
work_keys_str_mv | AT spieglbenjamin gcparagonevaluatingandcorrectinggcbiasesincellfreednaatthefragmentlevel AT kapidzicfaruk gcparagonevaluatingandcorrectinggcbiasesincellfreednaatthefragmentlevel AT ronersebastian gcparagonevaluatingandcorrectinggcbiasesincellfreednaatthefragmentlevel AT kirchermartin gcparagonevaluatingandcorrectinggcbiasesincellfreednaatthefragmentlevel AT speichermichaelr gcparagonevaluatingandcorrectinggcbiasesincellfreednaatthefragmentlevel |