Cargando…

Modeling and correct the GC bias of tumor and normal WGS data for SCNA based tumor subclonal population inferring

BACKGROUND: Somatic copy number alternations (SCNAs) can be utilized to infer tumor subclonal populations in whole genome seuqncing studies, where usually their read count ratios between tumor-normal paired samples serve as the inferring proxy. Existing SCNA based subclonal population inferring tool...

Descripción completa

Detalles Bibliográficos
Autores principales: Chu, Yanshuo, Teng, Mingxiang, Wang, Yadong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5907144/
https://www.ncbi.nlm.nih.gov/pubmed/29671389
http://dx.doi.org/10.1186/s12859-018-2099-0
_version_ 1783315472065757184
author Chu, Yanshuo
Teng, Mingxiang
Wang, Yadong
author_facet Chu, Yanshuo
Teng, Mingxiang
Wang, Yadong
author_sort Chu, Yanshuo
collection PubMed
description BACKGROUND: Somatic copy number alternations (SCNAs) can be utilized to infer tumor subclonal populations in whole genome seuqncing studies, where usually their read count ratios between tumor-normal paired samples serve as the inferring proxy. Existing SCNA based subclonal population inferring tools consider the GC bias of tumor and normal sample is of the same fature, and could be fully offset by read count ratio. However, we found that, the read count ratio on SCNA segments presents a Log linear biased pattern, which influence existing read count ratios based subclonal inferring tools performance. Currently no correction tools take into account the read ratio bias. RESULTS: We present Pre-SCNAClonal, a tool that improving tumor subclonal population inferring by correcting GC-bias at SCNAs level. Pre-SCNAClonal first corrects GC bias using Markov chain Monte Carlo probability model, then accurately locates baseline DNA segments (not containing any SCNAs) with a hierarchy clustering model. We show Pre-SCNAClonal’s superiority to exsiting GC-bias correction methods at any level of subclonal population. CONCLUSIONS: Pre-SCNAClonal could be run independently as well as serving as pre-processing/gc-correction step in conjuntion with exsiting SCNA-based subclonal inferring tools. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2099-0) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5907144
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-59071442018-04-30 Modeling and correct the GC bias of tumor and normal WGS data for SCNA based tumor subclonal population inferring Chu, Yanshuo Teng, Mingxiang Wang, Yadong BMC Bioinformatics Research BACKGROUND: Somatic copy number alternations (SCNAs) can be utilized to infer tumor subclonal populations in whole genome seuqncing studies, where usually their read count ratios between tumor-normal paired samples serve as the inferring proxy. Existing SCNA based subclonal population inferring tools consider the GC bias of tumor and normal sample is of the same fature, and could be fully offset by read count ratio. However, we found that, the read count ratio on SCNA segments presents a Log linear biased pattern, which influence existing read count ratios based subclonal inferring tools performance. Currently no correction tools take into account the read ratio bias. RESULTS: We present Pre-SCNAClonal, a tool that improving tumor subclonal population inferring by correcting GC-bias at SCNAs level. Pre-SCNAClonal first corrects GC bias using Markov chain Monte Carlo probability model, then accurately locates baseline DNA segments (not containing any SCNAs) with a hierarchy clustering model. We show Pre-SCNAClonal’s superiority to exsiting GC-bias correction methods at any level of subclonal population. CONCLUSIONS: Pre-SCNAClonal could be run independently as well as serving as pre-processing/gc-correction step in conjuntion with exsiting SCNA-based subclonal inferring tools. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2099-0) contains supplementary material, which is available to authorized users. BioMed Central 2018-04-11 /pmc/articles/PMC5907144/ /pubmed/29671389 http://dx.doi.org/10.1186/s12859-018-2099-0 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Chu, Yanshuo
Teng, Mingxiang
Wang, Yadong
Modeling and correct the GC bias of tumor and normal WGS data for SCNA based tumor subclonal population inferring
title Modeling and correct the GC bias of tumor and normal WGS data for SCNA based tumor subclonal population inferring
title_full Modeling and correct the GC bias of tumor and normal WGS data for SCNA based tumor subclonal population inferring
title_fullStr Modeling and correct the GC bias of tumor and normal WGS data for SCNA based tumor subclonal population inferring
title_full_unstemmed Modeling and correct the GC bias of tumor and normal WGS data for SCNA based tumor subclonal population inferring
title_short Modeling and correct the GC bias of tumor and normal WGS data for SCNA based tumor subclonal population inferring
title_sort modeling and correct the gc bias of tumor and normal wgs data for scna based tumor subclonal population inferring
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5907144/
https://www.ncbi.nlm.nih.gov/pubmed/29671389
http://dx.doi.org/10.1186/s12859-018-2099-0
work_keys_str_mv AT chuyanshuo modelingandcorrectthegcbiasoftumorandnormalwgsdataforscnabasedtumorsubclonalpopulationinferring
AT tengmingxiang modelingandcorrectthegcbiasoftumorandnormalwgsdataforscnabasedtumorsubclonalpopulationinferring
AT wangyadong modelingandcorrectthegcbiasoftumorandnormalwgsdataforscnabasedtumorsubclonalpopulationinferring