Cargando…
Modeling and correct the GC bias of tumor and normal WGS data for SCNA based tumor subclonal population inferring
BACKGROUND: Somatic copy number alternations (SCNAs) can be utilized to infer tumor subclonal populations in whole genome seuqncing studies, where usually their read count ratios between tumor-normal paired samples serve as the inferring proxy. Existing SCNA based subclonal population inferring tool...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5907144/ https://www.ncbi.nlm.nih.gov/pubmed/29671389 http://dx.doi.org/10.1186/s12859-018-2099-0 |
_version_ | 1783315472065757184 |
---|---|
author | Chu, Yanshuo Teng, Mingxiang Wang, Yadong |
author_facet | Chu, Yanshuo Teng, Mingxiang Wang, Yadong |
author_sort | Chu, Yanshuo |
collection | PubMed |
description | BACKGROUND: Somatic copy number alternations (SCNAs) can be utilized to infer tumor subclonal populations in whole genome seuqncing studies, where usually their read count ratios between tumor-normal paired samples serve as the inferring proxy. Existing SCNA based subclonal population inferring tools consider the GC bias of tumor and normal sample is of the same fature, and could be fully offset by read count ratio. However, we found that, the read count ratio on SCNA segments presents a Log linear biased pattern, which influence existing read count ratios based subclonal inferring tools performance. Currently no correction tools take into account the read ratio bias. RESULTS: We present Pre-SCNAClonal, a tool that improving tumor subclonal population inferring by correcting GC-bias at SCNAs level. Pre-SCNAClonal first corrects GC bias using Markov chain Monte Carlo probability model, then accurately locates baseline DNA segments (not containing any SCNAs) with a hierarchy clustering model. We show Pre-SCNAClonal’s superiority to exsiting GC-bias correction methods at any level of subclonal population. CONCLUSIONS: Pre-SCNAClonal could be run independently as well as serving as pre-processing/gc-correction step in conjuntion with exsiting SCNA-based subclonal inferring tools. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2099-0) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5907144 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-59071442018-04-30 Modeling and correct the GC bias of tumor and normal WGS data for SCNA based tumor subclonal population inferring Chu, Yanshuo Teng, Mingxiang Wang, Yadong BMC Bioinformatics Research BACKGROUND: Somatic copy number alternations (SCNAs) can be utilized to infer tumor subclonal populations in whole genome seuqncing studies, where usually their read count ratios between tumor-normal paired samples serve as the inferring proxy. Existing SCNA based subclonal population inferring tools consider the GC bias of tumor and normal sample is of the same fature, and could be fully offset by read count ratio. However, we found that, the read count ratio on SCNA segments presents a Log linear biased pattern, which influence existing read count ratios based subclonal inferring tools performance. Currently no correction tools take into account the read ratio bias. RESULTS: We present Pre-SCNAClonal, a tool that improving tumor subclonal population inferring by correcting GC-bias at SCNAs level. Pre-SCNAClonal first corrects GC bias using Markov chain Monte Carlo probability model, then accurately locates baseline DNA segments (not containing any SCNAs) with a hierarchy clustering model. We show Pre-SCNAClonal’s superiority to exsiting GC-bias correction methods at any level of subclonal population. CONCLUSIONS: Pre-SCNAClonal could be run independently as well as serving as pre-processing/gc-correction step in conjuntion with exsiting SCNA-based subclonal inferring tools. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2099-0) contains supplementary material, which is available to authorized users. BioMed Central 2018-04-11 /pmc/articles/PMC5907144/ /pubmed/29671389 http://dx.doi.org/10.1186/s12859-018-2099-0 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Chu, Yanshuo Teng, Mingxiang Wang, Yadong Modeling and correct the GC bias of tumor and normal WGS data for SCNA based tumor subclonal population inferring |
title | Modeling and correct the GC bias of tumor and normal WGS data for SCNA based tumor subclonal population inferring |
title_full | Modeling and correct the GC bias of tumor and normal WGS data for SCNA based tumor subclonal population inferring |
title_fullStr | Modeling and correct the GC bias of tumor and normal WGS data for SCNA based tumor subclonal population inferring |
title_full_unstemmed | Modeling and correct the GC bias of tumor and normal WGS data for SCNA based tumor subclonal population inferring |
title_short | Modeling and correct the GC bias of tumor and normal WGS data for SCNA based tumor subclonal population inferring |
title_sort | modeling and correct the gc bias of tumor and normal wgs data for scna based tumor subclonal population inferring |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5907144/ https://www.ncbi.nlm.nih.gov/pubmed/29671389 http://dx.doi.org/10.1186/s12859-018-2099-0 |
work_keys_str_mv | AT chuyanshuo modelingandcorrectthegcbiasoftumorandnormalwgsdataforscnabasedtumorsubclonalpopulationinferring AT tengmingxiang modelingandcorrectthegcbiasoftumorandnormalwgsdataforscnabasedtumorsubclonalpopulationinferring AT wangyadong modelingandcorrectthegcbiasoftumorandnormalwgsdataforscnabasedtumorsubclonalpopulationinferring |