Cargando…
WITER: a powerful method for estimation of cancer-driver genes using a weighted iterative regression modelling background mutation counts
Genomic identification of driver mutations and genes in cancer cells are critical for precision medicine. Due to difficulty in modelling distribution of background mutation counts, existing statistical methods are often underpowered to discriminate cancer-driver genes from passenger genes. Here we p...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6895256/ https://www.ncbi.nlm.nih.gov/pubmed/31287869 http://dx.doi.org/10.1093/nar/gkz566 |
_version_ | 1783476559134326784 |
---|---|
author | Jiang, Lin Zheng, Jingjing Kwan, Johnny S H Dai, Sheng Li, Cong Li, Mulin Jun Yu, Bolan TO, Ka F Sham, Pak C Zhu, Yonghong Li, Miaoxin |
author_facet | Jiang, Lin Zheng, Jingjing Kwan, Johnny S H Dai, Sheng Li, Cong Li, Mulin Jun Yu, Bolan TO, Ka F Sham, Pak C Zhu, Yonghong Li, Miaoxin |
author_sort | Jiang, Lin |
collection | PubMed |
description | Genomic identification of driver mutations and genes in cancer cells are critical for precision medicine. Due to difficulty in modelling distribution of background mutation counts, existing statistical methods are often underpowered to discriminate cancer-driver genes from passenger genes. Here we propose a novel statistical approach, weighted iterative zero-truncated negative-binomial regression (WITER, http://grass.cgs.hku.hk/limx/witer or KGGSeq,http://grass.cgs.hku.hk/limx/kggseq/), to detect cancer-driver genes showing an excess of somatic mutations. By fitting the distribution of background mutation counts properly, this approach works well even in small or moderate samples. Compared to alternative methods, it detected more significant and cancer-consensus genes in most tested cancers. Applying this approach, we estimated 229 driver genes in 26 different types of cancers. In silico validation confirmed 78% of predicted genes as likely known drivers and many other genes as very likely new drivers for corresponding cancers. The technical advances of WITER enable the detection of driver genes in TCGA datasets as small as 30 subjects and rescue of more genes missed by alternative tools in moderate or small samples. |
format | Online Article Text |
id | pubmed-6895256 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-68952562019-12-11 WITER: a powerful method for estimation of cancer-driver genes using a weighted iterative regression modelling background mutation counts Jiang, Lin Zheng, Jingjing Kwan, Johnny S H Dai, Sheng Li, Cong Li, Mulin Jun Yu, Bolan TO, Ka F Sham, Pak C Zhu, Yonghong Li, Miaoxin Nucleic Acids Res Methods Online Genomic identification of driver mutations and genes in cancer cells are critical for precision medicine. Due to difficulty in modelling distribution of background mutation counts, existing statistical methods are often underpowered to discriminate cancer-driver genes from passenger genes. Here we propose a novel statistical approach, weighted iterative zero-truncated negative-binomial regression (WITER, http://grass.cgs.hku.hk/limx/witer or KGGSeq,http://grass.cgs.hku.hk/limx/kggseq/), to detect cancer-driver genes showing an excess of somatic mutations. By fitting the distribution of background mutation counts properly, this approach works well even in small or moderate samples. Compared to alternative methods, it detected more significant and cancer-consensus genes in most tested cancers. Applying this approach, we estimated 229 driver genes in 26 different types of cancers. In silico validation confirmed 78% of predicted genes as likely known drivers and many other genes as very likely new drivers for corresponding cancers. The technical advances of WITER enable the detection of driver genes in TCGA datasets as small as 30 subjects and rescue of more genes missed by alternative tools in moderate or small samples. Oxford University Press 2019-09-19 2019-07-09 /pmc/articles/PMC6895256/ /pubmed/31287869 http://dx.doi.org/10.1093/nar/gkz566 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Methods Online Jiang, Lin Zheng, Jingjing Kwan, Johnny S H Dai, Sheng Li, Cong Li, Mulin Jun Yu, Bolan TO, Ka F Sham, Pak C Zhu, Yonghong Li, Miaoxin WITER: a powerful method for estimation of cancer-driver genes using a weighted iterative regression modelling background mutation counts |
title | WITER: a powerful method for estimation of cancer-driver genes using a weighted iterative regression modelling background mutation counts |
title_full | WITER: a powerful method for estimation of cancer-driver genes using a weighted iterative regression modelling background mutation counts |
title_fullStr | WITER: a powerful method for estimation of cancer-driver genes using a weighted iterative regression modelling background mutation counts |
title_full_unstemmed | WITER: a powerful method for estimation of cancer-driver genes using a weighted iterative regression modelling background mutation counts |
title_short | WITER: a powerful method for estimation of cancer-driver genes using a weighted iterative regression modelling background mutation counts |
title_sort | witer: a powerful method for estimation of cancer-driver genes using a weighted iterative regression modelling background mutation counts |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6895256/ https://www.ncbi.nlm.nih.gov/pubmed/31287869 http://dx.doi.org/10.1093/nar/gkz566 |
work_keys_str_mv | AT jianglin witerapowerfulmethodforestimationofcancerdrivergenesusingaweightediterativeregressionmodellingbackgroundmutationcounts AT zhengjingjing witerapowerfulmethodforestimationofcancerdrivergenesusingaweightediterativeregressionmodellingbackgroundmutationcounts AT kwanjohnnysh witerapowerfulmethodforestimationofcancerdrivergenesusingaweightediterativeregressionmodellingbackgroundmutationcounts AT daisheng witerapowerfulmethodforestimationofcancerdrivergenesusingaweightediterativeregressionmodellingbackgroundmutationcounts AT licong witerapowerfulmethodforestimationofcancerdrivergenesusingaweightediterativeregressionmodellingbackgroundmutationcounts AT limulinjun witerapowerfulmethodforestimationofcancerdrivergenesusingaweightediterativeregressionmodellingbackgroundmutationcounts AT yubolan witerapowerfulmethodforestimationofcancerdrivergenesusingaweightediterativeregressionmodellingbackgroundmutationcounts AT tokaf witerapowerfulmethodforestimationofcancerdrivergenesusingaweightediterativeregressionmodellingbackgroundmutationcounts AT shampakc witerapowerfulmethodforestimationofcancerdrivergenesusingaweightediterativeregressionmodellingbackgroundmutationcounts AT zhuyonghong witerapowerfulmethodforestimationofcancerdrivergenesusingaweightediterativeregressionmodellingbackgroundmutationcounts AT limiaoxin witerapowerfulmethodforestimationofcancerdrivergenesusingaweightediterativeregressionmodellingbackgroundmutationcounts |