Cargando…

WITER: a powerful method for estimation of cancer-driver genes using a weighted iterative regression modelling background mutation counts

Genomic identification of driver mutations and genes in cancer cells are critical for precision medicine. Due to difficulty in modelling distribution of background mutation counts, existing statistical methods are often underpowered to discriminate cancer-driver genes from passenger genes. Here we p...

Descripción completa

Detalles Bibliográficos
Autores principales: Jiang, Lin, Zheng, Jingjing, Kwan, Johnny S H, Dai, Sheng, Li, Cong, Li, Mulin Jun, Yu, Bolan, TO, Ka F, Sham, Pak C, Zhu, Yonghong, Li, Miaoxin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6895256/
https://www.ncbi.nlm.nih.gov/pubmed/31287869
http://dx.doi.org/10.1093/nar/gkz566
_version_ 1783476559134326784
author Jiang, Lin
Zheng, Jingjing
Kwan, Johnny S H
Dai, Sheng
Li, Cong
Li, Mulin Jun
Yu, Bolan
TO, Ka F
Sham, Pak C
Zhu, Yonghong
Li, Miaoxin
author_facet Jiang, Lin
Zheng, Jingjing
Kwan, Johnny S H
Dai, Sheng
Li, Cong
Li, Mulin Jun
Yu, Bolan
TO, Ka F
Sham, Pak C
Zhu, Yonghong
Li, Miaoxin
author_sort Jiang, Lin
collection PubMed
description Genomic identification of driver mutations and genes in cancer cells are critical for precision medicine. Due to difficulty in modelling distribution of background mutation counts, existing statistical methods are often underpowered to discriminate cancer-driver genes from passenger genes. Here we propose a novel statistical approach, weighted iterative zero-truncated negative-binomial regression (WITER, http://grass.cgs.hku.hk/limx/witer or KGGSeq,http://grass.cgs.hku.hk/limx/kggseq/), to detect cancer-driver genes showing an excess of somatic mutations. By fitting the distribution of background mutation counts properly, this approach works well even in small or moderate samples. Compared to alternative methods, it detected more significant and cancer-consensus genes in most tested cancers. Applying this approach, we estimated 229 driver genes in 26 different types of cancers. In silico validation confirmed 78% of predicted genes as likely known drivers and many other genes as very likely new drivers for corresponding cancers. The technical advances of WITER enable the detection of driver genes in TCGA datasets as small as 30 subjects and rescue of more genes missed by alternative tools in moderate or small samples.
format Online
Article
Text
id pubmed-6895256
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-68952562019-12-11 WITER: a powerful method for estimation of cancer-driver genes using a weighted iterative regression modelling background mutation counts Jiang, Lin Zheng, Jingjing Kwan, Johnny S H Dai, Sheng Li, Cong Li, Mulin Jun Yu, Bolan TO, Ka F Sham, Pak C Zhu, Yonghong Li, Miaoxin Nucleic Acids Res Methods Online Genomic identification of driver mutations and genes in cancer cells are critical for precision medicine. Due to difficulty in modelling distribution of background mutation counts, existing statistical methods are often underpowered to discriminate cancer-driver genes from passenger genes. Here we propose a novel statistical approach, weighted iterative zero-truncated negative-binomial regression (WITER, http://grass.cgs.hku.hk/limx/witer or KGGSeq,http://grass.cgs.hku.hk/limx/kggseq/), to detect cancer-driver genes showing an excess of somatic mutations. By fitting the distribution of background mutation counts properly, this approach works well even in small or moderate samples. Compared to alternative methods, it detected more significant and cancer-consensus genes in most tested cancers. Applying this approach, we estimated 229 driver genes in 26 different types of cancers. In silico validation confirmed 78% of predicted genes as likely known drivers and many other genes as very likely new drivers for corresponding cancers. The technical advances of WITER enable the detection of driver genes in TCGA datasets as small as 30 subjects and rescue of more genes missed by alternative tools in moderate or small samples. Oxford University Press 2019-09-19 2019-07-09 /pmc/articles/PMC6895256/ /pubmed/31287869 http://dx.doi.org/10.1093/nar/gkz566 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Online
Jiang, Lin
Zheng, Jingjing
Kwan, Johnny S H
Dai, Sheng
Li, Cong
Li, Mulin Jun
Yu, Bolan
TO, Ka F
Sham, Pak C
Zhu, Yonghong
Li, Miaoxin
WITER: a powerful method for estimation of cancer-driver genes using a weighted iterative regression modelling background mutation counts
title WITER: a powerful method for estimation of cancer-driver genes using a weighted iterative regression modelling background mutation counts
title_full WITER: a powerful method for estimation of cancer-driver genes using a weighted iterative regression modelling background mutation counts
title_fullStr WITER: a powerful method for estimation of cancer-driver genes using a weighted iterative regression modelling background mutation counts
title_full_unstemmed WITER: a powerful method for estimation of cancer-driver genes using a weighted iterative regression modelling background mutation counts
title_short WITER: a powerful method for estimation of cancer-driver genes using a weighted iterative regression modelling background mutation counts
title_sort witer: a powerful method for estimation of cancer-driver genes using a weighted iterative regression modelling background mutation counts
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6895256/
https://www.ncbi.nlm.nih.gov/pubmed/31287869
http://dx.doi.org/10.1093/nar/gkz566
work_keys_str_mv AT jianglin witerapowerfulmethodforestimationofcancerdrivergenesusingaweightediterativeregressionmodellingbackgroundmutationcounts
AT zhengjingjing witerapowerfulmethodforestimationofcancerdrivergenesusingaweightediterativeregressionmodellingbackgroundmutationcounts
AT kwanjohnnysh witerapowerfulmethodforestimationofcancerdrivergenesusingaweightediterativeregressionmodellingbackgroundmutationcounts
AT daisheng witerapowerfulmethodforestimationofcancerdrivergenesusingaweightediterativeregressionmodellingbackgroundmutationcounts
AT licong witerapowerfulmethodforestimationofcancerdrivergenesusingaweightediterativeregressionmodellingbackgroundmutationcounts
AT limulinjun witerapowerfulmethodforestimationofcancerdrivergenesusingaweightediterativeregressionmodellingbackgroundmutationcounts
AT yubolan witerapowerfulmethodforestimationofcancerdrivergenesusingaweightediterativeregressionmodellingbackgroundmutationcounts
AT tokaf witerapowerfulmethodforestimationofcancerdrivergenesusingaweightediterativeregressionmodellingbackgroundmutationcounts
AT shampakc witerapowerfulmethodforestimationofcancerdrivergenesusingaweightediterativeregressionmodellingbackgroundmutationcounts
AT zhuyonghong witerapowerfulmethodforestimationofcancerdrivergenesusingaweightediterativeregressionmodellingbackgroundmutationcounts
AT limiaoxin witerapowerfulmethodforestimationofcancerdrivergenesusingaweightediterativeregressionmodellingbackgroundmutationcounts