Cargando…

Differential distribution improves gene selection stability and has competitive classification performance for patient survival

A consistent difference in average expression level, often referred to as differential expression (DE), has long been used to identify genes useful for classification. However, recent cancer studies have shown that when transcription factors or epigenetic signals become deregulated, a change in expr...

Descripción completa

Detalles Bibliográficos
Autores principales: Strbenac, Dario, Mann, Graham J., Yang, Jean Y.H., Ormerod, John T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5291264/
https://www.ncbi.nlm.nih.gov/pubmed/27190235
http://dx.doi.org/10.1093/nar/gkw444
_version_ 1782504751442690048
author Strbenac, Dario
Mann, Graham J.
Yang, Jean Y.H.
Ormerod, John T.
author_facet Strbenac, Dario
Mann, Graham J.
Yang, Jean Y.H.
Ormerod, John T.
author_sort Strbenac, Dario
collection PubMed
description A consistent difference in average expression level, often referred to as differential expression (DE), has long been used to identify genes useful for classification. However, recent cancer studies have shown that when transcription factors or epigenetic signals become deregulated, a change in expression variability (DV) of target genes is frequently observed. This suggests that assessing the importance of genes by either differential expression or variability alone potentially misses sets of important biomarkers that could lead to improved predictions and treatments. Here, we describe a new approach for assessing the importance of genes based on differential distribution (DD), which combines information from differential expression and differential variability into a unified metric. We show that feature ranking and selection stability based on DD can perform two to three times better than DE or DV alone, and that DD yields equivalent error rates to DE and DV. Finally, assessing genes via differential distribution produces a complementary set of selected genes to DE and DV, potentially opening up new categories of biomarkers.
format Online
Article
Text
id pubmed-5291264
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-52912642017-02-10 Differential distribution improves gene selection stability and has competitive classification performance for patient survival Strbenac, Dario Mann, Graham J. Yang, Jean Y.H. Ormerod, John T. Nucleic Acids Res Methods Online A consistent difference in average expression level, often referred to as differential expression (DE), has long been used to identify genes useful for classification. However, recent cancer studies have shown that when transcription factors or epigenetic signals become deregulated, a change in expression variability (DV) of target genes is frequently observed. This suggests that assessing the importance of genes by either differential expression or variability alone potentially misses sets of important biomarkers that could lead to improved predictions and treatments. Here, we describe a new approach for assessing the importance of genes based on differential distribution (DD), which combines information from differential expression and differential variability into a unified metric. We show that feature ranking and selection stability based on DD can perform two to three times better than DE or DV alone, and that DD yields equivalent error rates to DE and DV. Finally, assessing genes via differential distribution produces a complementary set of selected genes to DE and DV, potentially opening up new categories of biomarkers. Oxford University Press 2016-07-27 2016-05-17 /pmc/articles/PMC5291264/ /pubmed/27190235 http://dx.doi.org/10.1093/nar/gkw444 Text en © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Online
Strbenac, Dario
Mann, Graham J.
Yang, Jean Y.H.
Ormerod, John T.
Differential distribution improves gene selection stability and has competitive classification performance for patient survival
title Differential distribution improves gene selection stability and has competitive classification performance for patient survival
title_full Differential distribution improves gene selection stability and has competitive classification performance for patient survival
title_fullStr Differential distribution improves gene selection stability and has competitive classification performance for patient survival
title_full_unstemmed Differential distribution improves gene selection stability and has competitive classification performance for patient survival
title_short Differential distribution improves gene selection stability and has competitive classification performance for patient survival
title_sort differential distribution improves gene selection stability and has competitive classification performance for patient survival
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5291264/
https://www.ncbi.nlm.nih.gov/pubmed/27190235
http://dx.doi.org/10.1093/nar/gkw444
work_keys_str_mv AT strbenacdario differentialdistributionimprovesgeneselectionstabilityandhascompetitiveclassificationperformanceforpatientsurvival
AT manngrahamj differentialdistributionimprovesgeneselectionstabilityandhascompetitiveclassificationperformanceforpatientsurvival
AT yangjeanyh differentialdistributionimprovesgeneselectionstabilityandhascompetitiveclassificationperformanceforpatientsurvival
AT ormerodjohnt differentialdistributionimprovesgeneselectionstabilityandhascompetitiveclassificationperformanceforpatientsurvival