Cargando…
A probabilistic pathway score (PROPS) for classification with applications to inflammatory bowel disease
SUMMARY: Gene-based supervised machine learning classification models have been widely used to differentiate disease states, predict disease progression and determine effective treatment options. However, many of these classifiers are sensitive to noise and frequently do not replicate in external va...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5860179/ https://www.ncbi.nlm.nih.gov/pubmed/29048458 http://dx.doi.org/10.1093/bioinformatics/btx651 |
_version_ | 1783307957212020736 |
---|---|
author | Han, Lichy Maciejewski, Mateusz Brockel, Christoph Gordon, William Snapper, Scott B Korzenik, Joshua R Afzelius, Lovisa Altman, Russ B |
author_facet | Han, Lichy Maciejewski, Mateusz Brockel, Christoph Gordon, William Snapper, Scott B Korzenik, Joshua R Afzelius, Lovisa Altman, Russ B |
author_sort | Han, Lichy |
collection | PubMed |
description | SUMMARY: Gene-based supervised machine learning classification models have been widely used to differentiate disease states, predict disease progression and determine effective treatment options. However, many of these classifiers are sensitive to noise and frequently do not replicate in external validation sets. For complex, heterogeneous diseases, these classifiers are further limited by being unable to capture varying combinations of genes that lead to the same phenotype. Pathway-based classification can overcome these challenges by using robust, aggregate features to represent biological mechanisms. In this work, we developed a novel pathway-based approach, PRObabilistic Pathway Score, which uses genes to calculate individualized pathway scores for classification. Unlike previous individualized pathway-based classification methods that use gene sets, we incorporate gene interactions using probabilistic graphical models to more accurately represent the underlying biology and achieve better performance. We apply our method to differentiate two similar complex diseases, ulcerative colitis (UC) and Crohn’s disease (CD), which are the two main types of inflammatory bowel disease (IBD). Using five IBD datasets, we compare our method against four gene-based and four alternative pathway-based classifiers in distinguishing CD from UC. We demonstrate superior classification performance and provide biological insight into the top pathways separating CD from UC. AVAILABILITY AND IMPLEMENTATION: PROPS is available as a R package, which can be downloaded at http://simtk.org/home/props or on Bioconductor. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-5860179 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-58601792018-03-21 A probabilistic pathway score (PROPS) for classification with applications to inflammatory bowel disease Han, Lichy Maciejewski, Mateusz Brockel, Christoph Gordon, William Snapper, Scott B Korzenik, Joshua R Afzelius, Lovisa Altman, Russ B Bioinformatics Original Papers SUMMARY: Gene-based supervised machine learning classification models have been widely used to differentiate disease states, predict disease progression and determine effective treatment options. However, many of these classifiers are sensitive to noise and frequently do not replicate in external validation sets. For complex, heterogeneous diseases, these classifiers are further limited by being unable to capture varying combinations of genes that lead to the same phenotype. Pathway-based classification can overcome these challenges by using robust, aggregate features to represent biological mechanisms. In this work, we developed a novel pathway-based approach, PRObabilistic Pathway Score, which uses genes to calculate individualized pathway scores for classification. Unlike previous individualized pathway-based classification methods that use gene sets, we incorporate gene interactions using probabilistic graphical models to more accurately represent the underlying biology and achieve better performance. We apply our method to differentiate two similar complex diseases, ulcerative colitis (UC) and Crohn’s disease (CD), which are the two main types of inflammatory bowel disease (IBD). Using five IBD datasets, we compare our method against four gene-based and four alternative pathway-based classifiers in distinguishing CD from UC. We demonstrate superior classification performance and provide biological insight into the top pathways separating CD from UC. AVAILABILITY AND IMPLEMENTATION: PROPS is available as a R package, which can be downloaded at http://simtk.org/home/props or on Bioconductor. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2018-03-15 2017-10-18 /pmc/articles/PMC5860179/ /pubmed/29048458 http://dx.doi.org/10.1093/bioinformatics/btx651 Text en © The Author 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Papers Han, Lichy Maciejewski, Mateusz Brockel, Christoph Gordon, William Snapper, Scott B Korzenik, Joshua R Afzelius, Lovisa Altman, Russ B A probabilistic pathway score (PROPS) for classification with applications to inflammatory bowel disease |
title | A probabilistic pathway score (PROPS) for classification with applications to inflammatory bowel disease |
title_full | A probabilistic pathway score (PROPS) for classification with applications to inflammatory bowel disease |
title_fullStr | A probabilistic pathway score (PROPS) for classification with applications to inflammatory bowel disease |
title_full_unstemmed | A probabilistic pathway score (PROPS) for classification with applications to inflammatory bowel disease |
title_short | A probabilistic pathway score (PROPS) for classification with applications to inflammatory bowel disease |
title_sort | probabilistic pathway score (props) for classification with applications to inflammatory bowel disease |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5860179/ https://www.ncbi.nlm.nih.gov/pubmed/29048458 http://dx.doi.org/10.1093/bioinformatics/btx651 |
work_keys_str_mv | AT hanlichy aprobabilisticpathwayscorepropsforclassificationwithapplicationstoinflammatoryboweldisease AT maciejewskimateusz aprobabilisticpathwayscorepropsforclassificationwithapplicationstoinflammatoryboweldisease AT brockelchristoph aprobabilisticpathwayscorepropsforclassificationwithapplicationstoinflammatoryboweldisease AT gordonwilliam aprobabilisticpathwayscorepropsforclassificationwithapplicationstoinflammatoryboweldisease AT snapperscottb aprobabilisticpathwayscorepropsforclassificationwithapplicationstoinflammatoryboweldisease AT korzenikjoshuar aprobabilisticpathwayscorepropsforclassificationwithapplicationstoinflammatoryboweldisease AT afzeliuslovisa aprobabilisticpathwayscorepropsforclassificationwithapplicationstoinflammatoryboweldisease AT altmanrussb aprobabilisticpathwayscorepropsforclassificationwithapplicationstoinflammatoryboweldisease AT hanlichy probabilisticpathwayscorepropsforclassificationwithapplicationstoinflammatoryboweldisease AT maciejewskimateusz probabilisticpathwayscorepropsforclassificationwithapplicationstoinflammatoryboweldisease AT brockelchristoph probabilisticpathwayscorepropsforclassificationwithapplicationstoinflammatoryboweldisease AT gordonwilliam probabilisticpathwayscorepropsforclassificationwithapplicationstoinflammatoryboweldisease AT snapperscottb probabilisticpathwayscorepropsforclassificationwithapplicationstoinflammatoryboweldisease AT korzenikjoshuar probabilisticpathwayscorepropsforclassificationwithapplicationstoinflammatoryboweldisease AT afzeliuslovisa probabilisticpathwayscorepropsforclassificationwithapplicationstoinflammatoryboweldisease AT altmanrussb probabilisticpathwayscorepropsforclassificationwithapplicationstoinflammatoryboweldisease |