Cargando…
A multi-component classifier for nonalcoholic fatty liver disease (NAFLD) based on genomic, proteomic, and phenomic data domains
Non-alcoholic fatty liver disease (NAFLD) represents a spectrum of conditions that include steatohepatitis and fibrosis that are thought to emanate from hepatic steatosis. Few robust biomarkers or diagnostic tests have been developed for hepatic steatosis in the setting of obesity. We have developed...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5339694/ https://www.ncbi.nlm.nih.gov/pubmed/28266614 http://dx.doi.org/10.1038/srep43238 |
_version_ | 1782512708529160192 |
---|---|
author | Wood, G. Craig Chu, Xin Argyropoulos, George Benotti, Peter Rolston, David Mirshahi, Tooraj Petrick, Anthony Gabrielson, John Carey, David J. DiStefano, Johanna K. Still, Christopher D. Gerhard, Glenn S. |
author_facet | Wood, G. Craig Chu, Xin Argyropoulos, George Benotti, Peter Rolston, David Mirshahi, Tooraj Petrick, Anthony Gabrielson, John Carey, David J. DiStefano, Johanna K. Still, Christopher D. Gerhard, Glenn S. |
author_sort | Wood, G. Craig |
collection | PubMed |
description | Non-alcoholic fatty liver disease (NAFLD) represents a spectrum of conditions that include steatohepatitis and fibrosis that are thought to emanate from hepatic steatosis. Few robust biomarkers or diagnostic tests have been developed for hepatic steatosis in the setting of obesity. We have developed a multi-component classifier for hepatic steatosis comprised of phenotypic, genomic, and proteomic variables using data from 576 adults with extreme obesity who underwent bariatric surgery and intra-operative liver biopsy. Using a 443 patient training set, protein biomarker discovery was performed using the highly multiplexed SOMAscan(®) proteomic assay, a set of 19 clinical variables, and the steatosis predisposing PNPLA3 rs738409 single nucleotide polymorphism genotype status. The most stable markers were selected using a stability selection algorithm with a L(1)-regularized logistic regression kernel and were then fitted with logistic regression models to classify steatosis, that were then tested against a 133 sample blinded verification set. The highest area under the ROC curve (AUC) for steatosis of PNPLA3 rs738409 genotype, 8 proteins, or 19 phenotypic variables was 0.913, whereas the final classifier that included variables from all three domains had an AUC of 0.935. These data indicate that multi-domain modeling has better predictive power than comprehensive analysis of variables from a single domain. |
format | Online Article Text |
id | pubmed-5339694 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-53396942017-03-10 A multi-component classifier for nonalcoholic fatty liver disease (NAFLD) based on genomic, proteomic, and phenomic data domains Wood, G. Craig Chu, Xin Argyropoulos, George Benotti, Peter Rolston, David Mirshahi, Tooraj Petrick, Anthony Gabrielson, John Carey, David J. DiStefano, Johanna K. Still, Christopher D. Gerhard, Glenn S. Sci Rep Article Non-alcoholic fatty liver disease (NAFLD) represents a spectrum of conditions that include steatohepatitis and fibrosis that are thought to emanate from hepatic steatosis. Few robust biomarkers or diagnostic tests have been developed for hepatic steatosis in the setting of obesity. We have developed a multi-component classifier for hepatic steatosis comprised of phenotypic, genomic, and proteomic variables using data from 576 adults with extreme obesity who underwent bariatric surgery and intra-operative liver biopsy. Using a 443 patient training set, protein biomarker discovery was performed using the highly multiplexed SOMAscan(®) proteomic assay, a set of 19 clinical variables, and the steatosis predisposing PNPLA3 rs738409 single nucleotide polymorphism genotype status. The most stable markers were selected using a stability selection algorithm with a L(1)-regularized logistic regression kernel and were then fitted with logistic regression models to classify steatosis, that were then tested against a 133 sample blinded verification set. The highest area under the ROC curve (AUC) for steatosis of PNPLA3 rs738409 genotype, 8 proteins, or 19 phenotypic variables was 0.913, whereas the final classifier that included variables from all three domains had an AUC of 0.935. These data indicate that multi-domain modeling has better predictive power than comprehensive analysis of variables from a single domain. Nature Publishing Group 2017-03-07 /pmc/articles/PMC5339694/ /pubmed/28266614 http://dx.doi.org/10.1038/srep43238 Text en Copyright © 2017, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Wood, G. Craig Chu, Xin Argyropoulos, George Benotti, Peter Rolston, David Mirshahi, Tooraj Petrick, Anthony Gabrielson, John Carey, David J. DiStefano, Johanna K. Still, Christopher D. Gerhard, Glenn S. A multi-component classifier for nonalcoholic fatty liver disease (NAFLD) based on genomic, proteomic, and phenomic data domains |
title | A multi-component classifier for nonalcoholic fatty liver disease (NAFLD) based on genomic, proteomic, and phenomic data domains |
title_full | A multi-component classifier for nonalcoholic fatty liver disease (NAFLD) based on genomic, proteomic, and phenomic data domains |
title_fullStr | A multi-component classifier for nonalcoholic fatty liver disease (NAFLD) based on genomic, proteomic, and phenomic data domains |
title_full_unstemmed | A multi-component classifier for nonalcoholic fatty liver disease (NAFLD) based on genomic, proteomic, and phenomic data domains |
title_short | A multi-component classifier for nonalcoholic fatty liver disease (NAFLD) based on genomic, proteomic, and phenomic data domains |
title_sort | multi-component classifier for nonalcoholic fatty liver disease (nafld) based on genomic, proteomic, and phenomic data domains |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5339694/ https://www.ncbi.nlm.nih.gov/pubmed/28266614 http://dx.doi.org/10.1038/srep43238 |
work_keys_str_mv | AT woodgcraig amulticomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains AT chuxin amulticomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains AT argyropoulosgeorge amulticomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains AT benottipeter amulticomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains AT rolstondavid amulticomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains AT mirshahitooraj amulticomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains AT petrickanthony amulticomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains AT gabrielsonjohn amulticomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains AT careydavidj amulticomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains AT distefanojohannak amulticomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains AT stillchristopherd amulticomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains AT gerhardglenns amulticomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains AT woodgcraig multicomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains AT chuxin multicomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains AT argyropoulosgeorge multicomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains AT benottipeter multicomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains AT rolstondavid multicomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains AT mirshahitooraj multicomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains AT petrickanthony multicomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains AT gabrielsonjohn multicomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains AT careydavidj multicomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains AT distefanojohannak multicomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains AT stillchristopherd multicomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains AT gerhardglenns multicomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains |