Cargando…

A multi-component classifier for nonalcoholic fatty liver disease (NAFLD) based on genomic, proteomic, and phenomic data domains

Non-alcoholic fatty liver disease (NAFLD) represents a spectrum of conditions that include steatohepatitis and fibrosis that are thought to emanate from hepatic steatosis. Few robust biomarkers or diagnostic tests have been developed for hepatic steatosis in the setting of obesity. We have developed...

Descripción completa

Detalles Bibliográficos
Autores principales: Wood, G. Craig, Chu, Xin, Argyropoulos, George, Benotti, Peter, Rolston, David, Mirshahi, Tooraj, Petrick, Anthony, Gabrielson, John, Carey, David J., DiStefano, Johanna K., Still, Christopher D., Gerhard, Glenn S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5339694/
https://www.ncbi.nlm.nih.gov/pubmed/28266614
http://dx.doi.org/10.1038/srep43238
_version_ 1782512708529160192
author Wood, G. Craig
Chu, Xin
Argyropoulos, George
Benotti, Peter
Rolston, David
Mirshahi, Tooraj
Petrick, Anthony
Gabrielson, John
Carey, David J.
DiStefano, Johanna K.
Still, Christopher D.
Gerhard, Glenn S.
author_facet Wood, G. Craig
Chu, Xin
Argyropoulos, George
Benotti, Peter
Rolston, David
Mirshahi, Tooraj
Petrick, Anthony
Gabrielson, John
Carey, David J.
DiStefano, Johanna K.
Still, Christopher D.
Gerhard, Glenn S.
author_sort Wood, G. Craig
collection PubMed
description Non-alcoholic fatty liver disease (NAFLD) represents a spectrum of conditions that include steatohepatitis and fibrosis that are thought to emanate from hepatic steatosis. Few robust biomarkers or diagnostic tests have been developed for hepatic steatosis in the setting of obesity. We have developed a multi-component classifier for hepatic steatosis comprised of phenotypic, genomic, and proteomic variables using data from 576 adults with extreme obesity who underwent bariatric surgery and intra-operative liver biopsy. Using a 443 patient training set, protein biomarker discovery was performed using the highly multiplexed SOMAscan(®) proteomic assay, a set of 19 clinical variables, and the steatosis predisposing PNPLA3 rs738409 single nucleotide polymorphism genotype status. The most stable markers were selected using a stability selection algorithm with a L(1)-regularized logistic regression kernel and were then fitted with logistic regression models to classify steatosis, that were then tested against a 133 sample blinded verification set. The highest area under the ROC curve (AUC) for steatosis of PNPLA3 rs738409 genotype, 8 proteins, or 19 phenotypic variables was 0.913, whereas the final classifier that included variables from all three domains had an AUC of 0.935. These data indicate that multi-domain modeling has better predictive power than comprehensive analysis of variables from a single domain.
format Online
Article
Text
id pubmed-5339694
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-53396942017-03-10 A multi-component classifier for nonalcoholic fatty liver disease (NAFLD) based on genomic, proteomic, and phenomic data domains Wood, G. Craig Chu, Xin Argyropoulos, George Benotti, Peter Rolston, David Mirshahi, Tooraj Petrick, Anthony Gabrielson, John Carey, David J. DiStefano, Johanna K. Still, Christopher D. Gerhard, Glenn S. Sci Rep Article Non-alcoholic fatty liver disease (NAFLD) represents a spectrum of conditions that include steatohepatitis and fibrosis that are thought to emanate from hepatic steatosis. Few robust biomarkers or diagnostic tests have been developed for hepatic steatosis in the setting of obesity. We have developed a multi-component classifier for hepatic steatosis comprised of phenotypic, genomic, and proteomic variables using data from 576 adults with extreme obesity who underwent bariatric surgery and intra-operative liver biopsy. Using a 443 patient training set, protein biomarker discovery was performed using the highly multiplexed SOMAscan(®) proteomic assay, a set of 19 clinical variables, and the steatosis predisposing PNPLA3 rs738409 single nucleotide polymorphism genotype status. The most stable markers were selected using a stability selection algorithm with a L(1)-regularized logistic regression kernel and were then fitted with logistic regression models to classify steatosis, that were then tested against a 133 sample blinded verification set. The highest area under the ROC curve (AUC) for steatosis of PNPLA3 rs738409 genotype, 8 proteins, or 19 phenotypic variables was 0.913, whereas the final classifier that included variables from all three domains had an AUC of 0.935. These data indicate that multi-domain modeling has better predictive power than comprehensive analysis of variables from a single domain. Nature Publishing Group 2017-03-07 /pmc/articles/PMC5339694/ /pubmed/28266614 http://dx.doi.org/10.1038/srep43238 Text en Copyright © 2017, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Wood, G. Craig
Chu, Xin
Argyropoulos, George
Benotti, Peter
Rolston, David
Mirshahi, Tooraj
Petrick, Anthony
Gabrielson, John
Carey, David J.
DiStefano, Johanna K.
Still, Christopher D.
Gerhard, Glenn S.
A multi-component classifier for nonalcoholic fatty liver disease (NAFLD) based on genomic, proteomic, and phenomic data domains
title A multi-component classifier for nonalcoholic fatty liver disease (NAFLD) based on genomic, proteomic, and phenomic data domains
title_full A multi-component classifier for nonalcoholic fatty liver disease (NAFLD) based on genomic, proteomic, and phenomic data domains
title_fullStr A multi-component classifier for nonalcoholic fatty liver disease (NAFLD) based on genomic, proteomic, and phenomic data domains
title_full_unstemmed A multi-component classifier for nonalcoholic fatty liver disease (NAFLD) based on genomic, proteomic, and phenomic data domains
title_short A multi-component classifier for nonalcoholic fatty liver disease (NAFLD) based on genomic, proteomic, and phenomic data domains
title_sort multi-component classifier for nonalcoholic fatty liver disease (nafld) based on genomic, proteomic, and phenomic data domains
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5339694/
https://www.ncbi.nlm.nih.gov/pubmed/28266614
http://dx.doi.org/10.1038/srep43238
work_keys_str_mv AT woodgcraig amulticomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains
AT chuxin amulticomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains
AT argyropoulosgeorge amulticomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains
AT benottipeter amulticomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains
AT rolstondavid amulticomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains
AT mirshahitooraj amulticomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains
AT petrickanthony amulticomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains
AT gabrielsonjohn amulticomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains
AT careydavidj amulticomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains
AT distefanojohannak amulticomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains
AT stillchristopherd amulticomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains
AT gerhardglenns amulticomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains
AT woodgcraig multicomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains
AT chuxin multicomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains
AT argyropoulosgeorge multicomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains
AT benottipeter multicomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains
AT rolstondavid multicomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains
AT mirshahitooraj multicomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains
AT petrickanthony multicomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains
AT gabrielsonjohn multicomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains
AT careydavidj multicomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains
AT distefanojohannak multicomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains
AT stillchristopherd multicomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains
AT gerhardglenns multicomponentclassifierfornonalcoholicfattyliverdiseasenafldbasedongenomicproteomicandphenomicdatadomains