Cargando…

Multi-TGDR, a multi-class regularization method, identifies the metabolic profiles of hepatocellular carcinoma and cirrhosis infected with hepatitis B or hepatitis C virus

BACKGROUND: Over the last decade, metabolomics has evolved into a mainstream enterprise utilized by many laboratories globally. Like other “omics” data, metabolomics data has the characteristics of a smaller sample size compared to the number of features evaluated. Thus the selection of an optimal s...

Descripción completa

Detalles Bibliográficos
Autores principales: Tian, Suyan, Chang, Howard H, Wang, Chi, Jiang, Jing, Wang, Xiaomei, Niu, Junqi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4234477/
https://www.ncbi.nlm.nih.gov/pubmed/24707821
http://dx.doi.org/10.1186/1471-2105-15-97
_version_ 1782344868334403584
author Tian, Suyan
Chang, Howard H
Wang, Chi
Jiang, Jing
Wang, Xiaomei
Niu, Junqi
author_facet Tian, Suyan
Chang, Howard H
Wang, Chi
Jiang, Jing
Wang, Xiaomei
Niu, Junqi
author_sort Tian, Suyan
collection PubMed
description BACKGROUND: Over the last decade, metabolomics has evolved into a mainstream enterprise utilized by many laboratories globally. Like other “omics” data, metabolomics data has the characteristics of a smaller sample size compared to the number of features evaluated. Thus the selection of an optimal subset of features with a supervised classifier is imperative. We extended an existing feature selection algorithm, threshold gradient descent regularization (TGDR), to handle multi-class classification of “omics” data, and proposed two such extensions referred to as multi-TGDR. Both multi-TGDR frameworks were used to analyze a metabolomics dataset that compares the metabolic profiles of hepatocellular carcinoma (HCC) infected with hepatitis B (HBV) or C virus (HCV) with that of cirrhosis induced by HBV/HCV infection; the goal was to improve early-stage diagnosis of HCC. RESULTS: We applied two multi-TGDR frameworks to the HCC metabolomics data that determined TGDR thresholds either globally across classes, or locally for each class. Multi-TGDR global model selected 45 metabolites with a 0% misclassification rate (the error rate on the training data) and had a 3.82% 5-fold cross-validation (CV-5) predictive error rate. Multi-TGDR local selected 48 metabolites with a 0% misclassification rate and a 5.34% CV-5 error rate. CONCLUSIONS: One important advantage of multi-TGDR local is that it allows inference for determining which feature is related specifically to the class/classes. Thus, we recommend multi-TGDR local be used because it has similar predictive performance and requires the same computing time as multi-TGDR global, but may provide class-specific inference.
format Online
Article
Text
id pubmed-4234477
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42344772014-11-19 Multi-TGDR, a multi-class regularization method, identifies the metabolic profiles of hepatocellular carcinoma and cirrhosis infected with hepatitis B or hepatitis C virus Tian, Suyan Chang, Howard H Wang, Chi Jiang, Jing Wang, Xiaomei Niu, Junqi BMC Bioinformatics Research Article BACKGROUND: Over the last decade, metabolomics has evolved into a mainstream enterprise utilized by many laboratories globally. Like other “omics” data, metabolomics data has the characteristics of a smaller sample size compared to the number of features evaluated. Thus the selection of an optimal subset of features with a supervised classifier is imperative. We extended an existing feature selection algorithm, threshold gradient descent regularization (TGDR), to handle multi-class classification of “omics” data, and proposed two such extensions referred to as multi-TGDR. Both multi-TGDR frameworks were used to analyze a metabolomics dataset that compares the metabolic profiles of hepatocellular carcinoma (HCC) infected with hepatitis B (HBV) or C virus (HCV) with that of cirrhosis induced by HBV/HCV infection; the goal was to improve early-stage diagnosis of HCC. RESULTS: We applied two multi-TGDR frameworks to the HCC metabolomics data that determined TGDR thresholds either globally across classes, or locally for each class. Multi-TGDR global model selected 45 metabolites with a 0% misclassification rate (the error rate on the training data) and had a 3.82% 5-fold cross-validation (CV-5) predictive error rate. Multi-TGDR local selected 48 metabolites with a 0% misclassification rate and a 5.34% CV-5 error rate. CONCLUSIONS: One important advantage of multi-TGDR local is that it allows inference for determining which feature is related specifically to the class/classes. Thus, we recommend multi-TGDR local be used because it has similar predictive performance and requires the same computing time as multi-TGDR global, but may provide class-specific inference. BioMed Central 2014-04-04 /pmc/articles/PMC4234477/ /pubmed/24707821 http://dx.doi.org/10.1186/1471-2105-15-97 Text en Copyright © 2014 Tian et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Tian, Suyan
Chang, Howard H
Wang, Chi
Jiang, Jing
Wang, Xiaomei
Niu, Junqi
Multi-TGDR, a multi-class regularization method, identifies the metabolic profiles of hepatocellular carcinoma and cirrhosis infected with hepatitis B or hepatitis C virus
title Multi-TGDR, a multi-class regularization method, identifies the metabolic profiles of hepatocellular carcinoma and cirrhosis infected with hepatitis B or hepatitis C virus
title_full Multi-TGDR, a multi-class regularization method, identifies the metabolic profiles of hepatocellular carcinoma and cirrhosis infected with hepatitis B or hepatitis C virus
title_fullStr Multi-TGDR, a multi-class regularization method, identifies the metabolic profiles of hepatocellular carcinoma and cirrhosis infected with hepatitis B or hepatitis C virus
title_full_unstemmed Multi-TGDR, a multi-class regularization method, identifies the metabolic profiles of hepatocellular carcinoma and cirrhosis infected with hepatitis B or hepatitis C virus
title_short Multi-TGDR, a multi-class regularization method, identifies the metabolic profiles of hepatocellular carcinoma and cirrhosis infected with hepatitis B or hepatitis C virus
title_sort multi-tgdr, a multi-class regularization method, identifies the metabolic profiles of hepatocellular carcinoma and cirrhosis infected with hepatitis b or hepatitis c virus
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4234477/
https://www.ncbi.nlm.nih.gov/pubmed/24707821
http://dx.doi.org/10.1186/1471-2105-15-97
work_keys_str_mv AT tiansuyan multitgdramulticlassregularizationmethodidentifiesthemetabolicprofilesofhepatocellularcarcinomaandcirrhosisinfectedwithhepatitisborhepatitiscvirus
AT changhowardh multitgdramulticlassregularizationmethodidentifiesthemetabolicprofilesofhepatocellularcarcinomaandcirrhosisinfectedwithhepatitisborhepatitiscvirus
AT wangchi multitgdramulticlassregularizationmethodidentifiesthemetabolicprofilesofhepatocellularcarcinomaandcirrhosisinfectedwithhepatitisborhepatitiscvirus
AT jiangjing multitgdramulticlassregularizationmethodidentifiesthemetabolicprofilesofhepatocellularcarcinomaandcirrhosisinfectedwithhepatitisborhepatitiscvirus
AT wangxiaomei multitgdramulticlassregularizationmethodidentifiesthemetabolicprofilesofhepatocellularcarcinomaandcirrhosisinfectedwithhepatitisborhepatitiscvirus
AT niujunqi multitgdramulticlassregularizationmethodidentifiesthemetabolicprofilesofhepatocellularcarcinomaandcirrhosisinfectedwithhepatitisborhepatitiscvirus