Cargando…

Class Prediction and Feature Selection with Linear Optimization for Metagenomic Count Data

The amount of metagenomic data is growing rapidly while the computational methods for metagenome analysis are still in their infancy. It is important to develop novel statistical learning tools for the prediction of associations between bacterial communities and disease phenotypes and for the detect...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Zhenqiu, Chen, Dechang, Sheng, Li, Liu, Amy Y.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3608598/
https://www.ncbi.nlm.nih.gov/pubmed/23555553
http://dx.doi.org/10.1371/journal.pone.0053253
_version_ 1782264252284796928
author Liu, Zhenqiu
Chen, Dechang
Sheng, Li
Liu, Amy Y.
author_facet Liu, Zhenqiu
Chen, Dechang
Sheng, Li
Liu, Amy Y.
author_sort Liu, Zhenqiu
collection PubMed
description The amount of metagenomic data is growing rapidly while the computational methods for metagenome analysis are still in their infancy. It is important to develop novel statistical learning tools for the prediction of associations between bacterial communities and disease phenotypes and for the detection of differentially abundant features. In this study, we presented a novel statistical learning method for simultaneous association prediction and feature selection with metagenomic samples from two or multiple treatment populations on the basis of count data. We developed a linear programming based support vector machine with [Image: see text] and joint [Image: see text] penalties for binary and multiclass classifications with metagenomic count data (metalinprog). We evaluated the performance of our method on several real and simulation datasets. The proposed method can simultaneously identify features and predict classes with the metagenomic count data.
format Online
Article
Text
id pubmed-3608598
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-36085982013-04-03 Class Prediction and Feature Selection with Linear Optimization for Metagenomic Count Data Liu, Zhenqiu Chen, Dechang Sheng, Li Liu, Amy Y. PLoS One Research Article The amount of metagenomic data is growing rapidly while the computational methods for metagenome analysis are still in their infancy. It is important to develop novel statistical learning tools for the prediction of associations between bacterial communities and disease phenotypes and for the detection of differentially abundant features. In this study, we presented a novel statistical learning method for simultaneous association prediction and feature selection with metagenomic samples from two or multiple treatment populations on the basis of count data. We developed a linear programming based support vector machine with [Image: see text] and joint [Image: see text] penalties for binary and multiclass classifications with metagenomic count data (metalinprog). We evaluated the performance of our method on several real and simulation datasets. The proposed method can simultaneously identify features and predict classes with the metagenomic count data. Public Library of Science 2013-03-26 /pmc/articles/PMC3608598/ /pubmed/23555553 http://dx.doi.org/10.1371/journal.pone.0053253 Text en © 2013 Liu et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Liu, Zhenqiu
Chen, Dechang
Sheng, Li
Liu, Amy Y.
Class Prediction and Feature Selection with Linear Optimization for Metagenomic Count Data
title Class Prediction and Feature Selection with Linear Optimization for Metagenomic Count Data
title_full Class Prediction and Feature Selection with Linear Optimization for Metagenomic Count Data
title_fullStr Class Prediction and Feature Selection with Linear Optimization for Metagenomic Count Data
title_full_unstemmed Class Prediction and Feature Selection with Linear Optimization for Metagenomic Count Data
title_short Class Prediction and Feature Selection with Linear Optimization for Metagenomic Count Data
title_sort class prediction and feature selection with linear optimization for metagenomic count data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3608598/
https://www.ncbi.nlm.nih.gov/pubmed/23555553
http://dx.doi.org/10.1371/journal.pone.0053253
work_keys_str_mv AT liuzhenqiu classpredictionandfeatureselectionwithlinearoptimizationformetagenomiccountdata
AT chendechang classpredictionandfeatureselectionwithlinearoptimizationformetagenomiccountdata
AT shengli classpredictionandfeatureselectionwithlinearoptimizationformetagenomiccountdata
AT liuamyy classpredictionandfeatureselectionwithlinearoptimizationformetagenomiccountdata