Cargando…

Selecting age-related functional characteristics in the human gut microbiome

BACKGROUND: Human gut microbial functions are often associated with various diseases and host physiologies. Aging, a less explored factor, is also suspected to affect or be affected by microbiome alterations. By combining functional feature selection with supervised classification, we aim to facilit...

Descripción completa

Detalles Bibliográficos
Autores principales: Lan, Yemin, Kriete, Andres, Rosen, Gail L
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3869192/
https://www.ncbi.nlm.nih.gov/pubmed/24467949
http://dx.doi.org/10.1186/2049-2618-1-2
_version_ 1782296535599415296
author Lan, Yemin
Kriete, Andres
Rosen, Gail L
author_facet Lan, Yemin
Kriete, Andres
Rosen, Gail L
author_sort Lan, Yemin
collection PubMed
description BACKGROUND: Human gut microbial functions are often associated with various diseases and host physiologies. Aging, a less explored factor, is also suspected to affect or be affected by microbiome alterations. By combining functional feature selection with supervised classification, we aim to facilitate identification of age-related functional characteristics in metagenomes from several human gut microbiome studies (MetaHIT, MicroAge, MicroObes, Kurokawa et al.’s and Gill et al.’s dataset). RESULTS: We apply two feature selection methods, term frequency-inverse document frequency (TF-iDF) and minimum-redundancy maximum-relevancy (mRMR), to identify functional signatures that differentiate metagenomes by age. After features are reduced, we use a support vector machine (SVM) to predict host age of new metagenomes. Functional features are from protein families (Pfams), Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways, KEGG ontologies and the Gene Ontology (GO) database. Initial investigations demonstrate that ordination of the functional principal components shows great overlap between different age groups. However, when feature selection is applied, mRMR tightens the ordination cluster for each age group, and TF-iDF offers better linear separation. Both TF-iDF and mRMR were used in conjunction with a SVM classifier and achieved areas under receiver operating characteristic curves (AUCs) 10 to 15% above chance to classify individuals above/below mid-ages (about 38 to 43 years old) using Pfams. Better performance around mid-ages is also observed when using other functional categories and age-balanced dataset. We also identified some age-related Pfams that improved age discrimination at age 65 with another feature selection method called LEfSe, on an age-balanced dataset. The selected functional characteristics identify a broad range of age-relevant metabolisms, such as reduced vitamin B12 synthesis, reduced activity of reductases, increased DNA damage, occurrences of stress responses and immune system compromise, and upregulated glycosyltransferases in the aging population. CONCLUSIONS: Feature selection can yield biologically meaningful results when used in conjunction with classification, and makes age classification of new human gut metagenomes feasible. While we demonstrate the promise of this approach, the data-dependent prediction performance could be further improved. We hypothesize that while the Qin et al. dataset is the most comprehensive to date, even deeper sampling is needed to better characterize and predict the microbiomes’ functional content.
format Online
Article
Text
id pubmed-3869192
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-38691922013-12-30 Selecting age-related functional characteristics in the human gut microbiome Lan, Yemin Kriete, Andres Rosen, Gail L Microbiome Research BACKGROUND: Human gut microbial functions are often associated with various diseases and host physiologies. Aging, a less explored factor, is also suspected to affect or be affected by microbiome alterations. By combining functional feature selection with supervised classification, we aim to facilitate identification of age-related functional characteristics in metagenomes from several human gut microbiome studies (MetaHIT, MicroAge, MicroObes, Kurokawa et al.’s and Gill et al.’s dataset). RESULTS: We apply two feature selection methods, term frequency-inverse document frequency (TF-iDF) and minimum-redundancy maximum-relevancy (mRMR), to identify functional signatures that differentiate metagenomes by age. After features are reduced, we use a support vector machine (SVM) to predict host age of new metagenomes. Functional features are from protein families (Pfams), Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways, KEGG ontologies and the Gene Ontology (GO) database. Initial investigations demonstrate that ordination of the functional principal components shows great overlap between different age groups. However, when feature selection is applied, mRMR tightens the ordination cluster for each age group, and TF-iDF offers better linear separation. Both TF-iDF and mRMR were used in conjunction with a SVM classifier and achieved areas under receiver operating characteristic curves (AUCs) 10 to 15% above chance to classify individuals above/below mid-ages (about 38 to 43 years old) using Pfams. Better performance around mid-ages is also observed when using other functional categories and age-balanced dataset. We also identified some age-related Pfams that improved age discrimination at age 65 with another feature selection method called LEfSe, on an age-balanced dataset. The selected functional characteristics identify a broad range of age-relevant metabolisms, such as reduced vitamin B12 synthesis, reduced activity of reductases, increased DNA damage, occurrences of stress responses and immune system compromise, and upregulated glycosyltransferases in the aging population. CONCLUSIONS: Feature selection can yield biologically meaningful results when used in conjunction with classification, and makes age classification of new human gut metagenomes feasible. While we demonstrate the promise of this approach, the data-dependent prediction performance could be further improved. We hypothesize that while the Qin et al. dataset is the most comprehensive to date, even deeper sampling is needed to better characterize and predict the microbiomes’ functional content. BioMed Central 2013-01-09 /pmc/articles/PMC3869192/ /pubmed/24467949 http://dx.doi.org/10.1186/2049-2618-1-2 Text en Copyright © 2013 Lan et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Lan, Yemin
Kriete, Andres
Rosen, Gail L
Selecting age-related functional characteristics in the human gut microbiome
title Selecting age-related functional characteristics in the human gut microbiome
title_full Selecting age-related functional characteristics in the human gut microbiome
title_fullStr Selecting age-related functional characteristics in the human gut microbiome
title_full_unstemmed Selecting age-related functional characteristics in the human gut microbiome
title_short Selecting age-related functional characteristics in the human gut microbiome
title_sort selecting age-related functional characteristics in the human gut microbiome
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3869192/
https://www.ncbi.nlm.nih.gov/pubmed/24467949
http://dx.doi.org/10.1186/2049-2618-1-2
work_keys_str_mv AT lanyemin selectingagerelatedfunctionalcharacteristicsinthehumangutmicrobiome
AT krieteandres selectingagerelatedfunctionalcharacteristicsinthehumangutmicrobiome
AT rosengaill selectingagerelatedfunctionalcharacteristicsinthehumangutmicrobiome