Cargando…
Phylogenetic convolutional neural networks in metagenomics
BACKGROUND: Convolutional Neural Networks can be effectively used only when data are endowed with an intrinsic concept of neighbourhood in the input space, as is the case of pixels in images. We introduce here Ph-CNN, a novel deep learning architecture for the classification of metagenomics data bas...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5850953/ https://www.ncbi.nlm.nih.gov/pubmed/29536822 http://dx.doi.org/10.1186/s12859-018-2033-5 |
_version_ | 1783306311934410752 |
---|---|
author | Fioravanti, Diego Giarratano, Ylenia Maggio, Valerio Agostinelli, Claudio Chierici, Marco Jurman, Giuseppe Furlanello, Cesare |
author_facet | Fioravanti, Diego Giarratano, Ylenia Maggio, Valerio Agostinelli, Claudio Chierici, Marco Jurman, Giuseppe Furlanello, Cesare |
author_sort | Fioravanti, Diego |
collection | PubMed |
description | BACKGROUND: Convolutional Neural Networks can be effectively used only when data are endowed with an intrinsic concept of neighbourhood in the input space, as is the case of pixels in images. We introduce here Ph-CNN, a novel deep learning architecture for the classification of metagenomics data based on the Convolutional Neural Networks, with the patristic distance defined on the phylogenetic tree being used as the proximity measure. The patristic distance between variables is used together with a sparsified version of MultiDimensional Scaling to embed the phylogenetic tree in a Euclidean space. RESULTS: Ph-CNN is tested with a domain adaptation approach on synthetic data and on a metagenomics collection of gut microbiota of 38 healthy subjects and 222 Inflammatory Bowel Disease patients, divided in 6 subclasses. Classification performance is promising when compared to classical algorithms like Support Vector Machines and Random Forest and a baseline fully connected neural network, e.g. the Multi-Layer Perceptron. CONCLUSION: Ph-CNN represents a novel deep learning approach for the classification of metagenomics data. Operatively, the algorithm has been implemented as a custom Keras layer taking care of passing to the following convolutional layer not only the data but also the ranked list of neighbourhood of each sample, thus mimicking the case of image data, transparently to the user. |
format | Online Article Text |
id | pubmed-5850953 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-58509532018-03-21 Phylogenetic convolutional neural networks in metagenomics Fioravanti, Diego Giarratano, Ylenia Maggio, Valerio Agostinelli, Claudio Chierici, Marco Jurman, Giuseppe Furlanello, Cesare BMC Bioinformatics Research BACKGROUND: Convolutional Neural Networks can be effectively used only when data are endowed with an intrinsic concept of neighbourhood in the input space, as is the case of pixels in images. We introduce here Ph-CNN, a novel deep learning architecture for the classification of metagenomics data based on the Convolutional Neural Networks, with the patristic distance defined on the phylogenetic tree being used as the proximity measure. The patristic distance between variables is used together with a sparsified version of MultiDimensional Scaling to embed the phylogenetic tree in a Euclidean space. RESULTS: Ph-CNN is tested with a domain adaptation approach on synthetic data and on a metagenomics collection of gut microbiota of 38 healthy subjects and 222 Inflammatory Bowel Disease patients, divided in 6 subclasses. Classification performance is promising when compared to classical algorithms like Support Vector Machines and Random Forest and a baseline fully connected neural network, e.g. the Multi-Layer Perceptron. CONCLUSION: Ph-CNN represents a novel deep learning approach for the classification of metagenomics data. Operatively, the algorithm has been implemented as a custom Keras layer taking care of passing to the following convolutional layer not only the data but also the ranked list of neighbourhood of each sample, thus mimicking the case of image data, transparently to the user. BioMed Central 2018-03-08 /pmc/articles/PMC5850953/ /pubmed/29536822 http://dx.doi.org/10.1186/s12859-018-2033-5 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Fioravanti, Diego Giarratano, Ylenia Maggio, Valerio Agostinelli, Claudio Chierici, Marco Jurman, Giuseppe Furlanello, Cesare Phylogenetic convolutional neural networks in metagenomics |
title | Phylogenetic convolutional neural networks in metagenomics |
title_full | Phylogenetic convolutional neural networks in metagenomics |
title_fullStr | Phylogenetic convolutional neural networks in metagenomics |
title_full_unstemmed | Phylogenetic convolutional neural networks in metagenomics |
title_short | Phylogenetic convolutional neural networks in metagenomics |
title_sort | phylogenetic convolutional neural networks in metagenomics |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5850953/ https://www.ncbi.nlm.nih.gov/pubmed/29536822 http://dx.doi.org/10.1186/s12859-018-2033-5 |
work_keys_str_mv | AT fioravantidiego phylogeneticconvolutionalneuralnetworksinmetagenomics AT giarratanoylenia phylogeneticconvolutionalneuralnetworksinmetagenomics AT maggiovalerio phylogeneticconvolutionalneuralnetworksinmetagenomics AT agostinelliclaudio phylogeneticconvolutionalneuralnetworksinmetagenomics AT chiericimarco phylogeneticconvolutionalneuralnetworksinmetagenomics AT jurmangiuseppe phylogeneticconvolutionalneuralnetworksinmetagenomics AT furlanellocesare phylogeneticconvolutionalneuralnetworksinmetagenomics |