Cargando…

Phylogenetic convolutional neural networks in metagenomics

BACKGROUND: Convolutional Neural Networks can be effectively used only when data are endowed with an intrinsic concept of neighbourhood in the input space, as is the case of pixels in images. We introduce here Ph-CNN, a novel deep learning architecture for the classification of metagenomics data bas...

Descripción completa

Detalles Bibliográficos
Autores principales: Fioravanti, Diego, Giarratano, Ylenia, Maggio, Valerio, Agostinelli, Claudio, Chierici, Marco, Jurman, Giuseppe, Furlanello, Cesare
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5850953/
https://www.ncbi.nlm.nih.gov/pubmed/29536822
http://dx.doi.org/10.1186/s12859-018-2033-5
_version_ 1783306311934410752
author Fioravanti, Diego
Giarratano, Ylenia
Maggio, Valerio
Agostinelli, Claudio
Chierici, Marco
Jurman, Giuseppe
Furlanello, Cesare
author_facet Fioravanti, Diego
Giarratano, Ylenia
Maggio, Valerio
Agostinelli, Claudio
Chierici, Marco
Jurman, Giuseppe
Furlanello, Cesare
author_sort Fioravanti, Diego
collection PubMed
description BACKGROUND: Convolutional Neural Networks can be effectively used only when data are endowed with an intrinsic concept of neighbourhood in the input space, as is the case of pixels in images. We introduce here Ph-CNN, a novel deep learning architecture for the classification of metagenomics data based on the Convolutional Neural Networks, with the patristic distance defined on the phylogenetic tree being used as the proximity measure. The patristic distance between variables is used together with a sparsified version of MultiDimensional Scaling to embed the phylogenetic tree in a Euclidean space. RESULTS: Ph-CNN is tested with a domain adaptation approach on synthetic data and on a metagenomics collection of gut microbiota of 38 healthy subjects and 222 Inflammatory Bowel Disease patients, divided in 6 subclasses. Classification performance is promising when compared to classical algorithms like Support Vector Machines and Random Forest and a baseline fully connected neural network, e.g. the Multi-Layer Perceptron. CONCLUSION: Ph-CNN represents a novel deep learning approach for the classification of metagenomics data. Operatively, the algorithm has been implemented as a custom Keras layer taking care of passing to the following convolutional layer not only the data but also the ranked list of neighbourhood of each sample, thus mimicking the case of image data, transparently to the user.
format Online
Article
Text
id pubmed-5850953
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-58509532018-03-21 Phylogenetic convolutional neural networks in metagenomics Fioravanti, Diego Giarratano, Ylenia Maggio, Valerio Agostinelli, Claudio Chierici, Marco Jurman, Giuseppe Furlanello, Cesare BMC Bioinformatics Research BACKGROUND: Convolutional Neural Networks can be effectively used only when data are endowed with an intrinsic concept of neighbourhood in the input space, as is the case of pixels in images. We introduce here Ph-CNN, a novel deep learning architecture for the classification of metagenomics data based on the Convolutional Neural Networks, with the patristic distance defined on the phylogenetic tree being used as the proximity measure. The patristic distance between variables is used together with a sparsified version of MultiDimensional Scaling to embed the phylogenetic tree in a Euclidean space. RESULTS: Ph-CNN is tested with a domain adaptation approach on synthetic data and on a metagenomics collection of gut microbiota of 38 healthy subjects and 222 Inflammatory Bowel Disease patients, divided in 6 subclasses. Classification performance is promising when compared to classical algorithms like Support Vector Machines and Random Forest and a baseline fully connected neural network, e.g. the Multi-Layer Perceptron. CONCLUSION: Ph-CNN represents a novel deep learning approach for the classification of metagenomics data. Operatively, the algorithm has been implemented as a custom Keras layer taking care of passing to the following convolutional layer not only the data but also the ranked list of neighbourhood of each sample, thus mimicking the case of image data, transparently to the user. BioMed Central 2018-03-08 /pmc/articles/PMC5850953/ /pubmed/29536822 http://dx.doi.org/10.1186/s12859-018-2033-5 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Fioravanti, Diego
Giarratano, Ylenia
Maggio, Valerio
Agostinelli, Claudio
Chierici, Marco
Jurman, Giuseppe
Furlanello, Cesare
Phylogenetic convolutional neural networks in metagenomics
title Phylogenetic convolutional neural networks in metagenomics
title_full Phylogenetic convolutional neural networks in metagenomics
title_fullStr Phylogenetic convolutional neural networks in metagenomics
title_full_unstemmed Phylogenetic convolutional neural networks in metagenomics
title_short Phylogenetic convolutional neural networks in metagenomics
title_sort phylogenetic convolutional neural networks in metagenomics
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5850953/
https://www.ncbi.nlm.nih.gov/pubmed/29536822
http://dx.doi.org/10.1186/s12859-018-2033-5
work_keys_str_mv AT fioravantidiego phylogeneticconvolutionalneuralnetworksinmetagenomics
AT giarratanoylenia phylogeneticconvolutionalneuralnetworksinmetagenomics
AT maggiovalerio phylogeneticconvolutionalneuralnetworksinmetagenomics
AT agostinelliclaudio phylogeneticconvolutionalneuralnetworksinmetagenomics
AT chiericimarco phylogeneticconvolutionalneuralnetworksinmetagenomics
AT jurmangiuseppe phylogeneticconvolutionalneuralnetworksinmetagenomics
AT furlanellocesare phylogeneticconvolutionalneuralnetworksinmetagenomics