Cargando…

Convolutional neural networks improve fungal classification

Sequence classification plays an important role in metagenomics studies. We assess the deep neural network approach for fungal sequence classification as it has emerged as a successful paradigm for big data classification and clustering. Two deep learning-based classifiers, a convolutional neural ne...

Descripción completa

Detalles Bibliográficos
Autores principales: Vu, Duong, Groenewald, Marizeth, Verkley, Gerard
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7387343/
https://www.ncbi.nlm.nih.gov/pubmed/32724224
http://dx.doi.org/10.1038/s41598-020-69245-y
_version_ 1783564101400657920
author Vu, Duong
Groenewald, Marizeth
Verkley, Gerard
author_facet Vu, Duong
Groenewald, Marizeth
Verkley, Gerard
author_sort Vu, Duong
collection PubMed
description Sequence classification plays an important role in metagenomics studies. We assess the deep neural network approach for fungal sequence classification as it has emerged as a successful paradigm for big data classification and clustering. Two deep learning-based classifiers, a convolutional neural network (CNN) and a deep belief network (DBN) were trained using our recently released barcode datasets. Experimental results show that CNN outperformed the traditional BLAST classification and the most accurate machine learning based Ribosomal Database Project (RDP) classifier on datasets that had many of the labels present in the training datasets. When classifying an independent dataset namely the “Top 50 Most Wanted Fungi”, CNN and DBN assigned less sequences than BLAST. However, they could assign much more sequences than the RDP classifier. In terms of efficiency, it took the machine learning classifiers up to two seconds to classify a test dataset while it was 53 s for BLAST. The result of the current study will enable us to speed up the taxonomic assignments for the fungal barcode sequences generated at our institute as ~ 70% of them still need to be validated for public release. In addition, it will help to quickly provide a taxonomic profile for metagenomics samples.
format Online
Article
Text
id pubmed-7387343
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-73873432020-07-29 Convolutional neural networks improve fungal classification Vu, Duong Groenewald, Marizeth Verkley, Gerard Sci Rep Article Sequence classification plays an important role in metagenomics studies. We assess the deep neural network approach for fungal sequence classification as it has emerged as a successful paradigm for big data classification and clustering. Two deep learning-based classifiers, a convolutional neural network (CNN) and a deep belief network (DBN) were trained using our recently released barcode datasets. Experimental results show that CNN outperformed the traditional BLAST classification and the most accurate machine learning based Ribosomal Database Project (RDP) classifier on datasets that had many of the labels present in the training datasets. When classifying an independent dataset namely the “Top 50 Most Wanted Fungi”, CNN and DBN assigned less sequences than BLAST. However, they could assign much more sequences than the RDP classifier. In terms of efficiency, it took the machine learning classifiers up to two seconds to classify a test dataset while it was 53 s for BLAST. The result of the current study will enable us to speed up the taxonomic assignments for the fungal barcode sequences generated at our institute as ~ 70% of them still need to be validated for public release. In addition, it will help to quickly provide a taxonomic profile for metagenomics samples. Nature Publishing Group UK 2020-07-28 /pmc/articles/PMC7387343/ /pubmed/32724224 http://dx.doi.org/10.1038/s41598-020-69245-y Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Vu, Duong
Groenewald, Marizeth
Verkley, Gerard
Convolutional neural networks improve fungal classification
title Convolutional neural networks improve fungal classification
title_full Convolutional neural networks improve fungal classification
title_fullStr Convolutional neural networks improve fungal classification
title_full_unstemmed Convolutional neural networks improve fungal classification
title_short Convolutional neural networks improve fungal classification
title_sort convolutional neural networks improve fungal classification
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7387343/
https://www.ncbi.nlm.nih.gov/pubmed/32724224
http://dx.doi.org/10.1038/s41598-020-69245-y
work_keys_str_mv AT vuduong convolutionalneuralnetworksimprovefungalclassification
AT groenewaldmarizeth convolutionalneuralnetworksimprovefungalclassification
AT verkleygerard convolutionalneuralnetworksimprovefungalclassification