Cargando…
Convolutional neural networks improve fungal classification
Sequence classification plays an important role in metagenomics studies. We assess the deep neural network approach for fungal sequence classification as it has emerged as a successful paradigm for big data classification and clustering. Two deep learning-based classifiers, a convolutional neural ne...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7387343/ https://www.ncbi.nlm.nih.gov/pubmed/32724224 http://dx.doi.org/10.1038/s41598-020-69245-y |
_version_ | 1783564101400657920 |
---|---|
author | Vu, Duong Groenewald, Marizeth Verkley, Gerard |
author_facet | Vu, Duong Groenewald, Marizeth Verkley, Gerard |
author_sort | Vu, Duong |
collection | PubMed |
description | Sequence classification plays an important role in metagenomics studies. We assess the deep neural network approach for fungal sequence classification as it has emerged as a successful paradigm for big data classification and clustering. Two deep learning-based classifiers, a convolutional neural network (CNN) and a deep belief network (DBN) were trained using our recently released barcode datasets. Experimental results show that CNN outperformed the traditional BLAST classification and the most accurate machine learning based Ribosomal Database Project (RDP) classifier on datasets that had many of the labels present in the training datasets. When classifying an independent dataset namely the “Top 50 Most Wanted Fungi”, CNN and DBN assigned less sequences than BLAST. However, they could assign much more sequences than the RDP classifier. In terms of efficiency, it took the machine learning classifiers up to two seconds to classify a test dataset while it was 53 s for BLAST. The result of the current study will enable us to speed up the taxonomic assignments for the fungal barcode sequences generated at our institute as ~ 70% of them still need to be validated for public release. In addition, it will help to quickly provide a taxonomic profile for metagenomics samples. |
format | Online Article Text |
id | pubmed-7387343 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-73873432020-07-29 Convolutional neural networks improve fungal classification Vu, Duong Groenewald, Marizeth Verkley, Gerard Sci Rep Article Sequence classification plays an important role in metagenomics studies. We assess the deep neural network approach for fungal sequence classification as it has emerged as a successful paradigm for big data classification and clustering. Two deep learning-based classifiers, a convolutional neural network (CNN) and a deep belief network (DBN) were trained using our recently released barcode datasets. Experimental results show that CNN outperformed the traditional BLAST classification and the most accurate machine learning based Ribosomal Database Project (RDP) classifier on datasets that had many of the labels present in the training datasets. When classifying an independent dataset namely the “Top 50 Most Wanted Fungi”, CNN and DBN assigned less sequences than BLAST. However, they could assign much more sequences than the RDP classifier. In terms of efficiency, it took the machine learning classifiers up to two seconds to classify a test dataset while it was 53 s for BLAST. The result of the current study will enable us to speed up the taxonomic assignments for the fungal barcode sequences generated at our institute as ~ 70% of them still need to be validated for public release. In addition, it will help to quickly provide a taxonomic profile for metagenomics samples. Nature Publishing Group UK 2020-07-28 /pmc/articles/PMC7387343/ /pubmed/32724224 http://dx.doi.org/10.1038/s41598-020-69245-y Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Vu, Duong Groenewald, Marizeth Verkley, Gerard Convolutional neural networks improve fungal classification |
title | Convolutional neural networks improve fungal classification |
title_full | Convolutional neural networks improve fungal classification |
title_fullStr | Convolutional neural networks improve fungal classification |
title_full_unstemmed | Convolutional neural networks improve fungal classification |
title_short | Convolutional neural networks improve fungal classification |
title_sort | convolutional neural networks improve fungal classification |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7387343/ https://www.ncbi.nlm.nih.gov/pubmed/32724224 http://dx.doi.org/10.1038/s41598-020-69245-y |
work_keys_str_mv | AT vuduong convolutionalneuralnetworksimprovefungalclassification AT groenewaldmarizeth convolutionalneuralnetworksimprovefungalclassification AT verkleygerard convolutionalneuralnetworksimprovefungalclassification |