Cargando…

Extending Association Rule Mining to Microbiome Pattern Analysis: Tools and Guidelines to Support Real Applications

Boosted by the exponential growth of microbiome-based studies, analyzing microbiome patterns is now a hot-topic, finding different fields of application. In particular, the use of machine learning techniques is increasing in microbiome studies, providing deep insights into microbial community compos...

Descripción completa

Detalles Bibliográficos
Autores principales: Giulia, Agostinetto, Anna, Sandionigi, Antonia, Bruno, Dario, Pescini, Maurizio, Casiraghi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9580939/
https://www.ncbi.nlm.nih.gov/pubmed/36303759
http://dx.doi.org/10.3389/fbinf.2021.794547
_version_ 1784812505141346304
author Giulia, Agostinetto
Anna, Sandionigi
Antonia, Bruno
Dario, Pescini
Maurizio, Casiraghi
author_facet Giulia, Agostinetto
Anna, Sandionigi
Antonia, Bruno
Dario, Pescini
Maurizio, Casiraghi
author_sort Giulia, Agostinetto
collection PubMed
description Boosted by the exponential growth of microbiome-based studies, analyzing microbiome patterns is now a hot-topic, finding different fields of application. In particular, the use of machine learning techniques is increasing in microbiome studies, providing deep insights into microbial community composition. In this context, in order to investigate microbial patterns from 16S rRNA metabarcoding data, we explored the effectiveness of Association Rule Mining (ARM) technique, a supervised-machine learning procedure, to extract patterns (in this work, intended as groups of species or taxa) from microbiome data. ARM can generate huge amounts of data, making spurious information removal and visualizing results challenging. Our work sheds light on the strengths and weaknesses of pattern mining strategy into the study of microbial patterns, in particular from 16S rRNA microbiome datasets, applying ARM on real case studies and providing guidelines for future usage. Our results highlighted issues related to the type of input and the use of metadata in microbial pattern extraction, identifying the key steps that must be considered to apply ARM consciously on 16S rRNA microbiome data. To promote the use of ARM and the visualization of microbiome patterns, specifically, we developed microFIM (microbial Frequent Itemset Mining), a versatile Python tool that facilitates the use of ARM integrating common microbiome outputs, such as taxa tables. microFIM implements interest measures to remove spurious information and merges the results of ARM analysis with the common microbiome outputs, providing similar microbiome strategies that help scientists to integrate ARM in microbiome applications. With this work, we aimed at creating a bridge between microbial ecology researchers and ARM technique, making researchers aware about the strength and weaknesses of association rule mining approach.
format Online
Article
Text
id pubmed-9580939
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-95809392022-10-26 Extending Association Rule Mining to Microbiome Pattern Analysis: Tools and Guidelines to Support Real Applications Giulia, Agostinetto Anna, Sandionigi Antonia, Bruno Dario, Pescini Maurizio, Casiraghi Front Bioinform Bioinformatics Boosted by the exponential growth of microbiome-based studies, analyzing microbiome patterns is now a hot-topic, finding different fields of application. In particular, the use of machine learning techniques is increasing in microbiome studies, providing deep insights into microbial community composition. In this context, in order to investigate microbial patterns from 16S rRNA metabarcoding data, we explored the effectiveness of Association Rule Mining (ARM) technique, a supervised-machine learning procedure, to extract patterns (in this work, intended as groups of species or taxa) from microbiome data. ARM can generate huge amounts of data, making spurious information removal and visualizing results challenging. Our work sheds light on the strengths and weaknesses of pattern mining strategy into the study of microbial patterns, in particular from 16S rRNA microbiome datasets, applying ARM on real case studies and providing guidelines for future usage. Our results highlighted issues related to the type of input and the use of metadata in microbial pattern extraction, identifying the key steps that must be considered to apply ARM consciously on 16S rRNA microbiome data. To promote the use of ARM and the visualization of microbiome patterns, specifically, we developed microFIM (microbial Frequent Itemset Mining), a versatile Python tool that facilitates the use of ARM integrating common microbiome outputs, such as taxa tables. microFIM implements interest measures to remove spurious information and merges the results of ARM analysis with the common microbiome outputs, providing similar microbiome strategies that help scientists to integrate ARM in microbiome applications. With this work, we aimed at creating a bridge between microbial ecology researchers and ARM technique, making researchers aware about the strength and weaknesses of association rule mining approach. Frontiers Media S.A. 2022-01-10 /pmc/articles/PMC9580939/ /pubmed/36303759 http://dx.doi.org/10.3389/fbinf.2021.794547 Text en Copyright © 2022 Giulia, Anna, Antonia, Dario and Maurizio. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Bioinformatics
Giulia, Agostinetto
Anna, Sandionigi
Antonia, Bruno
Dario, Pescini
Maurizio, Casiraghi
Extending Association Rule Mining to Microbiome Pattern Analysis: Tools and Guidelines to Support Real Applications
title Extending Association Rule Mining to Microbiome Pattern Analysis: Tools and Guidelines to Support Real Applications
title_full Extending Association Rule Mining to Microbiome Pattern Analysis: Tools and Guidelines to Support Real Applications
title_fullStr Extending Association Rule Mining to Microbiome Pattern Analysis: Tools and Guidelines to Support Real Applications
title_full_unstemmed Extending Association Rule Mining to Microbiome Pattern Analysis: Tools and Guidelines to Support Real Applications
title_short Extending Association Rule Mining to Microbiome Pattern Analysis: Tools and Guidelines to Support Real Applications
title_sort extending association rule mining to microbiome pattern analysis: tools and guidelines to support real applications
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9580939/
https://www.ncbi.nlm.nih.gov/pubmed/36303759
http://dx.doi.org/10.3389/fbinf.2021.794547
work_keys_str_mv AT giuliaagostinetto extendingassociationruleminingtomicrobiomepatternanalysistoolsandguidelinestosupportrealapplications
AT annasandionigi extendingassociationruleminingtomicrobiomepatternanalysistoolsandguidelinestosupportrealapplications
AT antoniabruno extendingassociationruleminingtomicrobiomepatternanalysistoolsandguidelinestosupportrealapplications
AT dariopescini extendingassociationruleminingtomicrobiomepatternanalysistoolsandguidelinestosupportrealapplications
AT mauriziocasiraghi extendingassociationruleminingtomicrobiomepatternanalysistoolsandguidelinestosupportrealapplications