Cargando…

Logicome Profiler: Exhaustive detection of statistically significant logic relationships from comparative omics data

Logic relationship analysis is a data mining method that comprehensively detects item triplets that satisfy logic relationships from a binary matrix dataset, such as an ortholog table in comparative genomics. Thanks to recent technological advancements, many binary matrix datasets are now being prod...

Descripción completa

Detalles Bibliográficos
Autores principales: Fukunaga, Tsukasa, Iwasaki, Wataru
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7194410/
https://www.ncbi.nlm.nih.gov/pubmed/32357172
http://dx.doi.org/10.1371/journal.pone.0232106
_version_ 1783528335430647808
author Fukunaga, Tsukasa
Iwasaki, Wataru
author_facet Fukunaga, Tsukasa
Iwasaki, Wataru
author_sort Fukunaga, Tsukasa
collection PubMed
description Logic relationship analysis is a data mining method that comprehensively detects item triplets that satisfy logic relationships from a binary matrix dataset, such as an ortholog table in comparative genomics. Thanks to recent technological advancements, many binary matrix datasets are now being produced in genomics, transcriptomics, epigenomics, metagenomics, and many other fields for comparative purposes. However, regardless of presumed interpretability and importance of logic relationships, existing data mining methods are not based on the framework of statistical hypothesis testing. That means, the type-1 and type-2 error rates are neither controlled nor estimated. Here, we developed Logicome Profiler, which exhaustively detects statistically significant triplet logic relationships from a binary matrix dataset (Logicome means ome of logics). To test all item triplets in a dataset while avoiding false positives, Logicome Profiler adjusts a significance level by the Bonferroni or Benjamini-Yekutieli method for the multiple testing correction. Its application to an ocean metagenomic dataset showed that Logicome Profiler can effectively detect statistically significant triplet logic relationships among environmental microbes and genes, which include those among urea transporter, urease, and photosynthesis-related genes. Beyond omics data analysis, Logicome Profiler is applicable to various binary matrix datasets in general for finding significant triplet logic relationships. The source code is available at https://github.com/fukunagatsu/LogicomeProfiler.
format Online
Article
Text
id pubmed-7194410
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-71944102020-05-12 Logicome Profiler: Exhaustive detection of statistically significant logic relationships from comparative omics data Fukunaga, Tsukasa Iwasaki, Wataru PLoS One Research Article Logic relationship analysis is a data mining method that comprehensively detects item triplets that satisfy logic relationships from a binary matrix dataset, such as an ortholog table in comparative genomics. Thanks to recent technological advancements, many binary matrix datasets are now being produced in genomics, transcriptomics, epigenomics, metagenomics, and many other fields for comparative purposes. However, regardless of presumed interpretability and importance of logic relationships, existing data mining methods are not based on the framework of statistical hypothesis testing. That means, the type-1 and type-2 error rates are neither controlled nor estimated. Here, we developed Logicome Profiler, which exhaustively detects statistically significant triplet logic relationships from a binary matrix dataset (Logicome means ome of logics). To test all item triplets in a dataset while avoiding false positives, Logicome Profiler adjusts a significance level by the Bonferroni or Benjamini-Yekutieli method for the multiple testing correction. Its application to an ocean metagenomic dataset showed that Logicome Profiler can effectively detect statistically significant triplet logic relationships among environmental microbes and genes, which include those among urea transporter, urease, and photosynthesis-related genes. Beyond omics data analysis, Logicome Profiler is applicable to various binary matrix datasets in general for finding significant triplet logic relationships. The source code is available at https://github.com/fukunagatsu/LogicomeProfiler. Public Library of Science 2020-05-01 /pmc/articles/PMC7194410/ /pubmed/32357172 http://dx.doi.org/10.1371/journal.pone.0232106 Text en © 2020 Fukunaga, Iwasaki http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Fukunaga, Tsukasa
Iwasaki, Wataru
Logicome Profiler: Exhaustive detection of statistically significant logic relationships from comparative omics data
title Logicome Profiler: Exhaustive detection of statistically significant logic relationships from comparative omics data
title_full Logicome Profiler: Exhaustive detection of statistically significant logic relationships from comparative omics data
title_fullStr Logicome Profiler: Exhaustive detection of statistically significant logic relationships from comparative omics data
title_full_unstemmed Logicome Profiler: Exhaustive detection of statistically significant logic relationships from comparative omics data
title_short Logicome Profiler: Exhaustive detection of statistically significant logic relationships from comparative omics data
title_sort logicome profiler: exhaustive detection of statistically significant logic relationships from comparative omics data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7194410/
https://www.ncbi.nlm.nih.gov/pubmed/32357172
http://dx.doi.org/10.1371/journal.pone.0232106
work_keys_str_mv AT fukunagatsukasa logicomeprofilerexhaustivedetectionofstatisticallysignificantlogicrelationshipsfromcomparativeomicsdata
AT iwasakiwataru logicomeprofilerexhaustivedetectionofstatisticallysignificantlogicrelationshipsfromcomparativeomicsdata