Cargando…

MKL-GRNI: A parallel multiple kernel learning approach for supervised inference of large-scale gene regulatory networks

High throughput multi-omics data generation coupled with heterogeneous genomic data fusion are defining new ways to build computational inference models. These models are scalable and can support very large genome sizes with the added advantage of exploiting additional biological knowledge from the...

Descripción completa

Detalles Bibliográficos
Autores principales: Wani, Nisar, Raza, Khalid
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7924726/
https://www.ncbi.nlm.nih.gov/pubmed/33817013
http://dx.doi.org/10.7717/peerj-cs.363
_version_ 1783659150612365312
author Wani, Nisar
Raza, Khalid
author_facet Wani, Nisar
Raza, Khalid
author_sort Wani, Nisar
collection PubMed
description High throughput multi-omics data generation coupled with heterogeneous genomic data fusion are defining new ways to build computational inference models. These models are scalable and can support very large genome sizes with the added advantage of exploiting additional biological knowledge from the integration framework. However, the limitation with such an arrangement is the huge computational cost involved when learning from very large datasets in a sequential execution environment. To overcome this issue, we present a multiple kernel learning (MKL) based gene regulatory network (GRN) inference approach wherein multiple heterogeneous datasets are fused using MKL paradigm. We formulate the GRN learning problem as a supervised classification problem, whereby genes regulated by a specific transcription factor are separated from other non-regulated genes. A parallel execution architecture is devised to learn a large scale GRN by decomposing the initial classification problem into a number of subproblems that run as multiple processes on a multi-processor machine. We evaluate the approach in terms of increased speedup and inference potential using genomic data from Escherichia coli, Saccharomyces cerevisiae and Homo sapiens. The results thus obtained demonstrate that the proposed method exhibits better classification accuracy and enhanced speedup compared to other state-of-the-art methods while learning large scale GRNs from multiple and heterogeneous datasets.
format Online
Article
Text
id pubmed-7924726
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-79247262021-04-02 MKL-GRNI: A parallel multiple kernel learning approach for supervised inference of large-scale gene regulatory networks Wani, Nisar Raza, Khalid PeerJ Comput Sci Bioinformatics High throughput multi-omics data generation coupled with heterogeneous genomic data fusion are defining new ways to build computational inference models. These models are scalable and can support very large genome sizes with the added advantage of exploiting additional biological knowledge from the integration framework. However, the limitation with such an arrangement is the huge computational cost involved when learning from very large datasets in a sequential execution environment. To overcome this issue, we present a multiple kernel learning (MKL) based gene regulatory network (GRN) inference approach wherein multiple heterogeneous datasets are fused using MKL paradigm. We formulate the GRN learning problem as a supervised classification problem, whereby genes regulated by a specific transcription factor are separated from other non-regulated genes. A parallel execution architecture is devised to learn a large scale GRN by decomposing the initial classification problem into a number of subproblems that run as multiple processes on a multi-processor machine. We evaluate the approach in terms of increased speedup and inference potential using genomic data from Escherichia coli, Saccharomyces cerevisiae and Homo sapiens. The results thus obtained demonstrate that the proposed method exhibits better classification accuracy and enhanced speedup compared to other state-of-the-art methods while learning large scale GRNs from multiple and heterogeneous datasets. PeerJ Inc. 2021-01-28 /pmc/articles/PMC7924726/ /pubmed/33817013 http://dx.doi.org/10.7717/peerj-cs.363 Text en © 2021 Wani and Raza https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
Wani, Nisar
Raza, Khalid
MKL-GRNI: A parallel multiple kernel learning approach for supervised inference of large-scale gene regulatory networks
title MKL-GRNI: A parallel multiple kernel learning approach for supervised inference of large-scale gene regulatory networks
title_full MKL-GRNI: A parallel multiple kernel learning approach for supervised inference of large-scale gene regulatory networks
title_fullStr MKL-GRNI: A parallel multiple kernel learning approach for supervised inference of large-scale gene regulatory networks
title_full_unstemmed MKL-GRNI: A parallel multiple kernel learning approach for supervised inference of large-scale gene regulatory networks
title_short MKL-GRNI: A parallel multiple kernel learning approach for supervised inference of large-scale gene regulatory networks
title_sort mkl-grni: a parallel multiple kernel learning approach for supervised inference of large-scale gene regulatory networks
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7924726/
https://www.ncbi.nlm.nih.gov/pubmed/33817013
http://dx.doi.org/10.7717/peerj-cs.363
work_keys_str_mv AT waninisar mklgrniaparallelmultiplekernellearningapproachforsupervisedinferenceoflargescalegeneregulatorynetworks
AT razakhalid mklgrniaparallelmultiplekernellearningapproachforsupervisedinferenceoflargescalegeneregulatorynetworks