Cargando…
MKL-GRNI: A parallel multiple kernel learning approach for supervised inference of large-scale gene regulatory networks
High throughput multi-omics data generation coupled with heterogeneous genomic data fusion are defining new ways to build computational inference models. These models are scalable and can support very large genome sizes with the added advantage of exploiting additional biological knowledge from the...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7924726/ https://www.ncbi.nlm.nih.gov/pubmed/33817013 http://dx.doi.org/10.7717/peerj-cs.363 |
_version_ | 1783659150612365312 |
---|---|
author | Wani, Nisar Raza, Khalid |
author_facet | Wani, Nisar Raza, Khalid |
author_sort | Wani, Nisar |
collection | PubMed |
description | High throughput multi-omics data generation coupled with heterogeneous genomic data fusion are defining new ways to build computational inference models. These models are scalable and can support very large genome sizes with the added advantage of exploiting additional biological knowledge from the integration framework. However, the limitation with such an arrangement is the huge computational cost involved when learning from very large datasets in a sequential execution environment. To overcome this issue, we present a multiple kernel learning (MKL) based gene regulatory network (GRN) inference approach wherein multiple heterogeneous datasets are fused using MKL paradigm. We formulate the GRN learning problem as a supervised classification problem, whereby genes regulated by a specific transcription factor are separated from other non-regulated genes. A parallel execution architecture is devised to learn a large scale GRN by decomposing the initial classification problem into a number of subproblems that run as multiple processes on a multi-processor machine. We evaluate the approach in terms of increased speedup and inference potential using genomic data from Escherichia coli, Saccharomyces cerevisiae and Homo sapiens. The results thus obtained demonstrate that the proposed method exhibits better classification accuracy and enhanced speedup compared to other state-of-the-art methods while learning large scale GRNs from multiple and heterogeneous datasets. |
format | Online Article Text |
id | pubmed-7924726 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-79247262021-04-02 MKL-GRNI: A parallel multiple kernel learning approach for supervised inference of large-scale gene regulatory networks Wani, Nisar Raza, Khalid PeerJ Comput Sci Bioinformatics High throughput multi-omics data generation coupled with heterogeneous genomic data fusion are defining new ways to build computational inference models. These models are scalable and can support very large genome sizes with the added advantage of exploiting additional biological knowledge from the integration framework. However, the limitation with such an arrangement is the huge computational cost involved when learning from very large datasets in a sequential execution environment. To overcome this issue, we present a multiple kernel learning (MKL) based gene regulatory network (GRN) inference approach wherein multiple heterogeneous datasets are fused using MKL paradigm. We formulate the GRN learning problem as a supervised classification problem, whereby genes regulated by a specific transcription factor are separated from other non-regulated genes. A parallel execution architecture is devised to learn a large scale GRN by decomposing the initial classification problem into a number of subproblems that run as multiple processes on a multi-processor machine. We evaluate the approach in terms of increased speedup and inference potential using genomic data from Escherichia coli, Saccharomyces cerevisiae and Homo sapiens. The results thus obtained demonstrate that the proposed method exhibits better classification accuracy and enhanced speedup compared to other state-of-the-art methods while learning large scale GRNs from multiple and heterogeneous datasets. PeerJ Inc. 2021-01-28 /pmc/articles/PMC7924726/ /pubmed/33817013 http://dx.doi.org/10.7717/peerj-cs.363 Text en © 2021 Wani and Raza https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited. |
spellingShingle | Bioinformatics Wani, Nisar Raza, Khalid MKL-GRNI: A parallel multiple kernel learning approach for supervised inference of large-scale gene regulatory networks |
title | MKL-GRNI: A parallel multiple kernel learning approach for supervised inference of large-scale gene regulatory networks |
title_full | MKL-GRNI: A parallel multiple kernel learning approach for supervised inference of large-scale gene regulatory networks |
title_fullStr | MKL-GRNI: A parallel multiple kernel learning approach for supervised inference of large-scale gene regulatory networks |
title_full_unstemmed | MKL-GRNI: A parallel multiple kernel learning approach for supervised inference of large-scale gene regulatory networks |
title_short | MKL-GRNI: A parallel multiple kernel learning approach for supervised inference of large-scale gene regulatory networks |
title_sort | mkl-grni: a parallel multiple kernel learning approach for supervised inference of large-scale gene regulatory networks |
topic | Bioinformatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7924726/ https://www.ncbi.nlm.nih.gov/pubmed/33817013 http://dx.doi.org/10.7717/peerj-cs.363 |
work_keys_str_mv | AT waninisar mklgrniaparallelmultiplekernellearningapproachforsupervisedinferenceoflargescalegeneregulatorynetworks AT razakhalid mklgrniaparallelmultiplekernellearningapproachforsupervisedinferenceoflargescalegeneregulatorynetworks |