Cargando…
Learning directed acyclic graphs from large-scale genomics data
In this paper, we consider the problem of learning the genetic interaction map, i.e., the topology of a directed acyclic graph (DAG) of genetic interactions from noisy double-knockout (DK) data. Based on a set of well-established biological interaction models, we detect and classify the interactions...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5607220/ https://www.ncbi.nlm.nih.gov/pubmed/28933027 http://dx.doi.org/10.1186/s13637-017-0063-3 |
_version_ | 1783265250773041152 |
---|---|
author | Nikolay, Fabio Pesavento, Marius Kritikos, George Typas, Nassos |
author_facet | Nikolay, Fabio Pesavento, Marius Kritikos, George Typas, Nassos |
author_sort | Nikolay, Fabio |
collection | PubMed |
description | In this paper, we consider the problem of learning the genetic interaction map, i.e., the topology of a directed acyclic graph (DAG) of genetic interactions from noisy double-knockout (DK) data. Based on a set of well-established biological interaction models, we detect and classify the interactions between genes. We propose a novel linear integer optimization program called the Genetic-Interactions-Detector (GENIE) to identify the complex biological dependencies among genes and to compute the DAG topology that matches the DK measurements best. Furthermore, we extend the GENIE program by incorporating genetic interaction profile (GI-profile) data to further enhance the detection performance. In addition, we propose a sequential scalability technique for large sets of genes under study, in order to provide statistically significant results for real measurement data. Finally, we show via numeric simulations that the GENIE program and the GI-profile data extended GENIE (GI-GENIE) program clearly outperform the conventional techniques and present real data results for our proposed sequential scalability technique. |
format | Online Article Text |
id | pubmed-5607220 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-56072202017-10-10 Learning directed acyclic graphs from large-scale genomics data Nikolay, Fabio Pesavento, Marius Kritikos, George Typas, Nassos EURASIP J Bioinform Syst Biol Research In this paper, we consider the problem of learning the genetic interaction map, i.e., the topology of a directed acyclic graph (DAG) of genetic interactions from noisy double-knockout (DK) data. Based on a set of well-established biological interaction models, we detect and classify the interactions between genes. We propose a novel linear integer optimization program called the Genetic-Interactions-Detector (GENIE) to identify the complex biological dependencies among genes and to compute the DAG topology that matches the DK measurements best. Furthermore, we extend the GENIE program by incorporating genetic interaction profile (GI-profile) data to further enhance the detection performance. In addition, we propose a sequential scalability technique for large sets of genes under study, in order to provide statistically significant results for real measurement data. Finally, we show via numeric simulations that the GENIE program and the GI-profile data extended GENIE (GI-GENIE) program clearly outperform the conventional techniques and present real data results for our proposed sequential scalability technique. Springer International Publishing 2017-09-20 /pmc/articles/PMC5607220/ /pubmed/28933027 http://dx.doi.org/10.1186/s13637-017-0063-3 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. |
spellingShingle | Research Nikolay, Fabio Pesavento, Marius Kritikos, George Typas, Nassos Learning directed acyclic graphs from large-scale genomics data |
title | Learning directed acyclic graphs from large-scale genomics data |
title_full | Learning directed acyclic graphs from large-scale genomics data |
title_fullStr | Learning directed acyclic graphs from large-scale genomics data |
title_full_unstemmed | Learning directed acyclic graphs from large-scale genomics data |
title_short | Learning directed acyclic graphs from large-scale genomics data |
title_sort | learning directed acyclic graphs from large-scale genomics data |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5607220/ https://www.ncbi.nlm.nih.gov/pubmed/28933027 http://dx.doi.org/10.1186/s13637-017-0063-3 |
work_keys_str_mv | AT nikolayfabio learningdirectedacyclicgraphsfromlargescalegenomicsdata AT pesaventomarius learningdirectedacyclicgraphsfromlargescalegenomicsdata AT kritikosgeorge learningdirectedacyclicgraphsfromlargescalegenomicsdata AT typasnassos learningdirectedacyclicgraphsfromlargescalegenomicsdata |