Cargando…

Learning directed acyclic graphs from large-scale genomics data

In this paper, we consider the problem of learning the genetic interaction map, i.e., the topology of a directed acyclic graph (DAG) of genetic interactions from noisy double-knockout (DK) data. Based on a set of well-established biological interaction models, we detect and classify the interactions...

Descripción completa

Detalles Bibliográficos
Autores principales: Nikolay, Fabio, Pesavento, Marius, Kritikos, George, Typas, Nassos
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5607220/
https://www.ncbi.nlm.nih.gov/pubmed/28933027
http://dx.doi.org/10.1186/s13637-017-0063-3
_version_ 1783265250773041152
author Nikolay, Fabio
Pesavento, Marius
Kritikos, George
Typas, Nassos
author_facet Nikolay, Fabio
Pesavento, Marius
Kritikos, George
Typas, Nassos
author_sort Nikolay, Fabio
collection PubMed
description In this paper, we consider the problem of learning the genetic interaction map, i.e., the topology of a directed acyclic graph (DAG) of genetic interactions from noisy double-knockout (DK) data. Based on a set of well-established biological interaction models, we detect and classify the interactions between genes. We propose a novel linear integer optimization program called the Genetic-Interactions-Detector (GENIE) to identify the complex biological dependencies among genes and to compute the DAG topology that matches the DK measurements best. Furthermore, we extend the GENIE program by incorporating genetic interaction profile (GI-profile) data to further enhance the detection performance. In addition, we propose a sequential scalability technique for large sets of genes under study, in order to provide statistically significant results for real measurement data. Finally, we show via numeric simulations that the GENIE program and the GI-profile data extended GENIE (GI-GENIE) program clearly outperform the conventional techniques and present real data results for our proposed sequential scalability technique.
format Online
Article
Text
id pubmed-5607220
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-56072202017-10-10 Learning directed acyclic graphs from large-scale genomics data Nikolay, Fabio Pesavento, Marius Kritikos, George Typas, Nassos EURASIP J Bioinform Syst Biol Research In this paper, we consider the problem of learning the genetic interaction map, i.e., the topology of a directed acyclic graph (DAG) of genetic interactions from noisy double-knockout (DK) data. Based on a set of well-established biological interaction models, we detect and classify the interactions between genes. We propose a novel linear integer optimization program called the Genetic-Interactions-Detector (GENIE) to identify the complex biological dependencies among genes and to compute the DAG topology that matches the DK measurements best. Furthermore, we extend the GENIE program by incorporating genetic interaction profile (GI-profile) data to further enhance the detection performance. In addition, we propose a sequential scalability technique for large sets of genes under study, in order to provide statistically significant results for real measurement data. Finally, we show via numeric simulations that the GENIE program and the GI-profile data extended GENIE (GI-GENIE) program clearly outperform the conventional techniques and present real data results for our proposed sequential scalability technique. Springer International Publishing 2017-09-20 /pmc/articles/PMC5607220/ /pubmed/28933027 http://dx.doi.org/10.1186/s13637-017-0063-3 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Research
Nikolay, Fabio
Pesavento, Marius
Kritikos, George
Typas, Nassos
Learning directed acyclic graphs from large-scale genomics data
title Learning directed acyclic graphs from large-scale genomics data
title_full Learning directed acyclic graphs from large-scale genomics data
title_fullStr Learning directed acyclic graphs from large-scale genomics data
title_full_unstemmed Learning directed acyclic graphs from large-scale genomics data
title_short Learning directed acyclic graphs from large-scale genomics data
title_sort learning directed acyclic graphs from large-scale genomics data
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5607220/
https://www.ncbi.nlm.nih.gov/pubmed/28933027
http://dx.doi.org/10.1186/s13637-017-0063-3
work_keys_str_mv AT nikolayfabio learningdirectedacyclicgraphsfromlargescalegenomicsdata
AT pesaventomarius learningdirectedacyclicgraphsfromlargescalegenomicsdata
AT kritikosgeorge learningdirectedacyclicgraphsfromlargescalegenomicsdata
AT typasnassos learningdirectedacyclicgraphsfromlargescalegenomicsdata