Cargando…

On the complexity of Minimum Path Cover with Subpath Constraints for multi-assembly

BACKGROUND: Multi-assembly problems have gathered much attention in the last years, as Next-Generation Sequencing technologies have started being applied to mixed settings, such as reads from the transcriptome (RNA-Seq), or from viral quasi-species. One classical model that has resurfaced in many mu...

Descripción completa

Detalles Bibliográficos
Autores principales: Rizzi, Romeo, Tomescu, Alexandru I, Mäkinen, Veli
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4168716/
https://www.ncbi.nlm.nih.gov/pubmed/25252805
http://dx.doi.org/10.1186/1471-2105-15-S9-S5
_version_ 1782335605529640960
author Rizzi, Romeo
Tomescu, Alexandru I
Mäkinen, Veli
author_facet Rizzi, Romeo
Tomescu, Alexandru I
Mäkinen, Veli
author_sort Rizzi, Romeo
collection PubMed
description BACKGROUND: Multi-assembly problems have gathered much attention in the last years, as Next-Generation Sequencing technologies have started being applied to mixed settings, such as reads from the transcriptome (RNA-Seq), or from viral quasi-species. One classical model that has resurfaced in many multi-assembly methods (e.g. in Cufflinks, ShoRAH, BRANCH, CLASS) is the Minimum Path Cover (MPC) Problem, which asks for the minimum number of directed paths that cover all the nodes of a directed acyclic graph. The MPC Problem is highly popular because the acyclicity of the graph ensures its polynomial-time solvability. RESULTS: In this paper, we consider two generalizations of it dealing with integrating constraints arising from long reads or paired-end reads; these extensions have also been considered by two recent methods, but not fully solved. More specifically, we study the two problems where also a set of subpaths, or pairs of subpaths, of the graph have to be entirely covered by some path in the MPC. We show that in the case of long reads (subpaths), the generalized problem can be solved in polynomial-time by a reduction to the classical MPC Problem. We also consider the weighted case, and show that it can be solved in polynomial-time by a reduction to a min-cost circulation problem. As a side result, we also improve the time complexity of the classical minimum weight MPC Problem. In the case of paired-end reads (pairs of subpaths), the generalized problem becomes NP-hard, but we show that it is fixed-parameter tractable (FPT) in the total number of constraints. This computational dichotomy between long reads and paired-end reads is also a general insight into multi-assembly problems.
format Online
Article
Text
id pubmed-4168716
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-41687162014-10-02 On the complexity of Minimum Path Cover with Subpath Constraints for multi-assembly Rizzi, Romeo Tomescu, Alexandru I Mäkinen, Veli BMC Bioinformatics Proceedings BACKGROUND: Multi-assembly problems have gathered much attention in the last years, as Next-Generation Sequencing technologies have started being applied to mixed settings, such as reads from the transcriptome (RNA-Seq), or from viral quasi-species. One classical model that has resurfaced in many multi-assembly methods (e.g. in Cufflinks, ShoRAH, BRANCH, CLASS) is the Minimum Path Cover (MPC) Problem, which asks for the minimum number of directed paths that cover all the nodes of a directed acyclic graph. The MPC Problem is highly popular because the acyclicity of the graph ensures its polynomial-time solvability. RESULTS: In this paper, we consider two generalizations of it dealing with integrating constraints arising from long reads or paired-end reads; these extensions have also been considered by two recent methods, but not fully solved. More specifically, we study the two problems where also a set of subpaths, or pairs of subpaths, of the graph have to be entirely covered by some path in the MPC. We show that in the case of long reads (subpaths), the generalized problem can be solved in polynomial-time by a reduction to the classical MPC Problem. We also consider the weighted case, and show that it can be solved in polynomial-time by a reduction to a min-cost circulation problem. As a side result, we also improve the time complexity of the classical minimum weight MPC Problem. In the case of paired-end reads (pairs of subpaths), the generalized problem becomes NP-hard, but we show that it is fixed-parameter tractable (FPT) in the total number of constraints. This computational dichotomy between long reads and paired-end reads is also a general insight into multi-assembly problems. BioMed Central 2014-09-10 /pmc/articles/PMC4168716/ /pubmed/25252805 http://dx.doi.org/10.1186/1471-2105-15-S9-S5 Text en Copyright © 2014 Rizzi and Tomescu; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Proceedings
Rizzi, Romeo
Tomescu, Alexandru I
Mäkinen, Veli
On the complexity of Minimum Path Cover with Subpath Constraints for multi-assembly
title On the complexity of Minimum Path Cover with Subpath Constraints for multi-assembly
title_full On the complexity of Minimum Path Cover with Subpath Constraints for multi-assembly
title_fullStr On the complexity of Minimum Path Cover with Subpath Constraints for multi-assembly
title_full_unstemmed On the complexity of Minimum Path Cover with Subpath Constraints for multi-assembly
title_short On the complexity of Minimum Path Cover with Subpath Constraints for multi-assembly
title_sort on the complexity of minimum path cover with subpath constraints for multi-assembly
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4168716/
https://www.ncbi.nlm.nih.gov/pubmed/25252805
http://dx.doi.org/10.1186/1471-2105-15-S9-S5
work_keys_str_mv AT rizziromeo onthecomplexityofminimumpathcoverwithsubpathconstraintsformultiassembly
AT tomescualexandrui onthecomplexityofminimumpathcoverwithsubpathconstraintsformultiassembly
AT makinenveli onthecomplexityofminimumpathcoverwithsubpathconstraintsformultiassembly