Cargando…

The approximability of the String Barcoding problem

The String Barcoding (SBC) problem, introduced by Rash and Gusfield (RECOMB, 2002), consists in finding a minimum set of substrings that can be used to distinguish between all members of a set of given strings. In a computational biology context, the given strings represent a set of known viruses, w...

Descripción completa

Detalles Bibliográficos
Autores principales: Lancia, Giuseppe, Rizzi, Romeo
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1590015/
https://www.ncbi.nlm.nih.gov/pubmed/16895600
http://dx.doi.org/10.1186/1748-7188-1-12
_version_ 1782130358793273344
author Lancia, Giuseppe
Rizzi, Romeo
author_facet Lancia, Giuseppe
Rizzi, Romeo
author_sort Lancia, Giuseppe
collection PubMed
description The String Barcoding (SBC) problem, introduced by Rash and Gusfield (RECOMB, 2002), consists in finding a minimum set of substrings that can be used to distinguish between all members of a set of given strings. In a computational biology context, the given strings represent a set of known viruses, while the substrings can be used as probes for an hybridization experiment via microarray. Eventually, one aims at the classification of new strings (unknown viruses) through the result of the hybridization experiment. In this paper we show that SBC is as hard to approximate as Set Cover. Furthermore, we show that the constrained version of SBC (with probes of bounded length) is also hard to approximate. These negative results are tight.
format Text
id pubmed-1590015
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-15900152006-10-05 The approximability of the String Barcoding problem Lancia, Giuseppe Rizzi, Romeo Algorithms Mol Biol Research The String Barcoding (SBC) problem, introduced by Rash and Gusfield (RECOMB, 2002), consists in finding a minimum set of substrings that can be used to distinguish between all members of a set of given strings. In a computational biology context, the given strings represent a set of known viruses, while the substrings can be used as probes for an hybridization experiment via microarray. Eventually, one aims at the classification of new strings (unknown viruses) through the result of the hybridization experiment. In this paper we show that SBC is as hard to approximate as Set Cover. Furthermore, we show that the constrained version of SBC (with probes of bounded length) is also hard to approximate. These negative results are tight. BioMed Central 2006-08-08 /pmc/articles/PMC1590015/ /pubmed/16895600 http://dx.doi.org/10.1186/1748-7188-1-12 Text en Copyright © 2006 Lancia and Rizzi; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Lancia, Giuseppe
Rizzi, Romeo
The approximability of the String Barcoding problem
title The approximability of the String Barcoding problem
title_full The approximability of the String Barcoding problem
title_fullStr The approximability of the String Barcoding problem
title_full_unstemmed The approximability of the String Barcoding problem
title_short The approximability of the String Barcoding problem
title_sort approximability of the string barcoding problem
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1590015/
https://www.ncbi.nlm.nih.gov/pubmed/16895600
http://dx.doi.org/10.1186/1748-7188-1-12
work_keys_str_mv AT lanciagiuseppe theapproximabilityofthestringbarcodingproblem
AT rizziromeo theapproximabilityofthestringbarcodingproblem
AT lanciagiuseppe approximabilityofthestringbarcodingproblem
AT rizziromeo approximabilityofthestringbarcodingproblem