Cargando…

A Family of Tree-Based Generators for Bubbles in Directed Graphs

Bubbles are pairs of internally vertex-disjoint (s, t)-paths in a directed graph. In de Bruijn graphs built from reads of RNA and DNA data, bubbles represent interesting biological events, such as alternative splicing (AS) and allelic differences (SNPs and indels). However, the set of all bubbles in...

Descripción completa

Detalles Bibliográficos
Autores principales: Acuña, Vicente, Lima, Leandro, Italiano, Giuseppe F., Pepè Sciarria, Luca, Sagot, Marie-France, Sinaimeri, Blerina
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7254895/
http://dx.doi.org/10.1007/978-3-030-48966-3_2
_version_ 1783539630479507456
author Acuña, Vicente
Lima, Leandro
Italiano, Giuseppe F.
Pepè Sciarria, Luca
Sagot, Marie-France
Sinaimeri, Blerina
author_facet Acuña, Vicente
Lima, Leandro
Italiano, Giuseppe F.
Pepè Sciarria, Luca
Sagot, Marie-France
Sinaimeri, Blerina
author_sort Acuña, Vicente
collection PubMed
description Bubbles are pairs of internally vertex-disjoint (s, t)-paths in a directed graph. In de Bruijn graphs built from reads of RNA and DNA data, bubbles represent interesting biological events, such as alternative splicing (AS) and allelic differences (SNPs and indels). However, the set of all bubbles in a de Bruijn graph built from real data is usually too large to be efficiently enumerated and analysed in practice. In particular, despite significant research done in this area, listing bubbles still remains the main bottleneck for tools that detect AS events in a reference-free context. Recently, in [1] the concept of a bubble generator was introduced as a way for obtaining a compact representation of the bubble space of a graph. Although this generator was quite effective in finding AS events, preliminary experiments showed that it is about 5 times slower than state-of-art methods. In this paper we propose a new family of bubble generators which improve substantially on the previous generator: generators in this new family are about two orders of magnitude faster and are still able to achieve similar precision in identifying AS events. To highlight the practical value of our new generators, we also report some experimental results on a real dataset.
format Online
Article
Text
id pubmed-7254895
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-72548952020-05-28 A Family of Tree-Based Generators for Bubbles in Directed Graphs Acuña, Vicente Lima, Leandro Italiano, Giuseppe F. Pepè Sciarria, Luca Sagot, Marie-France Sinaimeri, Blerina Combinatorial Algorithms Article Bubbles are pairs of internally vertex-disjoint (s, t)-paths in a directed graph. In de Bruijn graphs built from reads of RNA and DNA data, bubbles represent interesting biological events, such as alternative splicing (AS) and allelic differences (SNPs and indels). However, the set of all bubbles in a de Bruijn graph built from real data is usually too large to be efficiently enumerated and analysed in practice. In particular, despite significant research done in this area, listing bubbles still remains the main bottleneck for tools that detect AS events in a reference-free context. Recently, in [1] the concept of a bubble generator was introduced as a way for obtaining a compact representation of the bubble space of a graph. Although this generator was quite effective in finding AS events, preliminary experiments showed that it is about 5 times slower than state-of-art methods. In this paper we propose a new family of bubble generators which improve substantially on the previous generator: generators in this new family are about two orders of magnitude faster and are still able to achieve similar precision in identifying AS events. To highlight the practical value of our new generators, we also report some experimental results on a real dataset. 2020-04-30 /pmc/articles/PMC7254895/ http://dx.doi.org/10.1007/978-3-030-48966-3_2 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Acuña, Vicente
Lima, Leandro
Italiano, Giuseppe F.
Pepè Sciarria, Luca
Sagot, Marie-France
Sinaimeri, Blerina
A Family of Tree-Based Generators for Bubbles in Directed Graphs
title A Family of Tree-Based Generators for Bubbles in Directed Graphs
title_full A Family of Tree-Based Generators for Bubbles in Directed Graphs
title_fullStr A Family of Tree-Based Generators for Bubbles in Directed Graphs
title_full_unstemmed A Family of Tree-Based Generators for Bubbles in Directed Graphs
title_short A Family of Tree-Based Generators for Bubbles in Directed Graphs
title_sort family of tree-based generators for bubbles in directed graphs
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7254895/
http://dx.doi.org/10.1007/978-3-030-48966-3_2
work_keys_str_mv AT acunavicente afamilyoftreebasedgeneratorsforbubblesindirectedgraphs
AT limaleandro afamilyoftreebasedgeneratorsforbubblesindirectedgraphs
AT italianogiuseppef afamilyoftreebasedgeneratorsforbubblesindirectedgraphs
AT pepesciarrialuca afamilyoftreebasedgeneratorsforbubblesindirectedgraphs
AT sagotmariefrance afamilyoftreebasedgeneratorsforbubblesindirectedgraphs
AT sinaimeriblerina afamilyoftreebasedgeneratorsforbubblesindirectedgraphs
AT acunavicente familyoftreebasedgeneratorsforbubblesindirectedgraphs
AT limaleandro familyoftreebasedgeneratorsforbubblesindirectedgraphs
AT italianogiuseppef familyoftreebasedgeneratorsforbubblesindirectedgraphs
AT pepesciarrialuca familyoftreebasedgeneratorsforbubblesindirectedgraphs
AT sagotmariefrance familyoftreebasedgeneratorsforbubblesindirectedgraphs
AT sinaimeriblerina familyoftreebasedgeneratorsforbubblesindirectedgraphs