A Family of Tree-Based Generators for Bubbles in Directed Graphs
Bubbles are pairs of internally vertex-disjoint (s, t)-paths in a directed graph. In de Bruijn graphs built from reads of RNA and DNA data, bubbles represent interesting biological events, such as alternative splicing (AS) and allelic differences (SNPs and indels). However, the set of all bubbles in...
Main Authors: | Acuña, Vicente; Lima, Leandro; Italiano, Giuseppe F.; Pepè Sciarria, Luca; Sagot, Marie-France; Sinaimeri, Blerina |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | 2020 |
Subjects: | |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7254895/ http://dx.doi.org/10.1007/978-3-030-48966-3_2 |
_version_ | 1783539630479507456 |
---|---|
author | Acuña, Vicente Lima, Leandro Italiano, Giuseppe F. Pepè Sciarria, Luca Sagot, Marie-France Sinaimeri, Blerina |
author_facet | Acuña, Vicente Lima, Leandro Italiano, Giuseppe F. Pepè Sciarria, Luca Sagot, Marie-France Sinaimeri, Blerina |
author_sort | Acuña, Vicente |
collection | PubMed |
description | Bubbles are pairs of internally vertex-disjoint (s, t)-paths in a directed graph. In de Bruijn graphs built from reads of RNA and DNA data, bubbles represent interesting biological events, such as alternative splicing (AS) and allelic differences (SNPs and indels). However, the set of all bubbles in a de Bruijn graph built from real data is usually too large to be efficiently enumerated and analysed in practice. In particular, despite significant research in this area, listing bubbles remains the main bottleneck for tools that detect AS events in a reference-free context. Recently, in [1] the concept of a bubble generator was introduced as a way of obtaining a compact representation of the bubble space of a graph. Although this generator was quite effective in finding AS events, preliminary experiments showed that it is about 5 times slower than state-of-the-art methods. In this paper we propose a new family of bubble generators which improve substantially on the previous generator: generators in this new family are about two orders of magnitude faster and are still able to achieve similar precision in identifying AS events. To highlight the practical value of our new generators, we also report some experimental results on a real dataset. |
format | Online Article Text |
id | pubmed-7254895 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
record_format | MEDLINE/PubMed |
spelling | pubmed-72548952020-05-28 A Family of Tree-Based Generators for Bubbles in Directed Graphs Acuña, Vicente Lima, Leandro Italiano, Giuseppe F. Pepè Sciarria, Luca Sagot, Marie-France Sinaimeri, Blerina Combinatorial Algorithms Article Bubbles are pairs of internally vertex-disjoint (s, t)-paths in a directed graph. In de Bruijn graphs built from reads of RNA and DNA data, bubbles represent interesting biological events, such as alternative splicing (AS) and allelic differences (SNPs and indels). However, the set of all bubbles in a de Bruijn graph built from real data is usually too large to be efficiently enumerated and analysed in practice. In particular, despite significant research done in this area, listing bubbles still remains the main bottleneck for tools that detect AS events in a reference-free context. Recently, in [1] the concept of a bubble generator was introduced as a way for obtaining a compact representation of the bubble space of a graph. Although this generator was quite effective in finding AS events, preliminary experiments showed that it is about 5 times slower than state-of-art methods. In this paper we propose a new family of bubble generators which improve substantially on the previous generator: generators in this new family are about two orders of magnitude faster and are still able to achieve similar precision in identifying AS events. To highlight the practical value of our new generators, we also report some experimental results on a real dataset. 2020-04-30 /pmc/articles/PMC7254895/ http://dx.doi.org/10.1007/978-3-030-48966-3_2 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Article Acuña, Vicente Lima, Leandro Italiano, Giuseppe F. Pepè Sciarria, Luca Sagot, Marie-France Sinaimeri, Blerina A Family of Tree-Based Generators for Bubbles in Directed Graphs |
title | A Family of Tree-Based Generators for Bubbles in Directed Graphs |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7254895/ http://dx.doi.org/10.1007/978-3-030-48966-3_2 |
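
As an illustration of the definition given in the description field (a bubble is a pair of internally vertex-disjoint (s, t)-paths in a directed graph), the following Python sketch checks whether two given paths form a bubble. It is not the generator algorithm from the article. The adjacency-set representation, the function names `is_path` and `is_bubble`, the toy graph, and the requirement that the two paths be simple and distinct are all assumptions made for this example.

```python
from typing import Dict, List, Set

# Assumed representation: each vertex maps to the set of its successors.
Graph = Dict[str, Set[str]]


def is_path(graph: Graph, path: List[str]) -> bool:
    """Return True if `path` is a simple directed path with at least one arc."""
    if len(path) < 2 or len(set(path)) != len(path):
        return False
    # Every consecutive pair (u, v) must be an arc of the graph.
    return all(v in graph.get(u, set()) for u, v in zip(path, path[1:]))


def is_bubble(graph: Graph, p: List[str], q: List[str]) -> bool:
    """Return True if p and q are two distinct (s, t)-paths that share
    no vertex other than their common endpoints s and t."""
    if not (is_path(graph, p) and is_path(graph, q)):
        return False
    if p[0] != q[0] or p[-1] != q[-1]:
        return False  # the paths must share the same source s and target t
    if p == q:
        return False  # assumption: a bubble needs two different paths
    # Internal vertices are everything except the first and last vertex.
    return set(p[1:-1]).isdisjoint(q[1:-1])


# Hypothetical toy graph: two routes from s to t, plus an extra arc a -> b.
toy_graph: Graph = {
    "s": {"a", "b"},
    "a": {"t", "b"},
    "b": {"t"},
    "t": set(),
}

bubble_found = is_bubble(toy_graph, ["s", "a", "t"], ["s", "b", "t"])
shared_vertex = is_bubble(toy_graph, ["s", "a", "b", "t"], ["s", "b", "t"])
```

Here `bubble_found` is true because the two paths meet only at `s` and `t`, while `shared_vertex` is false because both paths pass through `b`. Enumerating every such pair in a real de Bruijn graph is the costly step that, according to the description, bubble generators are meant to avoid.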