
BOA: A partitioned view of genome assembly

De novo genome assembly is a fundamental problem in computational molecular biology that aims to reconstruct an unknown genome sequence from a set of short DNA sequences (or reads) obtained from the genome. The relative ordering of the reads along the target genome is not known a priori, which is on...


Bibliographic Details
Main authors: An, Xiaojing, Ghosh, Priyanka, Keppler, Patrick, Kurt, Sureyya Emre, Krishnamoorthy, Sriram, Sadayappan, Ponnuswamy, Rajam, Aravind Sukumaran, Çatalyürek, Ümit V., Kalyanaraman, Ananth
Format: Online Article Text
Language: English
Published: Elsevier 2022
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9593263/
https://www.ncbi.nlm.nih.gov/pubmed/36304115
http://dx.doi.org/10.1016/j.isci.2022.105273
author An, Xiaojing
Ghosh, Priyanka
Keppler, Patrick
Kurt, Sureyya Emre
Krishnamoorthy, Sriram
Sadayappan, Ponnuswamy
Rajam, Aravind Sukumaran
Çatalyürek, Ümit V.
Kalyanaraman, Ananth
author_sort An, Xiaojing
collection PubMed
description De novo genome assembly is a fundamental problem in computational molecular biology that aims to reconstruct an unknown genome sequence from a set of short DNA sequences (or reads) obtained from the genome. The relative ordering of the reads along the target genome is not known a priori, which is one of the main contributors to the increased complexity of the assembly process. In this article, with the dual objective of improving assembly quality and exposing a high degree of parallelism, we present a partitioning-based approach. Our framework, BOA (bucket-order-assemble), uses bucketing alongside graph- and hypergraph-based partitioning techniques to produce a partial ordering of the reads. This partial ordering enables us to divide the read set into disjoint blocks that can be independently assembled in parallel using any state-of-the-art serial assembler of choice. Experimental results show that BOA improves both the overall assembly quality and performance.
format Online
Article
Text
id pubmed-9593263
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-9593263 2022-10-26 BOA: A partitioned view of genome assembly. iScience, Article. Elsevier 2022-10-08. /pmc/articles/PMC9593263/ /pubmed/36304115 http://dx.doi.org/10.1016/j.isci.2022.105273 Text en © 2022 The Author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
title BOA: A partitioned view of genome assembly
topic Article