Cargando…
BOA: A partitioned view of genome assembly
De novo genome assembly is a fundamental problem in computational molecular biology that aims to reconstruct an unknown genome sequence from a set of short DNA sequences (or reads) obtained from the genome. The relative ordering of the reads along the target genome is not known a priori, which is on...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9593263/ https://www.ncbi.nlm.nih.gov/pubmed/36304115 http://dx.doi.org/10.1016/j.isci.2022.105273 |
_version_ | 1784815122181521408 |
---|---|
author | An, Xiaojing Ghosh, Priyanka Keppler, Patrick Kurt, Sureyya Emre Krishnamoorthy, Sriram Sadayappan, Ponnuswamy Rajam, Aravind Sukumaran Çatalyürek, Ümit V. Kalyanaraman, Ananth |
author_facet | An, Xiaojing Ghosh, Priyanka Keppler, Patrick Kurt, Sureyya Emre Krishnamoorthy, Sriram Sadayappan, Ponnuswamy Rajam, Aravind Sukumaran Çatalyürek, Ümit V. Kalyanaraman, Ananth |
author_sort | An, Xiaojing |
collection | PubMed |
description | De novo genome assembly is a fundamental problem in computational molecular biology that aims to reconstruct an unknown genome sequence from a set of short DNA sequences (or reads) obtained from the genome. The relative ordering of the reads along the target genome is not known a priori, which is one of the main contributors to the increased complexity of the assembly process. In this article, with the dual objective of improving assembly quality and exposing a high degree of parallelism, we present a partitioning-based approach. Our framework, BOA (bucket-order-assemble), uses a bucketing alongside graph- and hypergraph-based partitioning techniques to produce a partial ordering of the reads. This partial ordering enables us to divide the read set into disjoint blocks that can be independently assembled in parallel using any state-of-the-art serial assembler of choice. Experimental results show that BOA improves both the overall assembly quality and performance. |
format | Online Article Text |
id | pubmed-9593263 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-95932632022-10-26 BOA: A partitioned view of genome assembly An, Xiaojing Ghosh, Priyanka Keppler, Patrick Kurt, Sureyya Emre Krishnamoorthy, Sriram Sadayappan, Ponnuswamy Rajam, Aravind Sukumaran Çatalyürek, Ümit V. Kalyanaraman, Ananth iScience Article De novo genome assembly is a fundamental problem in computational molecular biology that aims to reconstruct an unknown genome sequence from a set of short DNA sequences (or reads) obtained from the genome. The relative ordering of the reads along the target genome is not known a priori, which is one of the main contributors to the increased complexity of the assembly process. In this article, with the dual objective of improving assembly quality and exposing a high degree of parallelism, we present a partitioning-based approach. Our framework, BOA (bucket-order-assemble), uses a bucketing alongside graph- and hypergraph-based partitioning techniques to produce a partial ordering of the reads. This partial ordering enables us to divide the read set into disjoint blocks that can be independently assembled in parallel using any state-of-the-art serial assembler of choice. Experimental results show that BOA improves both the overall assembly quality and performance. Elsevier 2022-10-08 /pmc/articles/PMC9593263/ /pubmed/36304115 http://dx.doi.org/10.1016/j.isci.2022.105273 Text en © 2022 The Author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Article An, Xiaojing Ghosh, Priyanka Keppler, Patrick Kurt, Sureyya Emre Krishnamoorthy, Sriram Sadayappan, Ponnuswamy Rajam, Aravind Sukumaran Çatalyürek, Ümit V. Kalyanaraman, Ananth BOA: A partitioned view of genome assembly |
title | BOA: A partitioned view of genome assembly |
title_full | BOA: A partitioned view of genome assembly |
title_fullStr | BOA: A partitioned view of genome assembly |
title_full_unstemmed | BOA: A partitioned view of genome assembly |
title_short | BOA: A partitioned view of genome assembly |
title_sort | boa: a partitioned view of genome assembly |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9593263/ https://www.ncbi.nlm.nih.gov/pubmed/36304115 http://dx.doi.org/10.1016/j.isci.2022.105273 |
work_keys_str_mv | AT anxiaojing boaapartitionedviewofgenomeassembly AT ghoshpriyanka boaapartitionedviewofgenomeassembly AT kepplerpatrick boaapartitionedviewofgenomeassembly AT kurtsureyyaemre boaapartitionedviewofgenomeassembly AT krishnamoorthysriram boaapartitionedviewofgenomeassembly AT sadayappanponnuswamy boaapartitionedviewofgenomeassembly AT rajamaravindsukumaran boaapartitionedviewofgenomeassembly AT catalyurekumitv boaapartitionedviewofgenomeassembly AT kalyanaramanananth boaapartitionedviewofgenomeassembly |