Cargando…
Assembling contigs in draft genomes using reversals and block-interchanges
The techniques of next generation sequencing allow an increasing number of draft genomes to be produced rapidly in a decreasing cost. However, these draft genomes usually are just partially sequenced as collections of unassembled contigs, which cannot be used directly by currently existing algorithm...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3622642/ https://www.ncbi.nlm.nih.gov/pubmed/23734866 http://dx.doi.org/10.1186/1471-2105-14-S5-S9 |
_version_ | 1782265859859808256 |
---|---|
author | Li, Chi-Long Chen, Kun-Tze Lu, Chin Lung |
author_facet | Li, Chi-Long Chen, Kun-Tze Lu, Chin Lung |
author_sort | Li, Chi-Long |
collection | PubMed |
description | The techniques of next generation sequencing allow an increasing number of draft genomes to be produced rapidly in a decreasing cost. However, these draft genomes usually are just partially sequenced as collections of unassembled contigs, which cannot be used directly by currently existing algorithms for studying their genome rearrangements and phylogeny reconstruction. In this work, we study the one-sided block (or contig) ordering problem with weighted reversal and block-interchange distance. Given a partially assembled genome π and a completely assembled genome σ, the problem is to find an optimal ordering to assemble (i.e., order and orient) the contigs of π such that the rearrangement distance measured by reversals and block-interchanges (also called generalized transpositions) with the weight ratio 1:2 between the assembled contigs of π and σ is minimized. In addition to genome rearrangements and phylogeny reconstruction, the one-sided block ordering problem particularly has a useful application in genome resequencing, because its algorithms can be used to assemble the contigs of a draft genome π based on a reference genome σ. By using permutation groups, we design an efficient algorithm to solve this one-sided block ordering problem in [Formula: see text] time, where n is the number of genes or markers and δ is the number of used reversals and block-interchanges. We also show that the assembly of the partially assembled genome can be done in [Formula: see text] time and its weighted rearrangement distance from the completely assembled genome can be calculated in advance in [Formula: see text] time. Finally, we have implemented our algorithm into a program and used some simulated datasets to compare its accuracy performance to a currently existing similar tool, called SIS that was implemented by a heuristic algorithm that considers only reversals, on assembling the contigs in draft genomes based on their reference genomes. Our experimental results have shown that the accuracy performance of our program is better than that of SIS, when the number of reversals and transpositions involved in the rearrangement events between the complete genomes of π and σ is increased. In particular, if there are more transpositions involved in the rearrangement events, then the gap of accuracy performance between our program and SIS is increasing. |
format | Online Article Text |
id | pubmed-3622642 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-36226422013-04-15 Assembling contigs in draft genomes using reversals and block-interchanges Li, Chi-Long Chen, Kun-Tze Lu, Chin Lung BMC Bioinformatics Proceedings The techniques of next generation sequencing allow an increasing number of draft genomes to be produced rapidly in a decreasing cost. However, these draft genomes usually are just partially sequenced as collections of unassembled contigs, which cannot be used directly by currently existing algorithms for studying their genome rearrangements and phylogeny reconstruction. In this work, we study the one-sided block (or contig) ordering problem with weighted reversal and block-interchange distance. Given a partially assembled genome π and a completely assembled genome σ, the problem is to find an optimal ordering to assemble (i.e., order and orient) the contigs of π such that the rearrangement distance measured by reversals and block-interchanges (also called generalized transpositions) with the weight ratio 1:2 between the assembled contigs of π and σ is minimized. In addition to genome rearrangements and phylogeny reconstruction, the one-sided block ordering problem particularly has a useful application in genome resequencing, because its algorithms can be used to assemble the contigs of a draft genome π based on a reference genome σ. By using permutation groups, we design an efficient algorithm to solve this one-sided block ordering problem in [Formula: see text] time, where n is the number of genes or markers and δ is the number of used reversals and block-interchanges. We also show that the assembly of the partially assembled genome can be done in [Formula: see text] time and its weighted rearrangement distance from the completely assembled genome can be calculated in advance in [Formula: see text] time. Finally, we have implemented our algorithm into a program and used some simulated datasets to compare its accuracy performance to a currently existing similar tool, called SIS that was implemented by a heuristic algorithm that considers only reversals, on assembling the contigs in draft genomes based on their reference genomes. Our experimental results have shown that the accuracy performance of our program is better than that of SIS, when the number of reversals and transpositions involved in the rearrangement events between the complete genomes of π and σ is increased. In particular, if there are more transpositions involved in the rearrangement events, then the gap of accuracy performance between our program and SIS is increasing. BioMed Central 2013-04-10 /pmc/articles/PMC3622642/ /pubmed/23734866 http://dx.doi.org/10.1186/1471-2105-14-S5-S9 Text en Copyright © 2013 Li et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Li, Chi-Long Chen, Kun-Tze Lu, Chin Lung Assembling contigs in draft genomes using reversals and block-interchanges |
title | Assembling contigs in draft genomes using reversals and block-interchanges |
title_full | Assembling contigs in draft genomes using reversals and block-interchanges |
title_fullStr | Assembling contigs in draft genomes using reversals and block-interchanges |
title_full_unstemmed | Assembling contigs in draft genomes using reversals and block-interchanges |
title_short | Assembling contigs in draft genomes using reversals and block-interchanges |
title_sort | assembling contigs in draft genomes using reversals and block-interchanges |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3622642/ https://www.ncbi.nlm.nih.gov/pubmed/23734866 http://dx.doi.org/10.1186/1471-2105-14-S5-S9 |
work_keys_str_mv | AT lichilong assemblingcontigsindraftgenomesusingreversalsandblockinterchanges AT chenkuntze assemblingcontigsindraftgenomesusingreversalsandblockinterchanges AT luchinlung assemblingcontigsindraftgenomesusingreversalsandblockinterchanges |