Cargando…

Super short operations on both gene order and intergenic sizes

BACKGROUND: The evolutionary distance between two genomes can be estimated by computing a minimum length sequence of operations, called genome rearrangements, that transform one genome into another. Usually, a genome is modeled as an ordered sequence of genes, and most of the studies in the genome r...

Descripción completa

Detalles Bibliográficos
Autores principales: Oliveira, Andre R., Jean, Géraldine, Fertin, Guillaume, Dias, Ulisses, Dias, Zanoni
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6833170/
https://www.ncbi.nlm.nih.gov/pubmed/31709002
http://dx.doi.org/10.1186/s13015-019-0156-5
_version_ 1783466319160541184
author Oliveira, Andre R.
Jean, Géraldine
Fertin, Guillaume
Dias, Ulisses
Dias, Zanoni
author_facet Oliveira, Andre R.
Jean, Géraldine
Fertin, Guillaume
Dias, Ulisses
Dias, Zanoni
author_sort Oliveira, Andre R.
collection PubMed
description BACKGROUND: The evolutionary distance between two genomes can be estimated by computing a minimum length sequence of operations, called genome rearrangements, that transform one genome into another. Usually, a genome is modeled as an ordered sequence of genes, and most of the studies in the genome rearrangement literature consist in shaping biological scenarios into mathematical models. For instance, allowing different genome rearrangements operations at the same time, adding constraints to these rearrangements (e.g., each rearrangement can affect at most a given number of genes), considering that a rearrangement implies a cost depending on its length rather than a unit cost, etc. Most of the works, however, have overlooked some important features inside genomes, such as the presence of sequences of nucleotides between genes, called intergenic regions. RESULTS AND CONCLUSIONS: In this work, we investigate the problem of computing the distance between two genomes, taking into account both gene order and intergenic sizes. The genome rearrangement operations we consider here are constrained types of reversals and transpositions, called super short reversals (SSRs) and super short transpositions (SSTs), which affect up to two (consecutive) genes. We denote by super short operations (SSOs) any SSR or SST. We show 3-approximation algorithms when the orientation of the genes is not considered when we allow SSRs, SSTs, or SSOs, and 5-approximation algorithms when considering the orientation for either SSRs or SSOs. We also show that these algorithms improve their approximation factors when the input permutation has a higher number of inversions, where the approximation factor decreases from 3 to either 2 or 1.5, and from 5 to either 3 or 2.
format Online
Article
Text
id pubmed-6833170
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-68331702019-11-08 Super short operations on both gene order and intergenic sizes Oliveira, Andre R. Jean, Géraldine Fertin, Guillaume Dias, Ulisses Dias, Zanoni Algorithms Mol Biol Research BACKGROUND: The evolutionary distance between two genomes can be estimated by computing a minimum length sequence of operations, called genome rearrangements, that transform one genome into another. Usually, a genome is modeled as an ordered sequence of genes, and most of the studies in the genome rearrangement literature consist in shaping biological scenarios into mathematical models. For instance, allowing different genome rearrangements operations at the same time, adding constraints to these rearrangements (e.g., each rearrangement can affect at most a given number of genes), considering that a rearrangement implies a cost depending on its length rather than a unit cost, etc. Most of the works, however, have overlooked some important features inside genomes, such as the presence of sequences of nucleotides between genes, called intergenic regions. RESULTS AND CONCLUSIONS: In this work, we investigate the problem of computing the distance between two genomes, taking into account both gene order and intergenic sizes. The genome rearrangement operations we consider here are constrained types of reversals and transpositions, called super short reversals (SSRs) and super short transpositions (SSTs), which affect up to two (consecutive) genes. We denote by super short operations (SSOs) any SSR or SST. We show 3-approximation algorithms when the orientation of the genes is not considered when we allow SSRs, SSTs, or SSOs, and 5-approximation algorithms when considering the orientation for either SSRs or SSOs. We also show that these algorithms improve their approximation factors when the input permutation has a higher number of inversions, where the approximation factor decreases from 3 to either 2 or 1.5, and from 5 to either 3 or 2. BioMed Central 2019-11-05 /pmc/articles/PMC6833170/ /pubmed/31709002 http://dx.doi.org/10.1186/s13015-019-0156-5 Text en © The Author(s) 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Oliveira, Andre R.
Jean, Géraldine
Fertin, Guillaume
Dias, Ulisses
Dias, Zanoni
Super short operations on both gene order and intergenic sizes
title Super short operations on both gene order and intergenic sizes
title_full Super short operations on both gene order and intergenic sizes
title_fullStr Super short operations on both gene order and intergenic sizes
title_full_unstemmed Super short operations on both gene order and intergenic sizes
title_short Super short operations on both gene order and intergenic sizes
title_sort super short operations on both gene order and intergenic sizes
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6833170/
https://www.ncbi.nlm.nih.gov/pubmed/31709002
http://dx.doi.org/10.1186/s13015-019-0156-5
work_keys_str_mv AT oliveiraandrer supershortoperationsonbothgeneorderandintergenicsizes
AT jeangeraldine supershortoperationsonbothgeneorderandintergenicsizes
AT fertinguillaume supershortoperationsonbothgeneorderandintergenicsizes
AT diasulisses supershortoperationsonbothgeneorderandintergenicsizes
AT diaszanoni supershortoperationsonbothgeneorderandintergenicsizes