Cargando…

COSMO: A dynamic programming algorithm for multicriteria codon optimization

Codon optimization in protein-coding sequences (CDSs) is a widely used technique to promote the heterologous expression of target genes. In codon optimization, a combinatorial space of nucleotide sequences that code a given amino acid sequence and take into account user-prescribed forbidden sequence...

Descripción completa

Detalles Bibliográficos
Autores principales: Taneda, Akito, Asai, Kiyoshi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7358382/
https://www.ncbi.nlm.nih.gov/pubmed/32695273
http://dx.doi.org/10.1016/j.csbj.2020.06.035
_version_ 1783558841599787008
author Taneda, Akito
Asai, Kiyoshi
author_facet Taneda, Akito
Asai, Kiyoshi
author_sort Taneda, Akito
collection PubMed
description Codon optimization in protein-coding sequences (CDSs) is a widely used technique to promote the heterologous expression of target genes. In codon optimization, a combinatorial space of nucleotide sequences that code a given amino acid sequence and take into account user-prescribed forbidden sequence motifs is explored to optimize multiple criteria. Although evolutionary algorithms have been used to tackle such complex codon optimization problems, evolutionary codon optimization tools do not provide guarantees to find the optimal solutions for these multicriteria codon optimization problems. We have developed a novel multicriteria dynamic programming algorithm, COSMO. By using this algorithm, we can obtain all Pareto-optimal solutions for the multiple features of CDS, which include codon usage, codon context, and the number of hidden stop codons. User-prescribed forbidden sequence motifs are rigorously excluded from the Pareto-optimal solutions. To accelerate CDS design by COSMO, we introduced constraints that reduce the number of Pareto-optimal solutions to be processed in a branch-and-bound manner. We benchmarked COSMO for run-time and the number of generated solutions by adapting selected human genes to yeast codon usage frequencies, and found that the constraints effectively reduce the run-time. In addition to the benchmarking of COSMO, a multi-objective genetic algorithm (MOGA) for CDS design was also benchmarked for the same two aspects and their performances were compared. In this comparison, (i) MOGA identified significantly fewer Pareto-optimal solutions than COSMO, and (ii) the MOGA solutions did not achieve the same mean hypervolume values as those provided by COSMO. These results suggest that generating the whole set of the Pareto-optimal solutions of the codon optimization problems is a difficult task for MOGA.
format Online
Article
Text
id pubmed-7358382
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-73583822020-07-20 COSMO: A dynamic programming algorithm for multicriteria codon optimization Taneda, Akito Asai, Kiyoshi Comput Struct Biotechnol J Method Article Codon optimization in protein-coding sequences (CDSs) is a widely used technique to promote the heterologous expression of target genes. In codon optimization, a combinatorial space of nucleotide sequences that code a given amino acid sequence and take into account user-prescribed forbidden sequence motifs is explored to optimize multiple criteria. Although evolutionary algorithms have been used to tackle such complex codon optimization problems, evolutionary codon optimization tools do not provide guarantees to find the optimal solutions for these multicriteria codon optimization problems. We have developed a novel multicriteria dynamic programming algorithm, COSMO. By using this algorithm, we can obtain all Pareto-optimal solutions for the multiple features of CDS, which include codon usage, codon context, and the number of hidden stop codons. User-prescribed forbidden sequence motifs are rigorously excluded from the Pareto-optimal solutions. To accelerate CDS design by COSMO, we introduced constraints that reduce the number of Pareto-optimal solutions to be processed in a branch-and-bound manner. We benchmarked COSMO for run-time and the number of generated solutions by adapting selected human genes to yeast codon usage frequencies, and found that the constraints effectively reduce the run-time. In addition to the benchmarking of COSMO, a multi-objective genetic algorithm (MOGA) for CDS design was also benchmarked for the same two aspects and their performances were compared. In this comparison, (i) MOGA identified significantly fewer Pareto-optimal solutions than COSMO, and (ii) the MOGA solutions did not achieve the same mean hypervolume values as those provided by COSMO. These results suggest that generating the whole set of the Pareto-optimal solutions of the codon optimization problems is a difficult task for MOGA. Research Network of Computational and Structural Biotechnology 2020-06-30 /pmc/articles/PMC7358382/ /pubmed/32695273 http://dx.doi.org/10.1016/j.csbj.2020.06.035 Text en © 2020 The Authors. Published by Elsevier B.V. on behalf of Research Network of Computational and Structural Biotechnology. http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Method Article
Taneda, Akito
Asai, Kiyoshi
COSMO: A dynamic programming algorithm for multicriteria codon optimization
title COSMO: A dynamic programming algorithm for multicriteria codon optimization
title_full COSMO: A dynamic programming algorithm for multicriteria codon optimization
title_fullStr COSMO: A dynamic programming algorithm for multicriteria codon optimization
title_full_unstemmed COSMO: A dynamic programming algorithm for multicriteria codon optimization
title_short COSMO: A dynamic programming algorithm for multicriteria codon optimization
title_sort cosmo: a dynamic programming algorithm for multicriteria codon optimization
topic Method Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7358382/
https://www.ncbi.nlm.nih.gov/pubmed/32695273
http://dx.doi.org/10.1016/j.csbj.2020.06.035
work_keys_str_mv AT tanedaakito cosmoadynamicprogrammingalgorithmformulticriteriacodonoptimization
AT asaikiyoshi cosmoadynamicprogrammingalgorithmformulticriteriacodonoptimization