Cargando…
COSMO: A dynamic programming algorithm for multicriteria codon optimization
Codon optimization in protein-coding sequences (CDSs) is a widely used technique to promote the heterologous expression of target genes. In codon optimization, a combinatorial space of nucleotide sequences that code a given amino acid sequence and take into account user-prescribed forbidden sequence...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Research Network of Computational and Structural Biotechnology
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7358382/ https://www.ncbi.nlm.nih.gov/pubmed/32695273 http://dx.doi.org/10.1016/j.csbj.2020.06.035 |
_version_ | 1783558841599787008 |
---|---|
author | Taneda, Akito Asai, Kiyoshi |
author_facet | Taneda, Akito Asai, Kiyoshi |
author_sort | Taneda, Akito |
collection | PubMed |
description | Codon optimization in protein-coding sequences (CDSs) is a widely used technique to promote the heterologous expression of target genes. In codon optimization, a combinatorial space of nucleotide sequences that code a given amino acid sequence and take into account user-prescribed forbidden sequence motifs is explored to optimize multiple criteria. Although evolutionary algorithms have been used to tackle such complex codon optimization problems, evolutionary codon optimization tools do not provide guarantees to find the optimal solutions for these multicriteria codon optimization problems. We have developed a novel multicriteria dynamic programming algorithm, COSMO. By using this algorithm, we can obtain all Pareto-optimal solutions for the multiple features of CDS, which include codon usage, codon context, and the number of hidden stop codons. User-prescribed forbidden sequence motifs are rigorously excluded from the Pareto-optimal solutions. To accelerate CDS design by COSMO, we introduced constraints that reduce the number of Pareto-optimal solutions to be processed in a branch-and-bound manner. We benchmarked COSMO for run-time and the number of generated solutions by adapting selected human genes to yeast codon usage frequencies, and found that the constraints effectively reduce the run-time. In addition to the benchmarking of COSMO, a multi-objective genetic algorithm (MOGA) for CDS design was also benchmarked for the same two aspects and their performances were compared. In this comparison, (i) MOGA identified significantly fewer Pareto-optimal solutions than COSMO, and (ii) the MOGA solutions did not achieve the same mean hypervolume values as those provided by COSMO. These results suggest that generating the whole set of the Pareto-optimal solutions of the codon optimization problems is a difficult task for MOGA. |
format | Online Article Text |
id | pubmed-7358382 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Research Network of Computational and Structural Biotechnology |
record_format | MEDLINE/PubMed |
spelling | pubmed-73583822020-07-20 COSMO: A dynamic programming algorithm for multicriteria codon optimization Taneda, Akito Asai, Kiyoshi Comput Struct Biotechnol J Method Article Codon optimization in protein-coding sequences (CDSs) is a widely used technique to promote the heterologous expression of target genes. In codon optimization, a combinatorial space of nucleotide sequences that code a given amino acid sequence and take into account user-prescribed forbidden sequence motifs is explored to optimize multiple criteria. Although evolutionary algorithms have been used to tackle such complex codon optimization problems, evolutionary codon optimization tools do not provide guarantees to find the optimal solutions for these multicriteria codon optimization problems. We have developed a novel multicriteria dynamic programming algorithm, COSMO. By using this algorithm, we can obtain all Pareto-optimal solutions for the multiple features of CDS, which include codon usage, codon context, and the number of hidden stop codons. User-prescribed forbidden sequence motifs are rigorously excluded from the Pareto-optimal solutions. To accelerate CDS design by COSMO, we introduced constraints that reduce the number of Pareto-optimal solutions to be processed in a branch-and-bound manner. We benchmarked COSMO for run-time and the number of generated solutions by adapting selected human genes to yeast codon usage frequencies, and found that the constraints effectively reduce the run-time. In addition to the benchmarking of COSMO, a multi-objective genetic algorithm (MOGA) for CDS design was also benchmarked for the same two aspects and their performances were compared. In this comparison, (i) MOGA identified significantly fewer Pareto-optimal solutions than COSMO, and (ii) the MOGA solutions did not achieve the same mean hypervolume values as those provided by COSMO. These results suggest that generating the whole set of the Pareto-optimal solutions of the codon optimization problems is a difficult task for MOGA. Research Network of Computational and Structural Biotechnology 2020-06-30 /pmc/articles/PMC7358382/ /pubmed/32695273 http://dx.doi.org/10.1016/j.csbj.2020.06.035 Text en © 2020 The Authors. Published by Elsevier B.V. on behalf of Research Network of Computational and Structural Biotechnology. http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Method Article Taneda, Akito Asai, Kiyoshi COSMO: A dynamic programming algorithm for multicriteria codon optimization |
title | COSMO: A dynamic programming algorithm for multicriteria codon optimization |
title_full | COSMO: A dynamic programming algorithm for multicriteria codon optimization |
title_fullStr | COSMO: A dynamic programming algorithm for multicriteria codon optimization |
title_full_unstemmed | COSMO: A dynamic programming algorithm for multicriteria codon optimization |
title_short | COSMO: A dynamic programming algorithm for multicriteria codon optimization |
title_sort | cosmo: a dynamic programming algorithm for multicriteria codon optimization |
topic | Method Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7358382/ https://www.ncbi.nlm.nih.gov/pubmed/32695273 http://dx.doi.org/10.1016/j.csbj.2020.06.035 |
work_keys_str_mv | AT tanedaakito cosmoadynamicprogrammingalgorithmformulticriteriacodonoptimization AT asaikiyoshi cosmoadynamicprogrammingalgorithmformulticriteriacodonoptimization |