
Optimizing DNA assembly based on statistical language modelling

By successively assembling genetic parts such as BioBrick according to grammatical models, complex genetic constructs composed of dozens of functional blocks can be built. However, usually every category of genetic parts includes a few or many parts. With increasing quantity of genetic parts, the pr...


Bibliographic Details
Main Authors: Fang, Gang; Zhang, Shemin; Dong, Yafei
Format: Online Article Text
Language: English
Published: Oxford University Press 2017
Subjects: Methods Online
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5727464/
https://www.ncbi.nlm.nih.gov/pubmed/29036490
http://dx.doi.org/10.1093/nar/gkx859
_version_ 1783285888302710784
author Fang, Gang
Zhang, Shemin
Dong, Yafei
collection PubMed
description By successively assembling genetic parts such as BioBricks according to grammatical models, complex genetic constructs composed of dozens of functional blocks can be built. However, every category of genetic parts usually contains several to many parts. As the number of parts grows, assembling more than a few sets of them becomes expensive, time-consuming and error-prone, and at the last step of assembly it is difficult to decide which part should be selected. Using a statistical language model, a probability distribution P(s) over strings s that attempts to reflect how frequently a string s occurs as a sentence, the most commonly used parts are selected. A dynamic programming algorithm is then designed to find the solution of maximum probability. The algorithm optimizes the results of a genetic design based on a grammatical model and finds an optimal solution. In this way, redundant operations can be reduced and the time and cost of biological experiments can be minimized.
format Online
Article
Text
id pubmed-5727464
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-5727464 2017-12-18 Optimizing DNA assembly based on statistical language modelling Fang, Gang Zhang, Shemin Dong, Yafei Nucleic Acids Res Methods Online Oxford University Press 2017-12-15 2017-09-28 /pmc/articles/PMC5727464/ /pubmed/29036490 http://dx.doi.org/10.1093/nar/gkx859 Text en © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
title Optimizing DNA assembly based on statistical language modelling
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5727464/
https://www.ncbi.nlm.nih.gov/pubmed/29036490
http://dx.doi.org/10.1093/nar/gkx859
work_keys_str_mv AT fanggang optimizingdnaassemblybasedonstatisticallanguagemodelling
AT zhangshemin optimizingdnaassemblybasedonstatisticallanguagemodelling
AT dongyafei optimizingdnaassemblybasedonstatisticallanguagemodelling
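
The abstract describes scoring candidate constructs with a statistical language model and using dynamic programming to find the maximum-probability choice of parts. The record does not give the authors' model or algorithm in detail, so the following is only a minimal sketch under stated assumptions: a bigram model with add-one smoothing, one part chosen per category in a fixed order, and a Viterbi-style recurrence. The part names, the toy corpus, and the function names are all hypothetical.

```python
import math
from collections import Counter

START = "<s>"  # hypothetical start-of-construct token


def train_bigram(corpus):
    """Count contexts and bigrams over previously built constructs."""
    contexts, bigrams = Counter(), Counter()
    for construct in corpus:
        tokens = [START] + list(construct)
        for prev, cur in zip(tokens, tokens[1:]):
            contexts[prev] += 1
            bigrams[(prev, cur)] += 1
    return contexts, bigrams


def log_prob(prev, cur, contexts, bigrams, vocab_size):
    """Add-one smoothed log P(cur | prev); smoothing is an assumption here."""
    return math.log((bigrams[(prev, cur)] + 1) / (contexts[prev] + vocab_size))


def best_assembly(slots, contexts, bigrams, vocab_size):
    """Viterbi-style DP: pick one part per slot to maximise the bigram probability.

    best maps the last chosen part to (log-probability, path) of the best
    partial construct ending in that part.
    """
    best = {START: (0.0, [])}
    for candidates in slots:
        nxt = {}
        for cur in candidates:
            score, path = max(
                (
                    (s + log_prob(prev, cur, contexts, bigrams, vocab_size), p)
                    for prev, (s, p) in best.items()
                ),
                key=lambda item: item[0],
            )
            nxt[cur] = (score, path + [cur])
        best = nxt
    return max(best.values(), key=lambda item: item[0])


# Toy corpus of earlier constructs (hypothetical part names, not real BioBrick IDs).
corpus = [
    ("pA", "rbs1", "gfp", "t1"),
    ("pA", "rbs1", "gfp", "t1"),
    ("pA", "rbs2", "rfp", "t1"),
    ("pB", "rbs2", "rfp", "t2"),
]
# One list of candidate parts per category: promoter, RBS, coding sequence, terminator.
slots = [["pA", "pB"], ["rbs1", "rbs2"], ["gfp", "rfp"], ["t1", "t2"]]
vocab_size = len({part for candidates in slots for part in candidates})

contexts, bigrams = train_bigram(corpus)
score, path = best_assembly(slots, contexts, bigrams, vocab_size)
print(path, math.exp(score))
```

The recurrence keeps only the best partial construct per last-chosen part, so the work grows with the number of slots times the square of the candidates per slot, rather than with the number of full combinations.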