Cargando…

High information capacity DNA-based data storage with augmented encoding characters using degenerate bases

DNA-based data storage has emerged as a promising method to satisfy the exponentially increasing demand for information storage. However, practical implementation of DNA-based data storage remains a challenge because of the high cost of data writing through DNA synthesis. Here, we propose the use of...

Descripción completa

Detalles Bibliográficos
Autores principales: Choi, Yeongjae, Ryu, Taehoon, Lee, Amos C., Choi, Hansol, Lee, Hansaem, Park, Jaejun, Song, Suk-Heung, Kim, Seojoo, Kim, Hyeli, Park, Wook, Kwon, Sunghoon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6488701/
https://www.ncbi.nlm.nih.gov/pubmed/31036920
http://dx.doi.org/10.1038/s41598-019-43105-w
_version_ 1783414693132500992
author Choi, Yeongjae
Ryu, Taehoon
Lee, Amos C.
Choi, Hansol
Lee, Hansaem
Park, Jaejun
Song, Suk-Heung
Kim, Seojoo
Kim, Hyeli
Park, Wook
Kwon, Sunghoon
author_facet Choi, Yeongjae
Ryu, Taehoon
Lee, Amos C.
Choi, Hansol
Lee, Hansaem
Park, Jaejun
Song, Suk-Heung
Kim, Seojoo
Kim, Hyeli
Park, Wook
Kwon, Sunghoon
author_sort Choi, Yeongjae
collection PubMed
description DNA-based data storage has emerged as a promising method to satisfy the exponentially increasing demand for information storage. However, practical implementation of DNA-based data storage remains a challenge because of the high cost of data writing through DNA synthesis. Here, we propose the use of degenerate bases as encoding characters in addition to A, C, G, and T, which augments the amount of data that can be stored per length of DNA sequence designed (information capacity) and lowering the amount of DNA synthesis per storing unit data. Using the proposed method, we experimentally achieved an information capacity of 3.37 bits/character. The demonstrated information capacity is more than twice when compared to the highest information capacity previously achieved. The proposed method can be integrated with synthetic technologies in the future to reduce the cost of DNA-based data storage by 50%.
format Online
Article
Text
id pubmed-6488701
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-64887012019-05-16 High information capacity DNA-based data storage with augmented encoding characters using degenerate bases Choi, Yeongjae Ryu, Taehoon Lee, Amos C. Choi, Hansol Lee, Hansaem Park, Jaejun Song, Suk-Heung Kim, Seojoo Kim, Hyeli Park, Wook Kwon, Sunghoon Sci Rep Article DNA-based data storage has emerged as a promising method to satisfy the exponentially increasing demand for information storage. However, practical implementation of DNA-based data storage remains a challenge because of the high cost of data writing through DNA synthesis. Here, we propose the use of degenerate bases as encoding characters in addition to A, C, G, and T, which augments the amount of data that can be stored per length of DNA sequence designed (information capacity) and lowering the amount of DNA synthesis per storing unit data. Using the proposed method, we experimentally achieved an information capacity of 3.37 bits/character. The demonstrated information capacity is more than twice when compared to the highest information capacity previously achieved. The proposed method can be integrated with synthetic technologies in the future to reduce the cost of DNA-based data storage by 50%. Nature Publishing Group UK 2019-04-29 /pmc/articles/PMC6488701/ /pubmed/31036920 http://dx.doi.org/10.1038/s41598-019-43105-w Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Choi, Yeongjae
Ryu, Taehoon
Lee, Amos C.
Choi, Hansol
Lee, Hansaem
Park, Jaejun
Song, Suk-Heung
Kim, Seojoo
Kim, Hyeli
Park, Wook
Kwon, Sunghoon
High information capacity DNA-based data storage with augmented encoding characters using degenerate bases
title High information capacity DNA-based data storage with augmented encoding characters using degenerate bases
title_full High information capacity DNA-based data storage with augmented encoding characters using degenerate bases
title_fullStr High information capacity DNA-based data storage with augmented encoding characters using degenerate bases
title_full_unstemmed High information capacity DNA-based data storage with augmented encoding characters using degenerate bases
title_short High information capacity DNA-based data storage with augmented encoding characters using degenerate bases
title_sort high information capacity dna-based data storage with augmented encoding characters using degenerate bases
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6488701/
https://www.ncbi.nlm.nih.gov/pubmed/31036920
http://dx.doi.org/10.1038/s41598-019-43105-w
work_keys_str_mv AT choiyeongjae highinformationcapacitydnabaseddatastoragewithaugmentedencodingcharactersusingdegeneratebases
AT ryutaehoon highinformationcapacitydnabaseddatastoragewithaugmentedencodingcharactersusingdegeneratebases
AT leeamosc highinformationcapacitydnabaseddatastoragewithaugmentedencodingcharactersusingdegeneratebases
AT choihansol highinformationcapacitydnabaseddatastoragewithaugmentedencodingcharactersusingdegeneratebases
AT leehansaem highinformationcapacitydnabaseddatastoragewithaugmentedencodingcharactersusingdegeneratebases
AT parkjaejun highinformationcapacitydnabaseddatastoragewithaugmentedencodingcharactersusingdegeneratebases
AT songsukheung highinformationcapacitydnabaseddatastoragewithaugmentedencodingcharactersusingdegeneratebases
AT kimseojoo highinformationcapacitydnabaseddatastoragewithaugmentedencodingcharactersusingdegeneratebases
AT kimhyeli highinformationcapacitydnabaseddatastoragewithaugmentedencodingcharactersusingdegeneratebases
AT parkwook highinformationcapacitydnabaseddatastoragewithaugmentedencodingcharactersusingdegeneratebases
AT kwonsunghoon highinformationcapacitydnabaseddatastoragewithaugmentedencodingcharactersusingdegeneratebases