Cargando…

DDQR (dynamic DNA QR coding): An efficient algorithm to represent DNA barcode sequences

A DNA barcode is a short piece of standard DNA sequence used for species determination and discrimination. Representation of DNA barcodes is essential for DNA barcodes’ applications in the transportation and recognition of biological materials. Previously, we have compared different strategies for r...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Yujun, Yao, Xinjing, Liu, Rui, Liu, Chang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9844917/
https://www.ncbi.nlm.nih.gov/pubmed/36649276
http://dx.doi.org/10.1371/journal.pone.0279994
_version_ 1784870767171731456
author Wang, Yujun
Yao, Xinjing
Liu, Rui
Liu, Chang
author_facet Wang, Yujun
Yao, Xinjing
Liu, Rui
Liu, Chang
author_sort Wang, Yujun
collection PubMed
description A DNA barcode is a short piece of standard DNA sequence used for species determination and discrimination. Representation of DNA barcodes is essential for DNA barcodes’ applications in the transportation and recognition of biological materials. Previously, we have compared different strategies for representing the DNA barcodes. In the present study, we have developed a compression algorithm based on binary coding or Huffman coding scheme, followed by converting the binary digits into Base64 digits. The combination of this compression algorithm and the QR representation leads to the dynamic DNA QR coding algorithm (DDQR). We tested the DDQR algorithm on simulated data and real DNA barcode sequences from the commonly used plant and animal DNA barcode markers: rbcL, matK, trnH-psbA, ITS2, and COI. We compared the compression efficiency of DDQR and another state-of-the-art DNA compression algorithm GeCo3 for sequences with various base compositions and lengths. We found that DDQR had a higher compression rate than GeCo3 for DNA sequences shorter than 800 bp, which is the typical size range for DNA barcodes. We also upgraded a web server (http://www.1kmpg.cn/ddqr) that provides three functions: retrieval of DNA barcode sequences, encoding DNA barcode sequences to DDQR codes, and decoding DDQR codes to DNA barcode sequences. The DDQR algorithm and the webserver will be invaluable to applying DNA barcode technology in the food and traditional medicine industries.
format Online
Article
Text
id pubmed-9844917
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-98449172023-01-18 DDQR (dynamic DNA QR coding): An efficient algorithm to represent DNA barcode sequences Wang, Yujun Yao, Xinjing Liu, Rui Liu, Chang PLoS One Research Article A DNA barcode is a short piece of standard DNA sequence used for species determination and discrimination. Representation of DNA barcodes is essential for DNA barcodes’ applications in the transportation and recognition of biological materials. Previously, we have compared different strategies for representing the DNA barcodes. In the present study, we have developed a compression algorithm based on binary coding or Huffman coding scheme, followed by converting the binary digits into Base64 digits. The combination of this compression algorithm and the QR representation leads to the dynamic DNA QR coding algorithm (DDQR). We tested the DDQR algorithm on simulated data and real DNA barcode sequences from the commonly used plant and animal DNA barcode markers: rbcL, matK, trnH-psbA, ITS2, and COI. We compared the compression efficiency of DDQR and another state-of-the-art DNA compression algorithm GeCo3 for sequences with various base compositions and lengths. We found that DDQR had a higher compression rate than GeCo3 for DNA sequences shorter than 800 bp, which is the typical size range for DNA barcodes. We also upgraded a web server (http://www.1kmpg.cn/ddqr) that provides three functions: retrieval of DNA barcode sequences, encoding DNA barcode sequences to DDQR codes, and decoding DDQR codes to DNA barcode sequences. The DDQR algorithm and the webserver will be invaluable to applying DNA barcode technology in the food and traditional medicine industries. Public Library of Science 2023-01-17 /pmc/articles/PMC9844917/ /pubmed/36649276 http://dx.doi.org/10.1371/journal.pone.0279994 Text en © 2023 Wang et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Wang, Yujun
Yao, Xinjing
Liu, Rui
Liu, Chang
DDQR (dynamic DNA QR coding): An efficient algorithm to represent DNA barcode sequences
title DDQR (dynamic DNA QR coding): An efficient algorithm to represent DNA barcode sequences
title_full DDQR (dynamic DNA QR coding): An efficient algorithm to represent DNA barcode sequences
title_fullStr DDQR (dynamic DNA QR coding): An efficient algorithm to represent DNA barcode sequences
title_full_unstemmed DDQR (dynamic DNA QR coding): An efficient algorithm to represent DNA barcode sequences
title_short DDQR (dynamic DNA QR coding): An efficient algorithm to represent DNA barcode sequences
title_sort ddqr (dynamic dna qr coding): an efficient algorithm to represent dna barcode sequences
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9844917/
https://www.ncbi.nlm.nih.gov/pubmed/36649276
http://dx.doi.org/10.1371/journal.pone.0279994
work_keys_str_mv AT wangyujun ddqrdynamicdnaqrcodinganefficientalgorithmtorepresentdnabarcodesequences
AT yaoxinjing ddqrdynamicdnaqrcodinganefficientalgorithmtorepresentdnabarcodesequences
AT liurui ddqrdynamicdnaqrcodinganefficientalgorithmtorepresentdnabarcodesequences
AT liuchang ddqrdynamicdnaqrcodinganefficientalgorithmtorepresentdnabarcodesequences