Cargando…

Human mitochondrial genome compression using machine learning techniques

BACKGROUND: In recent years, with the development of high-throughput genome sequencing technologies, a large amount of genome data has been generated, which has caused widespread concern about data storage and transmission costs. However, how to effectively compression genome sequences data remains...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Rongjie, Zang, Tianyi, Wang, Yadong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6805717/
https://www.ncbi.nlm.nih.gov/pubmed/31639043
http://dx.doi.org/10.1186/s40246-019-0225-3
_version_ 1783461457787092992
author Wang, Rongjie
Zang, Tianyi
Wang, Yadong
author_facet Wang, Rongjie
Zang, Tianyi
Wang, Yadong
author_sort Wang, Rongjie
collection PubMed
description BACKGROUND: In recent years, with the development of high-throughput genome sequencing technologies, a large amount of genome data has been generated, which has caused widespread concern about data storage and transmission costs. However, how to effectively compression genome sequences data remains an unsolved problem. RESULTS: In this paper, we propose a compression method using machine learning techniques (DeepDNA), for compressing human mitochondrial genome data. The experimental results show the effectiveness of our proposed method compared with other on the human mitochondrial genome data. CONCLUSIONS: The compression method we proposed can be classified as non-reference based method, but the compression effect is comparable to that of reference based methods. Moreover, our method not only have a well compression results in the population genome with large redundancy, but also in the single genome with small redundancy. The codes of DeepDNA are available at https://github.com/rongjiewang/DeepDNA.
format Online
Article
Text
id pubmed-6805717
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-68057172019-10-28 Human mitochondrial genome compression using machine learning techniques Wang, Rongjie Zang, Tianyi Wang, Yadong Hum Genomics Research BACKGROUND: In recent years, with the development of high-throughput genome sequencing technologies, a large amount of genome data has been generated, which has caused widespread concern about data storage and transmission costs. However, how to effectively compression genome sequences data remains an unsolved problem. RESULTS: In this paper, we propose a compression method using machine learning techniques (DeepDNA), for compressing human mitochondrial genome data. The experimental results show the effectiveness of our proposed method compared with other on the human mitochondrial genome data. CONCLUSIONS: The compression method we proposed can be classified as non-reference based method, but the compression effect is comparable to that of reference based methods. Moreover, our method not only have a well compression results in the population genome with large redundancy, but also in the single genome with small redundancy. The codes of DeepDNA are available at https://github.com/rongjiewang/DeepDNA. BioMed Central 2019-10-22 /pmc/articles/PMC6805717/ /pubmed/31639043 http://dx.doi.org/10.1186/s40246-019-0225-3 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Wang, Rongjie
Zang, Tianyi
Wang, Yadong
Human mitochondrial genome compression using machine learning techniques
title Human mitochondrial genome compression using machine learning techniques
title_full Human mitochondrial genome compression using machine learning techniques
title_fullStr Human mitochondrial genome compression using machine learning techniques
title_full_unstemmed Human mitochondrial genome compression using machine learning techniques
title_short Human mitochondrial genome compression using machine learning techniques
title_sort human mitochondrial genome compression using machine learning techniques
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6805717/
https://www.ncbi.nlm.nih.gov/pubmed/31639043
http://dx.doi.org/10.1186/s40246-019-0225-3
work_keys_str_mv AT wangrongjie humanmitochondrialgenomecompressionusingmachinelearningtechniques
AT zangtianyi humanmitochondrialgenomecompressionusingmachinelearningtechniques
AT wangyadong humanmitochondrialgenomecompressionusingmachinelearningtechniques