Cargando…
Multi-allelic positional Burrows-Wheeler transform
BACKGROUND: Recent advances in whole-genome sequencing and SNP array technology have led to the generation of a large amount of genotype data. Large volumes of genotype data will require faster and more efficient methods for storing and searching the data. Positional Burrows-Wheeler Transform (PBWT)...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6551244/ https://www.ncbi.nlm.nih.gov/pubmed/31167638 http://dx.doi.org/10.1186/s12859-019-2821-6 |
_version_ | 1783424363171676160 |
---|---|
author | Naseri, Ardalan Zhi, Degui Zhang, Shaojie |
author_facet | Naseri, Ardalan Zhi, Degui Zhang, Shaojie |
author_sort | Naseri, Ardalan |
collection | PubMed |
description | BACKGROUND: Recent advances in whole-genome sequencing and SNP array technology have led to the generation of a large amount of genotype data. Large volumes of genotype data will require faster and more efficient methods for storing and searching the data. Positional Burrows-Wheeler Transform (PBWT) provides an appropriate data structure for bi-allelic data. With the increasing sample sizes, more multi-allelic sites are expected to be observed. Hence, there is a necessity to handle multi-allelic genotype data. RESULTS: In this paper, we introduce a multi-allelic version of the Positional Burrows-Wheeler Transform (mPBWT) based on the bi-allelic version for compression and searching. The time-complexity for constructing the data structure and searching within a panel containing t-allelic sites increases by a factor of t. CONCLUSION: Considering the small value for the possible alleles t, the time increase for the multi-allelic PBWT will be negligible and comparable to the bi-allelic version of PBWT. |
format | Online Article Text |
id | pubmed-6551244 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-65512442019-06-07 Multi-allelic positional Burrows-Wheeler transform Naseri, Ardalan Zhi, Degui Zhang, Shaojie BMC Bioinformatics Research BACKGROUND: Recent advances in whole-genome sequencing and SNP array technology have led to the generation of a large amount of genotype data. Large volumes of genotype data will require faster and more efficient methods for storing and searching the data. Positional Burrows-Wheeler Transform (PBWT) provides an appropriate data structure for bi-allelic data. With the increasing sample sizes, more multi-allelic sites are expected to be observed. Hence, there is a necessity to handle multi-allelic genotype data. RESULTS: In this paper, we introduce a multi-allelic version of the Positional Burrows-Wheeler Transform (mPBWT) based on the bi-allelic version for compression and searching. The time-complexity for constructing the data structure and searching within a panel containing t-allelic sites increases by a factor of t. CONCLUSION: Considering the small value for the possible alleles t, the time increase for the multi-allelic PBWT will be negligible and comparable to the bi-allelic version of PBWT. BioMed Central 2019-06-06 /pmc/articles/PMC6551244/ /pubmed/31167638 http://dx.doi.org/10.1186/s12859-019-2821-6 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Naseri, Ardalan Zhi, Degui Zhang, Shaojie Multi-allelic positional Burrows-Wheeler transform |
title | Multi-allelic positional Burrows-Wheeler transform |
title_full | Multi-allelic positional Burrows-Wheeler transform |
title_fullStr | Multi-allelic positional Burrows-Wheeler transform |
title_full_unstemmed | Multi-allelic positional Burrows-Wheeler transform |
title_short | Multi-allelic positional Burrows-Wheeler transform |
title_sort | multi-allelic positional burrows-wheeler transform |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6551244/ https://www.ncbi.nlm.nih.gov/pubmed/31167638 http://dx.doi.org/10.1186/s12859-019-2821-6 |
work_keys_str_mv | AT naseriardalan multiallelicpositionalburrowswheelertransform AT zhidegui multiallelicpositionalburrowswheelertransform AT zhangshaojie multiallelicpositionalburrowswheelertransform |