
Empirical Transition Probability Indexing Sparse-Coding Belief Propagation (ETPI-SCoBeP) Genome Sequence Alignment

Advances in human genome sequencing technology have significantly reduced the cost of data generation and overwhelmed the computing capacity available for sequence analysis. Efficiency, efficacy, and scalability remain challenging in sequence alignment, which is an important and foundational operation for g...


Bibliographic Details
Main authors: Roozgard, Aminmohammad; Barzigar, Nafise; Wang, Shuang; Jiang, Xiaoqian; Cheng, Samuel
Format: Online Article Text
Language: English
Published: Libertas Academica 2015
Subjects: Methodology
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4426956/
https://www.ncbi.nlm.nih.gov/pubmed/25983537
http://dx.doi.org/10.4137/CIN.S13887
_version_ 1782370658975481856
author Roozgard, Aminmohammad
Barzigar, Nafise
Wang, Shuang
Jiang, Xiaoqian
Cheng, Samuel
collection PubMed
description Advances in human genome sequencing technology have significantly reduced the cost of data generation and overwhelmed the computing capacity available for sequence analysis. Efficiency, efficacy, and scalability remain challenging in sequence alignment, which is an important and foundational operation for genome data analysis. In this paper, we propose a two-stage approach to tackle this problem. In the preprocessing step, we match blocks of reference and target sequences based on the similarities between their empirical transition probability distributions using belief propagation. We then conduct a refined match using our recently published sparse-coding belief propagation (SCoBeP) technique. Our experimental results demonstrate robustness in nucleotide sequence alignment and are competitive with those of the SOAP aligner and the BWA algorithm. Moreover, compared to SCoBeP alignment, the proposed technique can handle much longer sequences.
format Online
Article
Text
id pubmed-4426956
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-4426956 2015-05-15 Cancer Inform Methodology Libertas Academica 2015-02-01 /pmc/articles/PMC4426956/ /pubmed/25983537 http://dx.doi.org/10.4137/CIN.S13887 Text en © 2014 the author(s), publisher and licensee Libertas Academica Limited. This is an open-access article distributed under the terms of the Creative Commons CC-BY-NC 3.0 License.
title Empirical Transition Probability Indexing Sparse-Coding Belief Propagation (ETPI-SCoBeP) Genome Sequence Alignment
topic Methodology
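
The preprocessing step described in the abstract matches blocks of the reference and target sequences by comparing their empirical transition probability distributions. The Python sketch below illustrates only that comparison, under assumptions that are not taken from the record: a first-order (nucleotide-to-nucleotide) transition model, fixed-size non-overlapping blocks, an L1 distance, and a plain nearest-block search standing in for the belief propagation the authors use. All function names are hypothetical.

```python
from collections import Counter

ALPHABET = "ACGT"


def transition_probs(block):
    """Empirical first-order transition probabilities of a nucleotide block.

    Assumption: a first-order model over A, C, G, T; other symbols are ignored.
    """
    counts = Counter(zip(block, block[1:]))
    probs = {}
    for a in ALPHABET:
        row_total = sum(counts[(a, b)] for b in ALPHABET)
        for b in ALPHABET:
            # Rows with no observed transitions are left at zero.
            probs[(a, b)] = counts[(a, b)] / row_total if row_total else 0.0
    return probs


def l1_distance(p, q):
    """L1 distance between two transition tables (an assumed similarity measure)."""
    return sum(abs(p[k] - q[k]) for k in p)


def split_blocks(seq, size):
    """Cut a sequence into non-overlapping blocks of the given size."""
    return [seq[i:i + size] for i in range(0, len(seq) - size + 1, size)]


def best_block(target_block, reference, size):
    """Index of the reference block whose transition table is closest.

    Assumption: a simple nearest-distance search replaces belief propagation.
    """
    target = transition_probs(target_block)
    blocks = split_blocks(reference, size)
    dists = [l1_distance(target, transition_probs(b)) for b in blocks]
    return min(range(len(blocks)), key=dists.__getitem__)


if __name__ == "__main__":
    reference = "ACACACACGGGGGGGGATATATAT"
    print(best_block("TATATATA", reference, 8))
```

In this toy reference, the target block `TATATATA` shares its A/T transition pattern with the third reference block even though the two strings are not identical, which is the kind of coarse candidate selection a refined alignment stage could then work from.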