
BM-BC: a Bayesian method of base calling for Solexa sequence data

Base calling is a critical step in the Solexa next-generation sequencing procedure. It compares the position-specific intensity measurements that reflect the signal strength of four possible bases (A, C, G, T) at each genomic position, and outputs estimates of the true sequences for short reads of D...


Bibliographic Details
Main authors: Ji, Yuan; Mitra, Riten; Quintana, Fernando; Jara, Alejandro; Mueller, Peter; Liu, Ping; Lu, Yue; Liang, Shoudan
Format: Online Article Text
Language: English
Published: BioMed Central 2012
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3426806/
https://www.ncbi.nlm.nih.gov/pubmed/23320938
http://dx.doi.org/10.1186/1471-2105-13-S13-S6
_version_ 1782241546463084544
author Ji, Yuan
Mitra, Riten
Quintana, Fernando
Jara, Alejandro
Mueller, Peter
Liu, Ping
Lu, Yue
Liang, Shoudan
author_facet Ji, Yuan
Mitra, Riten
Quintana, Fernando
Jara, Alejandro
Mueller, Peter
Liu, Ping
Lu, Yue
Liang, Shoudan
author_sort Ji, Yuan
collection PubMed
description Base calling is a critical step in the Solexa next-generation sequencing procedure. It compares the position-specific intensity measurements that reflect the signal strength of four possible bases (A, C, G, T) at each genomic position, and outputs estimates of the true sequences for short reads of DNA or RNA. We present a Bayesian method of base calling, BM-BC, for Solexa-GA sequencing data. The Bayesian method builds on a hierarchical model that accounts for three sources of noise in the data, which are known to affect the accuracy of the base calls: fading, phasing, and cross-talk between channels. We show that the new method improves the precision of base calling compared with currently leading methods. Furthermore, the proposed method provides a probability score that measures the confidence of each base call. This probability score can be used to estimate the false discovery rate of the base calling or to rank the precision of the estimated DNA sequences, which in turn can be useful for downstream analysis such as sequence alignment.
format Online
Article
Text
id pubmed-3426806
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-3426806 2012-08-24 BM-BC: a Bayesian method of base calling for Solexa sequence data Ji, Yuan; Mitra, Riten; Quintana, Fernando; Jara, Alejandro; Mueller, Peter; Liu, Ping; Lu, Yue; Liang, Shoudan BMC Bioinformatics Research BioMed Central 2012-08-24 /pmc/articles/PMC3426806/ /pubmed/23320938 http://dx.doi.org/10.1186/1471-2105-13-S13-S6 Text en Copyright ©2012 Ji et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Ji, Yuan
Mitra, Riten
Quintana, Fernando
Jara, Alejandro
Mueller, Peter
Liu, Ping
Lu, Yue
Liang, Shoudan
BM-BC: a Bayesian method of base calling for Solexa sequence data
title BM-BC: a Bayesian method of base calling for Solexa sequence data
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3426806/
https://www.ncbi.nlm.nih.gov/pubmed/23320938
http://dx.doi.org/10.1186/1471-2105-13-S13-S6