Cargando…

Improving alignment accuracy on homopolymer regions for semiconductor-based sequencing technologies

BACKGROUND: Ion Torrent and Ion Proton are semiconductor-based sequencing technologies that feature rapid sequencing speed and low upfront and operating costs, thanks to the avoidance of modified nucleotides and optical measurements. Despite of these advantages, however, Ion semiconductor sequencing...

Descripción completa

Detalles Bibliográficos
Autores principales: Feng, Weixing, Zhao, Sen, Xue, Dingkai, Song, Fengfei, Li, Ziwei, Chen, Duojiao, He, Bo, Hao, Yangyang, Wang, Yadong, Liu, Yunlong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5001236/
https://www.ncbi.nlm.nih.gov/pubmed/27556417
http://dx.doi.org/10.1186/s12864-016-2894-9
_version_ 1782450437025169408
author Feng, Weixing
Zhao, Sen
Xue, Dingkai
Song, Fengfei
Li, Ziwei
Chen, Duojiao
He, Bo
Hao, Yangyang
Wang, Yadong
Liu, Yunlong
author_facet Feng, Weixing
Zhao, Sen
Xue, Dingkai
Song, Fengfei
Li, Ziwei
Chen, Duojiao
He, Bo
Hao, Yangyang
Wang, Yadong
Liu, Yunlong
author_sort Feng, Weixing
collection PubMed
description BACKGROUND: Ion Torrent and Ion Proton are semiconductor-based sequencing technologies that feature rapid sequencing speed and low upfront and operating costs, thanks to the avoidance of modified nucleotides and optical measurements. Despite of these advantages, however, Ion semiconductor sequencing technologies suffer much reduced sequencing accuracy at the genomic loci with homopolymer repeats of the same nucleotide. Such limitation significantly reduces its efficiency for the biological applications aiming at accurately identifying various genetic variants. RESULTS: In this study, we propose a Bayesian inference-based method that takes the advantage of the signal distributions of the electrical voltages that are measured for all the homopolymers of a fixed length. By cross-referencing the length of homopolymers in the reference genome and the voltage signal distribution derived from the experiment, the proposed integrated model significantly improves the alignment accuracy around the homopolymer regions. CONCLUSIONS: Besides improving alignment accuracy on homopolymer regions for semiconductor-based sequencing technologies with the proposed model, similar strategies can also be used on other high-throughput sequencing technologies that share similar limitations. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2894-9) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5001236
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-50012362016-09-06 Improving alignment accuracy on homopolymer regions for semiconductor-based sequencing technologies Feng, Weixing Zhao, Sen Xue, Dingkai Song, Fengfei Li, Ziwei Chen, Duojiao He, Bo Hao, Yangyang Wang, Yadong Liu, Yunlong BMC Genomics Research BACKGROUND: Ion Torrent and Ion Proton are semiconductor-based sequencing technologies that feature rapid sequencing speed and low upfront and operating costs, thanks to the avoidance of modified nucleotides and optical measurements. Despite of these advantages, however, Ion semiconductor sequencing technologies suffer much reduced sequencing accuracy at the genomic loci with homopolymer repeats of the same nucleotide. Such limitation significantly reduces its efficiency for the biological applications aiming at accurately identifying various genetic variants. RESULTS: In this study, we propose a Bayesian inference-based method that takes the advantage of the signal distributions of the electrical voltages that are measured for all the homopolymers of a fixed length. By cross-referencing the length of homopolymers in the reference genome and the voltage signal distribution derived from the experiment, the proposed integrated model significantly improves the alignment accuracy around the homopolymer regions. CONCLUSIONS: Besides improving alignment accuracy on homopolymer regions for semiconductor-based sequencing technologies with the proposed model, similar strategies can also be used on other high-throughput sequencing technologies that share similar limitations. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2894-9) contains supplementary material, which is available to authorized users. BioMed Central 2016-08-22 /pmc/articles/PMC5001236/ /pubmed/27556417 http://dx.doi.org/10.1186/s12864-016-2894-9 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Feng, Weixing
Zhao, Sen
Xue, Dingkai
Song, Fengfei
Li, Ziwei
Chen, Duojiao
He, Bo
Hao, Yangyang
Wang, Yadong
Liu, Yunlong
Improving alignment accuracy on homopolymer regions for semiconductor-based sequencing technologies
title Improving alignment accuracy on homopolymer regions for semiconductor-based sequencing technologies
title_full Improving alignment accuracy on homopolymer regions for semiconductor-based sequencing technologies
title_fullStr Improving alignment accuracy on homopolymer regions for semiconductor-based sequencing technologies
title_full_unstemmed Improving alignment accuracy on homopolymer regions for semiconductor-based sequencing technologies
title_short Improving alignment accuracy on homopolymer regions for semiconductor-based sequencing technologies
title_sort improving alignment accuracy on homopolymer regions for semiconductor-based sequencing technologies
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5001236/
https://www.ncbi.nlm.nih.gov/pubmed/27556417
http://dx.doi.org/10.1186/s12864-016-2894-9
work_keys_str_mv AT fengweixing improvingalignmentaccuracyonhomopolymerregionsforsemiconductorbasedsequencingtechnologies
AT zhaosen improvingalignmentaccuracyonhomopolymerregionsforsemiconductorbasedsequencingtechnologies
AT xuedingkai improvingalignmentaccuracyonhomopolymerregionsforsemiconductorbasedsequencingtechnologies
AT songfengfei improvingalignmentaccuracyonhomopolymerregionsforsemiconductorbasedsequencingtechnologies
AT liziwei improvingalignmentaccuracyonhomopolymerregionsforsemiconductorbasedsequencingtechnologies
AT chenduojiao improvingalignmentaccuracyonhomopolymerregionsforsemiconductorbasedsequencingtechnologies
AT hebo improvingalignmentaccuracyonhomopolymerregionsforsemiconductorbasedsequencingtechnologies
AT haoyangyang improvingalignmentaccuracyonhomopolymerregionsforsemiconductorbasedsequencingtechnologies
AT wangyadong improvingalignmentaccuracyonhomopolymerregionsforsemiconductorbasedsequencingtechnologies
AT liuyunlong improvingalignmentaccuracyonhomopolymerregionsforsemiconductorbasedsequencingtechnologies