Cargando…
Improving alignment accuracy on homopolymer regions for semiconductor-based sequencing technologies
BACKGROUND: Ion Torrent and Ion Proton are semiconductor-based sequencing technologies that feature rapid sequencing speed and low upfront and operating costs, thanks to the avoidance of modified nucleotides and optical measurements. Despite of these advantages, however, Ion semiconductor sequencing...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5001236/ https://www.ncbi.nlm.nih.gov/pubmed/27556417 http://dx.doi.org/10.1186/s12864-016-2894-9 |
_version_ | 1782450437025169408 |
---|---|
author | Feng, Weixing Zhao, Sen Xue, Dingkai Song, Fengfei Li, Ziwei Chen, Duojiao He, Bo Hao, Yangyang Wang, Yadong Liu, Yunlong |
author_facet | Feng, Weixing Zhao, Sen Xue, Dingkai Song, Fengfei Li, Ziwei Chen, Duojiao He, Bo Hao, Yangyang Wang, Yadong Liu, Yunlong |
author_sort | Feng, Weixing |
collection | PubMed |
description | BACKGROUND: Ion Torrent and Ion Proton are semiconductor-based sequencing technologies that feature rapid sequencing speed and low upfront and operating costs, thanks to the avoidance of modified nucleotides and optical measurements. Despite of these advantages, however, Ion semiconductor sequencing technologies suffer much reduced sequencing accuracy at the genomic loci with homopolymer repeats of the same nucleotide. Such limitation significantly reduces its efficiency for the biological applications aiming at accurately identifying various genetic variants. RESULTS: In this study, we propose a Bayesian inference-based method that takes the advantage of the signal distributions of the electrical voltages that are measured for all the homopolymers of a fixed length. By cross-referencing the length of homopolymers in the reference genome and the voltage signal distribution derived from the experiment, the proposed integrated model significantly improves the alignment accuracy around the homopolymer regions. CONCLUSIONS: Besides improving alignment accuracy on homopolymer regions for semiconductor-based sequencing technologies with the proposed model, similar strategies can also be used on other high-throughput sequencing technologies that share similar limitations. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2894-9) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5001236 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-50012362016-09-06 Improving alignment accuracy on homopolymer regions for semiconductor-based sequencing technologies Feng, Weixing Zhao, Sen Xue, Dingkai Song, Fengfei Li, Ziwei Chen, Duojiao He, Bo Hao, Yangyang Wang, Yadong Liu, Yunlong BMC Genomics Research BACKGROUND: Ion Torrent and Ion Proton are semiconductor-based sequencing technologies that feature rapid sequencing speed and low upfront and operating costs, thanks to the avoidance of modified nucleotides and optical measurements. Despite of these advantages, however, Ion semiconductor sequencing technologies suffer much reduced sequencing accuracy at the genomic loci with homopolymer repeats of the same nucleotide. Such limitation significantly reduces its efficiency for the biological applications aiming at accurately identifying various genetic variants. RESULTS: In this study, we propose a Bayesian inference-based method that takes the advantage of the signal distributions of the electrical voltages that are measured for all the homopolymers of a fixed length. By cross-referencing the length of homopolymers in the reference genome and the voltage signal distribution derived from the experiment, the proposed integrated model significantly improves the alignment accuracy around the homopolymer regions. CONCLUSIONS: Besides improving alignment accuracy on homopolymer regions for semiconductor-based sequencing technologies with the proposed model, similar strategies can also be used on other high-throughput sequencing technologies that share similar limitations. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2894-9) contains supplementary material, which is available to authorized users. BioMed Central 2016-08-22 /pmc/articles/PMC5001236/ /pubmed/27556417 http://dx.doi.org/10.1186/s12864-016-2894-9 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Feng, Weixing Zhao, Sen Xue, Dingkai Song, Fengfei Li, Ziwei Chen, Duojiao He, Bo Hao, Yangyang Wang, Yadong Liu, Yunlong Improving alignment accuracy on homopolymer regions for semiconductor-based sequencing technologies |
title | Improving alignment accuracy on homopolymer regions for semiconductor-based sequencing technologies |
title_full | Improving alignment accuracy on homopolymer regions for semiconductor-based sequencing technologies |
title_fullStr | Improving alignment accuracy on homopolymer regions for semiconductor-based sequencing technologies |
title_full_unstemmed | Improving alignment accuracy on homopolymer regions for semiconductor-based sequencing technologies |
title_short | Improving alignment accuracy on homopolymer regions for semiconductor-based sequencing technologies |
title_sort | improving alignment accuracy on homopolymer regions for semiconductor-based sequencing technologies |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5001236/ https://www.ncbi.nlm.nih.gov/pubmed/27556417 http://dx.doi.org/10.1186/s12864-016-2894-9 |
work_keys_str_mv | AT fengweixing improvingalignmentaccuracyonhomopolymerregionsforsemiconductorbasedsequencingtechnologies AT zhaosen improvingalignmentaccuracyonhomopolymerregionsforsemiconductorbasedsequencingtechnologies AT xuedingkai improvingalignmentaccuracyonhomopolymerregionsforsemiconductorbasedsequencingtechnologies AT songfengfei improvingalignmentaccuracyonhomopolymerregionsforsemiconductorbasedsequencingtechnologies AT liziwei improvingalignmentaccuracyonhomopolymerregionsforsemiconductorbasedsequencingtechnologies AT chenduojiao improvingalignmentaccuracyonhomopolymerregionsforsemiconductorbasedsequencingtechnologies AT hebo improvingalignmentaccuracyonhomopolymerregionsforsemiconductorbasedsequencingtechnologies AT haoyangyang improvingalignmentaccuracyonhomopolymerregionsforsemiconductorbasedsequencingtechnologies AT wangyadong improvingalignmentaccuracyonhomopolymerregionsforsemiconductorbasedsequencingtechnologies AT liuyunlong improvingalignmentaccuracyonhomopolymerregionsforsemiconductorbasedsequencingtechnologies |