Cargando…

FANSe: an accurate algorithm for quantitative mapping of large scale sequencing reads

The most crucial step in data processing from high-throughput sequencing applications is the accurate and sensitive alignment of the sequencing reads to reference genomes or transcriptomes. The accurate detection of insertions and deletions (indels) and errors introduced by the sequencing platform o...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zhang, Gong, Fedyunin, Ivan, Kirchner, Sebastian, Xiao, Chuanle, Valleriani, Angelo, Ignatova, Zoya
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Oxford University Press 2012
Materias:	Methods Online
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3367211/ https://www.ncbi.nlm.nih.gov/pubmed/22379138 http://dx.doi.org/10.1093/nar/gks196

_version_	1782234821026643968
author	Zhang, Gong Fedyunin, Ivan Kirchner, Sebastian Xiao, Chuanle Valleriani, Angelo Ignatova, Zoya
author_facet	Zhang, Gong Fedyunin, Ivan Kirchner, Sebastian Xiao, Chuanle Valleriani, Angelo Ignatova, Zoya
author_sort	Zhang, Gong
collection	PubMed
description	The most crucial step in data processing from high-throughput sequencing applications is the accurate and sensitive alignment of the sequencing reads to reference genomes or transcriptomes. The accurate detection of insertions and deletions (indels) and errors introduced by the sequencing platform or by misreading of modified nucleotides is essential for the quantitative processing of the RNA-based sequencing (RNA-Seq) datasets and for the identification of genetic variations and modification patterns. We developed a new, fast and accurate algorithm for nucleic acid sequence analysis, FANSe, with adjustable mismatch allowance settings and ability to handle indels to accurately and quantitatively map millions of reads to small or large reference genomes. It is a seed-based algorithm which uses the whole read information for mapping and high sensitivity and low ambiguity are achieved by using short and non-overlapping reads. Furthermore, FANSe uses hotspot score to prioritize the processing of highly possible matches and implements modified Smith–Watermann refinement with reduced scoring matrix to accelerate the calculation without compromising its sensitivity. The FANSe algorithm stably processes datasets from various sequencing platforms, masked or unmasked and small or large genomes. It shows a remarkable coverage of low-abundance mRNAs which is important for quantitative processing of RNA-Seq datasets.
format	Online Article Text
id	pubmed-3367211
institution	National Center for Biotechnology Information
language	English
publishDate	2012
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-33672112012-06-05 FANSe: an accurate algorithm for quantitative mapping of large scale sequencing reads Zhang, Gong Fedyunin, Ivan Kirchner, Sebastian Xiao, Chuanle Valleriani, Angelo Ignatova, Zoya Nucleic Acids Res Methods Online The most crucial step in data processing from high-throughput sequencing applications is the accurate and sensitive alignment of the sequencing reads to reference genomes or transcriptomes. The accurate detection of insertions and deletions (indels) and errors introduced by the sequencing platform or by misreading of modified nucleotides is essential for the quantitative processing of the RNA-based sequencing (RNA-Seq) datasets and for the identification of genetic variations and modification patterns. We developed a new, fast and accurate algorithm for nucleic acid sequence analysis, FANSe, with adjustable mismatch allowance settings and ability to handle indels to accurately and quantitatively map millions of reads to small or large reference genomes. It is a seed-based algorithm which uses the whole read information for mapping and high sensitivity and low ambiguity are achieved by using short and non-overlapping reads. Furthermore, FANSe uses hotspot score to prioritize the processing of highly possible matches and implements modified Smith–Watermann refinement with reduced scoring matrix to accelerate the calculation without compromising its sensitivity. The FANSe algorithm stably processes datasets from various sequencing platforms, masked or unmasked and small or large genomes. It shows a remarkable coverage of low-abundance mRNAs which is important for quantitative processing of RNA-Seq datasets. Oxford University Press 2012-06 2012-02-29 /pmc/articles/PMC3367211/ /pubmed/22379138 http://dx.doi.org/10.1093/nar/gks196 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Methods Online Zhang, Gong Fedyunin, Ivan Kirchner, Sebastian Xiao, Chuanle Valleriani, Angelo Ignatova, Zoya FANSe: an accurate algorithm for quantitative mapping of large scale sequencing reads
title	FANSe: an accurate algorithm for quantitative mapping of large scale sequencing reads
title_full	FANSe: an accurate algorithm for quantitative mapping of large scale sequencing reads
title_fullStr	FANSe: an accurate algorithm for quantitative mapping of large scale sequencing reads
title_full_unstemmed	FANSe: an accurate algorithm for quantitative mapping of large scale sequencing reads
title_short	FANSe: an accurate algorithm for quantitative mapping of large scale sequencing reads
title_sort	fanse: an accurate algorithm for quantitative mapping of large scale sequencing reads
topic	Methods Online
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3367211/ https://www.ncbi.nlm.nih.gov/pubmed/22379138 http://dx.doi.org/10.1093/nar/gks196
work_keys_str_mv	AT zhanggong fanseanaccuratealgorithmforquantitativemappingoflargescalesequencingreads AT fedyuninivan fanseanaccuratealgorithmforquantitativemappingoflargescalesequencingreads AT kirchnersebastian fanseanaccuratealgorithmforquantitativemappingoflargescalesequencingreads AT xiaochuanle fanseanaccuratealgorithmforquantitativemappingoflargescalesequencingreads AT vallerianiangelo fanseanaccuratealgorithmforquantitativemappingoflargescalesequencingreads AT ignatovazoya fanseanaccuratealgorithmforquantitativemappingoflargescalesequencingreads

FANSe: an accurate algorithm for quantitative mapping of large scale sequencing reads

Ejemplares similares