Cargando…

Afann: bias adjustment for alignment-free sequence comparison based on sequencing data using neural network regression

Alignment-free methods, more time and memory efficient than alignment-based methods, have been widely used for comparing genome sequences or raw sequencing samples without assembly. However, in this study, we show that alignment-free dissimilarity calculated based on sequencing samples can be overes...

Descripción completa

Detalles Bibliográficos
Autores principales: Tang, Kujin, Ren, Jie, Sun, Fengzhu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6891986/
https://www.ncbi.nlm.nih.gov/pubmed/31801606
http://dx.doi.org/10.1186/s13059-019-1872-3
_version_ 1783475939399696384
author Tang, Kujin
Ren, Jie
Sun, Fengzhu
author_facet Tang, Kujin
Ren, Jie
Sun, Fengzhu
author_sort Tang, Kujin
collection PubMed
description Alignment-free methods, more time and memory efficient than alignment-based methods, have been widely used for comparing genome sequences or raw sequencing samples without assembly. However, in this study, we show that alignment-free dissimilarity calculated based on sequencing samples can be overestimated compared with the dissimilarity calculated based on their genomes, and this bias can significantly decrease the performance of the alignment-free analysis. Here, we introduce a new alignment-free tool, Alignment-Free methods Adjusted by Neural Network (Afann) that successfully adjusts this bias and achieves excellent performance on various independent datasets. Afann is freely available at https://github.com/GeniusTang/Afann.
format Online
Article
Text
id pubmed-6891986
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-68919862019-12-11 Afann: bias adjustment for alignment-free sequence comparison based on sequencing data using neural network regression Tang, Kujin Ren, Jie Sun, Fengzhu Genome Biol Method Alignment-free methods, more time and memory efficient than alignment-based methods, have been widely used for comparing genome sequences or raw sequencing samples without assembly. However, in this study, we show that alignment-free dissimilarity calculated based on sequencing samples can be overestimated compared with the dissimilarity calculated based on their genomes, and this bias can significantly decrease the performance of the alignment-free analysis. Here, we introduce a new alignment-free tool, Alignment-Free methods Adjusted by Neural Network (Afann) that successfully adjusts this bias and achieves excellent performance on various independent datasets. Afann is freely available at https://github.com/GeniusTang/Afann. BioMed Central 2019-12-04 /pmc/articles/PMC6891986/ /pubmed/31801606 http://dx.doi.org/10.1186/s13059-019-1872-3 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Method
Tang, Kujin
Ren, Jie
Sun, Fengzhu
Afann: bias adjustment for alignment-free sequence comparison based on sequencing data using neural network regression
title Afann: bias adjustment for alignment-free sequence comparison based on sequencing data using neural network regression
title_full Afann: bias adjustment for alignment-free sequence comparison based on sequencing data using neural network regression
title_fullStr Afann: bias adjustment for alignment-free sequence comparison based on sequencing data using neural network regression
title_full_unstemmed Afann: bias adjustment for alignment-free sequence comparison based on sequencing data using neural network regression
title_short Afann: bias adjustment for alignment-free sequence comparison based on sequencing data using neural network regression
title_sort afann: bias adjustment for alignment-free sequence comparison based on sequencing data using neural network regression
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6891986/
https://www.ncbi.nlm.nih.gov/pubmed/31801606
http://dx.doi.org/10.1186/s13059-019-1872-3
work_keys_str_mv AT tangkujin afannbiasadjustmentforalignmentfreesequencecomparisonbasedonsequencingdatausingneuralnetworkregression
AT renjie afannbiasadjustmentforalignmentfreesequencecomparisonbasedonsequencingdatausingneuralnetworkregression
AT sunfengzhu afannbiasadjustmentforalignmentfreesequencecomparisonbasedonsequencingdatausingneuralnetworkregression