Cargando…

FastUniq: A Fast De Novo Duplicates Removal Tool for Paired Short Reads

The presence of duplicates introduced by PCR amplification is a major issue in paired short reads from next-generation sequencing platforms. These duplicates might have a serious impact on research applications, such as scaffolding in whole-genome sequencing and discovering large-scale genome variat...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Haibin, Luo, Xiang, Qian, Jun, Pang, Xiaohui, Song, Jingyuan, Qian, Guangrui, Chen, Jinhui, Chen, Shilin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3527383/
https://www.ncbi.nlm.nih.gov/pubmed/23284954
http://dx.doi.org/10.1371/journal.pone.0052249
_version_ 1782253710979629056
author Xu, Haibin
Luo, Xiang
Qian, Jun
Pang, Xiaohui
Song, Jingyuan
Qian, Guangrui
Chen, Jinhui
Chen, Shilin
author_facet Xu, Haibin
Luo, Xiang
Qian, Jun
Pang, Xiaohui
Song, Jingyuan
Qian, Guangrui
Chen, Jinhui
Chen, Shilin
author_sort Xu, Haibin
collection PubMed
description The presence of duplicates introduced by PCR amplification is a major issue in paired short reads from next-generation sequencing platforms. These duplicates might have a serious impact on research applications, such as scaffolding in whole-genome sequencing and discovering large-scale genome variations, and are usually removed. We present FastUniq as a fast de novo tool for removal of duplicates in paired short reads. FastUniq identifies duplicates by comparing sequences between read pairs and does not require complete genome sequences as prerequisites. FastUniq is capable of simultaneously handling reads with different lengths and results in highly efficient running time, which increases linearly at an average speed of 87 million reads per 10 minutes. FastUniq is freely available at http://sourceforge.net/projects/fastuniq/.
format Online
Article
Text
id pubmed-3527383
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-35273832013-01-02 FastUniq: A Fast De Novo Duplicates Removal Tool for Paired Short Reads Xu, Haibin Luo, Xiang Qian, Jun Pang, Xiaohui Song, Jingyuan Qian, Guangrui Chen, Jinhui Chen, Shilin PLoS One Research Article The presence of duplicates introduced by PCR amplification is a major issue in paired short reads from next-generation sequencing platforms. These duplicates might have a serious impact on research applications, such as scaffolding in whole-genome sequencing and discovering large-scale genome variations, and are usually removed. We present FastUniq as a fast de novo tool for removal of duplicates in paired short reads. FastUniq identifies duplicates by comparing sequences between read pairs and does not require complete genome sequences as prerequisites. FastUniq is capable of simultaneously handling reads with different lengths and results in highly efficient running time, which increases linearly at an average speed of 87 million reads per 10 minutes. FastUniq is freely available at http://sourceforge.net/projects/fastuniq/. Public Library of Science 2012-12-20 /pmc/articles/PMC3527383/ /pubmed/23284954 http://dx.doi.org/10.1371/journal.pone.0052249 Text en © 2012 Xu et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Xu, Haibin
Luo, Xiang
Qian, Jun
Pang, Xiaohui
Song, Jingyuan
Qian, Guangrui
Chen, Jinhui
Chen, Shilin
FastUniq: A Fast De Novo Duplicates Removal Tool for Paired Short Reads
title FastUniq: A Fast De Novo Duplicates Removal Tool for Paired Short Reads
title_full FastUniq: A Fast De Novo Duplicates Removal Tool for Paired Short Reads
title_fullStr FastUniq: A Fast De Novo Duplicates Removal Tool for Paired Short Reads
title_full_unstemmed FastUniq: A Fast De Novo Duplicates Removal Tool for Paired Short Reads
title_short FastUniq: A Fast De Novo Duplicates Removal Tool for Paired Short Reads
title_sort fastuniq: a fast de novo duplicates removal tool for paired short reads
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3527383/
https://www.ncbi.nlm.nih.gov/pubmed/23284954
http://dx.doi.org/10.1371/journal.pone.0052249
work_keys_str_mv AT xuhaibin fastuniqafastdenovoduplicatesremovaltoolforpairedshortreads
AT luoxiang fastuniqafastdenovoduplicatesremovaltoolforpairedshortreads
AT qianjun fastuniqafastdenovoduplicatesremovaltoolforpairedshortreads
AT pangxiaohui fastuniqafastdenovoduplicatesremovaltoolforpairedshortreads
AT songjingyuan fastuniqafastdenovoduplicatesremovaltoolforpairedshortreads
AT qianguangrui fastuniqafastdenovoduplicatesremovaltoolforpairedshortreads
AT chenjinhui fastuniqafastdenovoduplicatesremovaltoolforpairedshortreads
AT chenshilin fastuniqafastdenovoduplicatesremovaltoolforpairedshortreads