Cargando…
FastUniq: A Fast De Novo Duplicates Removal Tool for Paired Short Reads
The presence of duplicates introduced by PCR amplification is a major issue in paired short reads from next-generation sequencing platforms. These duplicates might have a serious impact on research applications, such as scaffolding in whole-genome sequencing and discovering large-scale genome variat...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3527383/ https://www.ncbi.nlm.nih.gov/pubmed/23284954 http://dx.doi.org/10.1371/journal.pone.0052249 |
_version_ | 1782253710979629056 |
---|---|
author | Xu, Haibin Luo, Xiang Qian, Jun Pang, Xiaohui Song, Jingyuan Qian, Guangrui Chen, Jinhui Chen, Shilin |
author_facet | Xu, Haibin Luo, Xiang Qian, Jun Pang, Xiaohui Song, Jingyuan Qian, Guangrui Chen, Jinhui Chen, Shilin |
author_sort | Xu, Haibin |
collection | PubMed |
description | The presence of duplicates introduced by PCR amplification is a major issue in paired short reads from next-generation sequencing platforms. These duplicates might have a serious impact on research applications, such as scaffolding in whole-genome sequencing and discovering large-scale genome variations, and are usually removed. We present FastUniq as a fast de novo tool for removal of duplicates in paired short reads. FastUniq identifies duplicates by comparing sequences between read pairs and does not require complete genome sequences as prerequisites. FastUniq is capable of simultaneously handling reads with different lengths and results in highly efficient running time, which increases linearly at an average speed of 87 million reads per 10 minutes. FastUniq is freely available at http://sourceforge.net/projects/fastuniq/. |
format | Online Article Text |
id | pubmed-3527383 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-35273832013-01-02 FastUniq: A Fast De Novo Duplicates Removal Tool for Paired Short Reads Xu, Haibin Luo, Xiang Qian, Jun Pang, Xiaohui Song, Jingyuan Qian, Guangrui Chen, Jinhui Chen, Shilin PLoS One Research Article The presence of duplicates introduced by PCR amplification is a major issue in paired short reads from next-generation sequencing platforms. These duplicates might have a serious impact on research applications, such as scaffolding in whole-genome sequencing and discovering large-scale genome variations, and are usually removed. We present FastUniq as a fast de novo tool for removal of duplicates in paired short reads. FastUniq identifies duplicates by comparing sequences between read pairs and does not require complete genome sequences as prerequisites. FastUniq is capable of simultaneously handling reads with different lengths and results in highly efficient running time, which increases linearly at an average speed of 87 million reads per 10 minutes. FastUniq is freely available at http://sourceforge.net/projects/fastuniq/. Public Library of Science 2012-12-20 /pmc/articles/PMC3527383/ /pubmed/23284954 http://dx.doi.org/10.1371/journal.pone.0052249 Text en © 2012 Xu et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Xu, Haibin Luo, Xiang Qian, Jun Pang, Xiaohui Song, Jingyuan Qian, Guangrui Chen, Jinhui Chen, Shilin FastUniq: A Fast De Novo Duplicates Removal Tool for Paired Short Reads |
title | FastUniq: A Fast De Novo Duplicates Removal Tool for Paired Short Reads |
title_full | FastUniq: A Fast De Novo Duplicates Removal Tool for Paired Short Reads |
title_fullStr | FastUniq: A Fast De Novo Duplicates Removal Tool for Paired Short Reads |
title_full_unstemmed | FastUniq: A Fast De Novo Duplicates Removal Tool for Paired Short Reads |
title_short | FastUniq: A Fast De Novo Duplicates Removal Tool for Paired Short Reads |
title_sort | fastuniq: a fast de novo duplicates removal tool for paired short reads |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3527383/ https://www.ncbi.nlm.nih.gov/pubmed/23284954 http://dx.doi.org/10.1371/journal.pone.0052249 |
work_keys_str_mv | AT xuhaibin fastuniqafastdenovoduplicatesremovaltoolforpairedshortreads AT luoxiang fastuniqafastdenovoduplicatesremovaltoolforpairedshortreads AT qianjun fastuniqafastdenovoduplicatesremovaltoolforpairedshortreads AT pangxiaohui fastuniqafastdenovoduplicatesremovaltoolforpairedshortreads AT songjingyuan fastuniqafastdenovoduplicatesremovaltoolforpairedshortreads AT qianguangrui fastuniqafastdenovoduplicatesremovaltoolforpairedshortreads AT chenjinhui fastuniqafastdenovoduplicatesremovaltoolforpairedshortreads AT chenshilin fastuniqafastdenovoduplicatesremovaltoolforpairedshortreads |