Cargando…
BAMClipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencing
Amplicon-based next-generation sequencing (NGS) has been widely adopted for genetic variation detection in human and other organisms. Conventional data analysis paradigm includes primer trimming before read mapping. Here we introduce BAMClipper that removes primer sequences after mapping original se...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5431517/ https://www.ncbi.nlm.nih.gov/pubmed/28484262 http://dx.doi.org/10.1038/s41598-017-01703-6 |
_version_ | 1783236443098841088 |
---|---|
author | Au, Chun Hang Ho, Dona N. Kwong, Ava Chan, Tsun Leung Ma, Edmond S. K. |
author_facet | Au, Chun Hang Ho, Dona N. Kwong, Ava Chan, Tsun Leung Ma, Edmond S. K. |
author_sort | Au, Chun Hang |
collection | PubMed |
description | Amplicon-based next-generation sequencing (NGS) has been widely adopted for genetic variation detection in human and other organisms. Conventional data analysis paradigm includes primer trimming before read mapping. Here we introduce BAMClipper that removes primer sequences after mapping original sequencing reads by soft-clipping SAM/BAM alignments. Mutation detection accuracy was affected by the choice of primer handling approach based on real NGS datasets of 7 human peripheral blood or breast cancer tissue samples with known BRCA1/BRCA2 mutations and >130000 simulated NGS datasets with unique mutations. BAMClipper approach detected a BRCA1 deletion (c.1620_1636del) that was otherwise missed due to edge effect. Simulation showed high false-negative rate when primers were perfectly trimmed as in conventional practice. Among the other 6 samples, variant allele frequencies of 5 BRCA1/BRCA2 mutations (indel or single-nucleotide variants) were diluted by apparently wild-type primer sequences from an overlapping amplicon (17 to 82% under-estimation). BAMClipper was robust in both situations and all 7 mutations were detected. When compared with Cutadapt, BAMClipper was faster and maintained equally high primer removal effectiveness. BAMClipper is implemented in Perl and is available under an open source MIT license at https://github.com/tommyau/bamclipper. |
format | Online Article Text |
id | pubmed-5431517 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-54315172017-05-16 BAMClipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencing Au, Chun Hang Ho, Dona N. Kwong, Ava Chan, Tsun Leung Ma, Edmond S. K. Sci Rep Article Amplicon-based next-generation sequencing (NGS) has been widely adopted for genetic variation detection in human and other organisms. Conventional data analysis paradigm includes primer trimming before read mapping. Here we introduce BAMClipper that removes primer sequences after mapping original sequencing reads by soft-clipping SAM/BAM alignments. Mutation detection accuracy was affected by the choice of primer handling approach based on real NGS datasets of 7 human peripheral blood or breast cancer tissue samples with known BRCA1/BRCA2 mutations and >130000 simulated NGS datasets with unique mutations. BAMClipper approach detected a BRCA1 deletion (c.1620_1636del) that was otherwise missed due to edge effect. Simulation showed high false-negative rate when primers were perfectly trimmed as in conventional practice. Among the other 6 samples, variant allele frequencies of 5 BRCA1/BRCA2 mutations (indel or single-nucleotide variants) were diluted by apparently wild-type primer sequences from an overlapping amplicon (17 to 82% under-estimation). BAMClipper was robust in both situations and all 7 mutations were detected. When compared with Cutadapt, BAMClipper was faster and maintained equally high primer removal effectiveness. BAMClipper is implemented in Perl and is available under an open source MIT license at https://github.com/tommyau/bamclipper. Nature Publishing Group UK 2017-05-08 /pmc/articles/PMC5431517/ /pubmed/28484262 http://dx.doi.org/10.1038/s41598-017-01703-6 Text en © The Author(s) 2017 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Au, Chun Hang Ho, Dona N. Kwong, Ava Chan, Tsun Leung Ma, Edmond S. K. BAMClipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencing |
title | BAMClipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencing |
title_full | BAMClipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencing |
title_fullStr | BAMClipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencing |
title_full_unstemmed | BAMClipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencing |
title_short | BAMClipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencing |
title_sort | bamclipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencing |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5431517/ https://www.ncbi.nlm.nih.gov/pubmed/28484262 http://dx.doi.org/10.1038/s41598-017-01703-6 |
work_keys_str_mv | AT auchunhang bamclipperremovingprimersfromalignmentstominimizefalsenegativemutationsinampliconnextgenerationsequencing AT hodonan bamclipperremovingprimersfromalignmentstominimizefalsenegativemutationsinampliconnextgenerationsequencing AT kwongava bamclipperremovingprimersfromalignmentstominimizefalsenegativemutationsinampliconnextgenerationsequencing AT chantsunleung bamclipperremovingprimersfromalignmentstominimizefalsenegativemutationsinampliconnextgenerationsequencing AT maedmondsk bamclipperremovingprimersfromalignmentstominimizefalsenegativemutationsinampliconnextgenerationsequencing |