Cargando…

BAMClipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencing

Amplicon-based next-generation sequencing (NGS) has been widely adopted for genetic variation detection in human and other organisms. Conventional data analysis paradigm includes primer trimming before read mapping. Here we introduce BAMClipper that removes primer sequences after mapping original se...

Descripción completa

Detalles Bibliográficos
Autores principales: Au, Chun Hang, Ho, Dona N., Kwong, Ava, Chan, Tsun Leung, Ma, Edmond S. K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5431517/
https://www.ncbi.nlm.nih.gov/pubmed/28484262
http://dx.doi.org/10.1038/s41598-017-01703-6
_version_ 1783236443098841088
author Au, Chun Hang
Ho, Dona N.
Kwong, Ava
Chan, Tsun Leung
Ma, Edmond S. K.
author_facet Au, Chun Hang
Ho, Dona N.
Kwong, Ava
Chan, Tsun Leung
Ma, Edmond S. K.
author_sort Au, Chun Hang
collection PubMed
description Amplicon-based next-generation sequencing (NGS) has been widely adopted for genetic variation detection in human and other organisms. Conventional data analysis paradigm includes primer trimming before read mapping. Here we introduce BAMClipper that removes primer sequences after mapping original sequencing reads by soft-clipping SAM/BAM alignments. Mutation detection accuracy was affected by the choice of primer handling approach based on real NGS datasets of 7 human peripheral blood or breast cancer tissue samples with known BRCA1/BRCA2 mutations and >130000 simulated NGS datasets with unique mutations. BAMClipper approach detected a BRCA1 deletion (c.1620_1636del) that was otherwise missed due to edge effect. Simulation showed high false-negative rate when primers were perfectly trimmed as in conventional practice. Among the other 6 samples, variant allele frequencies of 5 BRCA1/BRCA2 mutations (indel or single-nucleotide variants) were diluted by apparently wild-type primer sequences from an overlapping amplicon (17 to 82% under-estimation). BAMClipper was robust in both situations and all 7 mutations were detected. When compared with Cutadapt, BAMClipper was faster and maintained equally high primer removal effectiveness. BAMClipper is implemented in Perl and is available under an open source MIT license at https://github.com/tommyau/bamclipper.
format Online
Article
Text
id pubmed-5431517
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-54315172017-05-16 BAMClipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencing Au, Chun Hang Ho, Dona N. Kwong, Ava Chan, Tsun Leung Ma, Edmond S. K. Sci Rep Article Amplicon-based next-generation sequencing (NGS) has been widely adopted for genetic variation detection in human and other organisms. Conventional data analysis paradigm includes primer trimming before read mapping. Here we introduce BAMClipper that removes primer sequences after mapping original sequencing reads by soft-clipping SAM/BAM alignments. Mutation detection accuracy was affected by the choice of primer handling approach based on real NGS datasets of 7 human peripheral blood or breast cancer tissue samples with known BRCA1/BRCA2 mutations and >130000 simulated NGS datasets with unique mutations. BAMClipper approach detected a BRCA1 deletion (c.1620_1636del) that was otherwise missed due to edge effect. Simulation showed high false-negative rate when primers were perfectly trimmed as in conventional practice. Among the other 6 samples, variant allele frequencies of 5 BRCA1/BRCA2 mutations (indel or single-nucleotide variants) were diluted by apparently wild-type primer sequences from an overlapping amplicon (17 to 82% under-estimation). BAMClipper was robust in both situations and all 7 mutations were detected. When compared with Cutadapt, BAMClipper was faster and maintained equally high primer removal effectiveness. BAMClipper is implemented in Perl and is available under an open source MIT license at https://github.com/tommyau/bamclipper. Nature Publishing Group UK 2017-05-08 /pmc/articles/PMC5431517/ /pubmed/28484262 http://dx.doi.org/10.1038/s41598-017-01703-6 Text en © The Author(s) 2017 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Au, Chun Hang
Ho, Dona N.
Kwong, Ava
Chan, Tsun Leung
Ma, Edmond S. K.
BAMClipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencing
title BAMClipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencing
title_full BAMClipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencing
title_fullStr BAMClipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencing
title_full_unstemmed BAMClipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencing
title_short BAMClipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencing
title_sort bamclipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencing
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5431517/
https://www.ncbi.nlm.nih.gov/pubmed/28484262
http://dx.doi.org/10.1038/s41598-017-01703-6
work_keys_str_mv AT auchunhang bamclipperremovingprimersfromalignmentstominimizefalsenegativemutationsinampliconnextgenerationsequencing
AT hodonan bamclipperremovingprimersfromalignmentstominimizefalsenegativemutationsinampliconnextgenerationsequencing
AT kwongava bamclipperremovingprimersfromalignmentstominimizefalsenegativemutationsinampliconnextgenerationsequencing
AT chantsunleung bamclipperremovingprimersfromalignmentstominimizefalsenegativemutationsinampliconnextgenerationsequencing
AT maedmondsk bamclipperremovingprimersfromalignmentstominimizefalsenegativemutationsinampliconnextgenerationsequencing