Cargando…

BAMixChecker: an automated checkup tool for matched sample pairs in NGS cohort

SUMMARY: Mislabeling in the process of next generation sequencing is a frequent problem that can cause an entire genomic analysis to fail, and a regular cohort-level checkup is needed to ensure that it has not occurred. We developed a new, automated tool (BAMixChecker) that accurately detects sample...

Descripción completa

Detalles Bibliográficos
Autores principales: Chun, Hein, Kim, Sangwoo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6853765/
https://www.ncbi.nlm.nih.gov/pubmed/31197312
http://dx.doi.org/10.1093/bioinformatics/btz479
_version_ 1783470095549333504
author Chun, Hein
Kim, Sangwoo
author_facet Chun, Hein
Kim, Sangwoo
author_sort Chun, Hein
collection PubMed
description SUMMARY: Mislabeling in the process of next generation sequencing is a frequent problem that can cause an entire genomic analysis to fail, and a regular cohort-level checkup is needed to ensure that it has not occurred. We developed a new, automated tool (BAMixChecker) that accurately detects sample mismatches from a given BAM file cohort with minimal user intervention. BAMixChecker uses a flexible, data-specific set of single-nucleotide polymorphisms and detects orphan (unpaired) and swapped (mispaired) samples based on genotype-concordance score and entropy-based file name analysis. BAMixChecker shows ∼100% accuracy in real WES, RNA-Seq and targeted sequencing data cohorts, even for small panels (<50 genes). BAMixChecker provides an HTML-style report that graphically outlines the sample matching status in tables and heatmaps, with which users can quickly inspect any mismatch events. AVAILABILITY AND IMPLEMENTATION: BAMixChecker is available at https://github.com/heinc1010/BAMixChecker SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-6853765
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-68537652019-11-19 BAMixChecker: an automated checkup tool for matched sample pairs in NGS cohort Chun, Hein Kim, Sangwoo Bioinformatics Applications Notes SUMMARY: Mislabeling in the process of next generation sequencing is a frequent problem that can cause an entire genomic analysis to fail, and a regular cohort-level checkup is needed to ensure that it has not occurred. We developed a new, automated tool (BAMixChecker) that accurately detects sample mismatches from a given BAM file cohort with minimal user intervention. BAMixChecker uses a flexible, data-specific set of single-nucleotide polymorphisms and detects orphan (unpaired) and swapped (mispaired) samples based on genotype-concordance score and entropy-based file name analysis. BAMixChecker shows ∼100% accuracy in real WES, RNA-Seq and targeted sequencing data cohorts, even for small panels (<50 genes). BAMixChecker provides an HTML-style report that graphically outlines the sample matching status in tables and heatmaps, with which users can quickly inspect any mismatch events. AVAILABILITY AND IMPLEMENTATION: BAMixChecker is available at https://github.com/heinc1010/BAMixChecker SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2019-11-15 2019-06-14 /pmc/articles/PMC6853765/ /pubmed/31197312 http://dx.doi.org/10.1093/bioinformatics/btz479 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Applications Notes
Chun, Hein
Kim, Sangwoo
BAMixChecker: an automated checkup tool for matched sample pairs in NGS cohort
title BAMixChecker: an automated checkup tool for matched sample pairs in NGS cohort
title_full BAMixChecker: an automated checkup tool for matched sample pairs in NGS cohort
title_fullStr BAMixChecker: an automated checkup tool for matched sample pairs in NGS cohort
title_full_unstemmed BAMixChecker: an automated checkup tool for matched sample pairs in NGS cohort
title_short BAMixChecker: an automated checkup tool for matched sample pairs in NGS cohort
title_sort bamixchecker: an automated checkup tool for matched sample pairs in ngs cohort
topic Applications Notes
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6853765/
https://www.ncbi.nlm.nih.gov/pubmed/31197312
http://dx.doi.org/10.1093/bioinformatics/btz479
work_keys_str_mv AT chunhein bamixcheckeranautomatedcheckuptoolformatchedsamplepairsinngscohort
AT kimsangwoo bamixcheckeranautomatedcheckuptoolformatchedsamplepairsinngscohort