Cargando…
BAMixChecker: an automated checkup tool for matched sample pairs in NGS cohort
SUMMARY: Mislabeling in the process of next generation sequencing is a frequent problem that can cause an entire genomic analysis to fail, and a regular cohort-level checkup is needed to ensure that it has not occurred. We developed a new, automated tool (BAMixChecker) that accurately detects sample...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6853765/ https://www.ncbi.nlm.nih.gov/pubmed/31197312 http://dx.doi.org/10.1093/bioinformatics/btz479 |
_version_ | 1783470095549333504 |
---|---|
author | Chun, Hein Kim, Sangwoo |
author_facet | Chun, Hein Kim, Sangwoo |
author_sort | Chun, Hein |
collection | PubMed |
description | SUMMARY: Mislabeling in the process of next generation sequencing is a frequent problem that can cause an entire genomic analysis to fail, and a regular cohort-level checkup is needed to ensure that it has not occurred. We developed a new, automated tool (BAMixChecker) that accurately detects sample mismatches from a given BAM file cohort with minimal user intervention. BAMixChecker uses a flexible, data-specific set of single-nucleotide polymorphisms and detects orphan (unpaired) and swapped (mispaired) samples based on genotype-concordance score and entropy-based file name analysis. BAMixChecker shows ∼100% accuracy in real WES, RNA-Seq and targeted sequencing data cohorts, even for small panels (<50 genes). BAMixChecker provides an HTML-style report that graphically outlines the sample matching status in tables and heatmaps, with which users can quickly inspect any mismatch events. AVAILABILITY AND IMPLEMENTATION: BAMixChecker is available at https://github.com/heinc1010/BAMixChecker SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-6853765 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-68537652019-11-19 BAMixChecker: an automated checkup tool for matched sample pairs in NGS cohort Chun, Hein Kim, Sangwoo Bioinformatics Applications Notes SUMMARY: Mislabeling in the process of next generation sequencing is a frequent problem that can cause an entire genomic analysis to fail, and a regular cohort-level checkup is needed to ensure that it has not occurred. We developed a new, automated tool (BAMixChecker) that accurately detects sample mismatches from a given BAM file cohort with minimal user intervention. BAMixChecker uses a flexible, data-specific set of single-nucleotide polymorphisms and detects orphan (unpaired) and swapped (mispaired) samples based on genotype-concordance score and entropy-based file name analysis. BAMixChecker shows ∼100% accuracy in real WES, RNA-Seq and targeted sequencing data cohorts, even for small panels (<50 genes). BAMixChecker provides an HTML-style report that graphically outlines the sample matching status in tables and heatmaps, with which users can quickly inspect any mismatch events. AVAILABILITY AND IMPLEMENTATION: BAMixChecker is available at https://github.com/heinc1010/BAMixChecker SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2019-11-15 2019-06-14 /pmc/articles/PMC6853765/ /pubmed/31197312 http://dx.doi.org/10.1093/bioinformatics/btz479 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Applications Notes Chun, Hein Kim, Sangwoo BAMixChecker: an automated checkup tool for matched sample pairs in NGS cohort |
title | BAMixChecker: an automated checkup tool for matched sample pairs in NGS cohort |
title_full | BAMixChecker: an automated checkup tool for matched sample pairs in NGS cohort |
title_fullStr | BAMixChecker: an automated checkup tool for matched sample pairs in NGS cohort |
title_full_unstemmed | BAMixChecker: an automated checkup tool for matched sample pairs in NGS cohort |
title_short | BAMixChecker: an automated checkup tool for matched sample pairs in NGS cohort |
title_sort | bamixchecker: an automated checkup tool for matched sample pairs in ngs cohort |
topic | Applications Notes |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6853765/ https://www.ncbi.nlm.nih.gov/pubmed/31197312 http://dx.doi.org/10.1093/bioinformatics/btz479 |
work_keys_str_mv | AT chunhein bamixcheckeranautomatedcheckuptoolformatchedsamplepairsinngscohort AT kimsangwoo bamixcheckeranautomatedcheckuptoolformatchedsamplepairsinngscohort |