Cargando…

The Quality Sequencing Minimum (QSM): providing comprehensive, consistent, transparent next generation sequencing  data quality assurance

Next generation sequencing (NGS) is routinely used in clinical genetic testing. Quality management of NGS testing is essential to ensure performance is consistently and rigorously evaluated. Three primary metrics are used in NGS quality evaluation: depth of coverage, base quality and mapping quality...

Descripción completa

Detalles Bibliográficos
Autores principales: Mahamdallie, Shazia, Ruark, Elise, Yost, Shawn, Münz, Márton, Renwick, Anthony, Poyastro-Pearson, Emma, Strydom, Ann, Seal, Sheila, Rahman, Nazneen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: F1000 Research Limited 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6020721/
https://www.ncbi.nlm.nih.gov/pubmed/29992192
http://dx.doi.org/10.12688/wellcomeopenres.14307.1
_version_ 1783335353777651712
author Mahamdallie, Shazia
Ruark, Elise
Yost, Shawn
Münz, Márton
Renwick, Anthony
Poyastro-Pearson, Emma
Strydom, Ann
Seal, Sheila
Rahman, Nazneen
author_facet Mahamdallie, Shazia
Ruark, Elise
Yost, Shawn
Münz, Márton
Renwick, Anthony
Poyastro-Pearson, Emma
Strydom, Ann
Seal, Sheila
Rahman, Nazneen
author_sort Mahamdallie, Shazia
collection PubMed
description Next generation sequencing (NGS) is routinely used in clinical genetic testing. Quality management of NGS testing is essential to ensure performance is consistently and rigorously evaluated. Three primary metrics are used in NGS quality evaluation: depth of coverage, base quality and mapping quality. To provide consistency and transparency in the utilisation of these metrics we present the Quality Sequencing Minimum (QSM). The QSM defines the minimum quality requirement a laboratory has selected for depth of coverage (C), base quality (B) and mapping quality (M) and can be applied per base, exon, gene or other genomic region, as appropriate. The QSM format is CX_BY(P (Y))_MZ(P (Z)). X is the parameter threshold for C, Y the parameter threshold for B, P (Y) the percentage of reads that must reach Y, Z the parameter threshold for M, P (Z) the percentage of reads that must reach Z. The data underlying the QSM is in the BAM file, so a QSM can be easily and automatically calculated in any NGS pipeline. We used the QSM to optimise cancer predisposition gene testing using the TruSight Cancer Panel (TSCP). We set the QSM as C50_B10(85)_M20(95). Test regions falling below the QSM were automatically flagged for review, with 100/1471 test regions QSM-flagged in multiple individuals. Supplementing these regions with 132 additional probes improved performance in 85/100. We also used the QSM to optimise testing of genes with pseudogenes such as PTEN and PMS2. In TSCP data from 960 individuals the median number of regions that passed QSM per sample was 1429 (97%).  Importantly, the QSM can be used at an individual report level to provide succinct, comprehensive quality assurance information about individual test performance. We believe many laboratories would find the QSM useful. Furthermore, widespread adoption of the QSM would facilitate consistent, transparent reporting of genetic test performance by different laboratories.
format Online
Article
Text
id pubmed-6020721
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher F1000 Research Limited
record_format MEDLINE/PubMed
spelling pubmed-60207212018-07-09 The Quality Sequencing Minimum (QSM): providing comprehensive, consistent, transparent next generation sequencing  data quality assurance Mahamdallie, Shazia Ruark, Elise Yost, Shawn Münz, Márton Renwick, Anthony Poyastro-Pearson, Emma Strydom, Ann Seal, Sheila Rahman, Nazneen Wellcome Open Res Method Article Next generation sequencing (NGS) is routinely used in clinical genetic testing. Quality management of NGS testing is essential to ensure performance is consistently and rigorously evaluated. Three primary metrics are used in NGS quality evaluation: depth of coverage, base quality and mapping quality. To provide consistency and transparency in the utilisation of these metrics we present the Quality Sequencing Minimum (QSM). The QSM defines the minimum quality requirement a laboratory has selected for depth of coverage (C), base quality (B) and mapping quality (M) and can be applied per base, exon, gene or other genomic region, as appropriate. The QSM format is CX_BY(P (Y))_MZ(P (Z)). X is the parameter threshold for C, Y the parameter threshold for B, P (Y) the percentage of reads that must reach Y, Z the parameter threshold for M, P (Z) the percentage of reads that must reach Z. The data underlying the QSM is in the BAM file, so a QSM can be easily and automatically calculated in any NGS pipeline. We used the QSM to optimise cancer predisposition gene testing using the TruSight Cancer Panel (TSCP). We set the QSM as C50_B10(85)_M20(95). Test regions falling below the QSM were automatically flagged for review, with 100/1471 test regions QSM-flagged in multiple individuals. Supplementing these regions with 132 additional probes improved performance in 85/100. We also used the QSM to optimise testing of genes with pseudogenes such as PTEN and PMS2. In TSCP data from 960 individuals the median number of regions that passed QSM per sample was 1429 (97%).  Importantly, the QSM can be used at an individual report level to provide succinct, comprehensive quality assurance information about individual test performance. We believe many laboratories would find the QSM useful. Furthermore, widespread adoption of the QSM would facilitate consistent, transparent reporting of genetic test performance by different laboratories. F1000 Research Limited 2018-04-04 /pmc/articles/PMC6020721/ /pubmed/29992192 http://dx.doi.org/10.12688/wellcomeopenres.14307.1 Text en Copyright: © 2018 Mahamdallie S et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Method Article
Mahamdallie, Shazia
Ruark, Elise
Yost, Shawn
Münz, Márton
Renwick, Anthony
Poyastro-Pearson, Emma
Strydom, Ann
Seal, Sheila
Rahman, Nazneen
The Quality Sequencing Minimum (QSM): providing comprehensive, consistent, transparent next generation sequencing  data quality assurance
title The Quality Sequencing Minimum (QSM): providing comprehensive, consistent, transparent next generation sequencing  data quality assurance
title_full The Quality Sequencing Minimum (QSM): providing comprehensive, consistent, transparent next generation sequencing  data quality assurance
title_fullStr The Quality Sequencing Minimum (QSM): providing comprehensive, consistent, transparent next generation sequencing  data quality assurance
title_full_unstemmed The Quality Sequencing Minimum (QSM): providing comprehensive, consistent, transparent next generation sequencing  data quality assurance
title_short The Quality Sequencing Minimum (QSM): providing comprehensive, consistent, transparent next generation sequencing  data quality assurance
title_sort quality sequencing minimum (qsm): providing comprehensive, consistent, transparent next generation sequencing  data quality assurance
topic Method Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6020721/
https://www.ncbi.nlm.nih.gov/pubmed/29992192
http://dx.doi.org/10.12688/wellcomeopenres.14307.1
work_keys_str_mv AT mahamdallieshazia thequalitysequencingminimumqsmprovidingcomprehensiveconsistenttransparentnextgenerationsequencingdataqualityassurance
AT ruarkelise thequalitysequencingminimumqsmprovidingcomprehensiveconsistenttransparentnextgenerationsequencingdataqualityassurance
AT yostshawn thequalitysequencingminimumqsmprovidingcomprehensiveconsistenttransparentnextgenerationsequencingdataqualityassurance
AT munzmarton thequalitysequencingminimumqsmprovidingcomprehensiveconsistenttransparentnextgenerationsequencingdataqualityassurance
AT renwickanthony thequalitysequencingminimumqsmprovidingcomprehensiveconsistenttransparentnextgenerationsequencingdataqualityassurance
AT poyastropearsonemma thequalitysequencingminimumqsmprovidingcomprehensiveconsistenttransparentnextgenerationsequencingdataqualityassurance
AT strydomann thequalitysequencingminimumqsmprovidingcomprehensiveconsistenttransparentnextgenerationsequencingdataqualityassurance
AT sealsheila thequalitysequencingminimumqsmprovidingcomprehensiveconsistenttransparentnextgenerationsequencingdataqualityassurance
AT rahmannazneen thequalitysequencingminimumqsmprovidingcomprehensiveconsistenttransparentnextgenerationsequencingdataqualityassurance
AT mahamdallieshazia qualitysequencingminimumqsmprovidingcomprehensiveconsistenttransparentnextgenerationsequencingdataqualityassurance
AT ruarkelise qualitysequencingminimumqsmprovidingcomprehensiveconsistenttransparentnextgenerationsequencingdataqualityassurance
AT yostshawn qualitysequencingminimumqsmprovidingcomprehensiveconsistenttransparentnextgenerationsequencingdataqualityassurance
AT munzmarton qualitysequencingminimumqsmprovidingcomprehensiveconsistenttransparentnextgenerationsequencingdataqualityassurance
AT renwickanthony qualitysequencingminimumqsmprovidingcomprehensiveconsistenttransparentnextgenerationsequencingdataqualityassurance
AT poyastropearsonemma qualitysequencingminimumqsmprovidingcomprehensiveconsistenttransparentnextgenerationsequencingdataqualityassurance
AT strydomann qualitysequencingminimumqsmprovidingcomprehensiveconsistenttransparentnextgenerationsequencingdataqualityassurance
AT sealsheila qualitysequencingminimumqsmprovidingcomprehensiveconsistenttransparentnextgenerationsequencingdataqualityassurance
AT rahmannazneen qualitysequencingminimumqsmprovidingcomprehensiveconsistenttransparentnextgenerationsequencingdataqualityassurance