Cargando…

Bioinformatic data processing pipelines in support of next‐generation sequencing‐based HIV drug resistance testing: the Winnipeg Consensus

INTRODUCTION: Next‐generation sequencing (NGS) has several advantages over conventional Sanger sequencing for HIV drug resistance (HIVDR) genotyping, including detection and quantitation of low‐abundance variants bearing drug resistance mutations (DRMs). However, the high HIV genomic diversity, unpr...

Descripción completa

Detalles Bibliográficos
Autores principales: Ji, Hezhao, Enns, Eric, Brumme, Chanson J., Parkin, Neil, Howison, Mark, Lee, Emma R., Capina, Rupert, Marinier, Eric, Avila‐Rios, Santiago, Sandstrom, Paul, Van Domselaar, Gary, Harrigan, Richard, Paredes, Roger, Kantor, Rami, Noguera‐Julian, Marc
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6198166/
https://www.ncbi.nlm.nih.gov/pubmed/30350345
http://dx.doi.org/10.1002/jia2.25193
_version_ 1783364910697152512
author Ji, Hezhao
Enns, Eric
Brumme, Chanson J.
Parkin, Neil
Howison, Mark
Lee, Emma R.
Capina, Rupert
Marinier, Eric
Avila‐Rios, Santiago
Sandstrom, Paul
Van Domselaar, Gary
Harrigan, Richard
Paredes, Roger
Kantor, Rami
Noguera‐Julian, Marc
author_facet Ji, Hezhao
Enns, Eric
Brumme, Chanson J.
Parkin, Neil
Howison, Mark
Lee, Emma R.
Capina, Rupert
Marinier, Eric
Avila‐Rios, Santiago
Sandstrom, Paul
Van Domselaar, Gary
Harrigan, Richard
Paredes, Roger
Kantor, Rami
Noguera‐Julian, Marc
author_sort Ji, Hezhao
collection PubMed
description INTRODUCTION: Next‐generation sequencing (NGS) has several advantages over conventional Sanger sequencing for HIV drug resistance (HIVDR) genotyping, including detection and quantitation of low‐abundance variants bearing drug resistance mutations (DRMs). However, the high HIV genomic diversity, unprecedented large volume of data, complexity of analysis and potential for error pose significant challenges for data processing. Several NGS analysis pipelines have been developed and used in HIVDR research; however, the absence of uniformity in data processing strategies results in lack of consistency and comparability of outputs from different pipelines. To fill this gap, an international symposium on bioinformatic strategies for NGS‐based HIVDR testing was held in February 2018 in Winnipeg, Canada, convening laboratory scientists, bioinformaticians and clinicians involved in four recently developed, publicly available NGS HIVDR pipelines. The goal of this symposium was to establish a consensus on effective bioinformatic strategies for NGS data management and its use for HIVDR reporting. DISCUSSION: Essential functionalities of an NGS HIVDR pipeline were divided into five analytic blocks: (1) NGS read quality control (QC)/quality assurance (QA); (2) NGS read alignment and reference mapping; (3) HIV variant calling and variant QC; (4) NGS HIVDR reporting; and (5) extended data applications and additional considerations for data management. The consensuses reached among the participants on all major aspects of these blocks are summarized here. They encompass not only recommended data management and analysis strategies, but also detailed bioinformatic approaches that help ensure accuracy of the derived HIVDR analysis outputs for both research and potential clinical use. CONCLUSIONS: While NGS is being adopted more broadly in HIVDR testing laboratories, data processing is often a bottleneck hindering its generalized application. The proposed standardization of NGS read QC/QA, read alignment and reference mapping, variant calling and QC, HIVDR reporting and relevant data management strategies in this “Winnipeg Consensus” may serve as a starting guideline for NGS HIVDR data processing that informs the refinement of existing pipelines and those yet to be developed. Moreover, the bioinformatic strategies presented here may apply more broadly to NGS data analysis of microbes harbouring significant genomic diversity.
format Online
Article
Text
id pubmed-6198166
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-61981662018-10-31 Bioinformatic data processing pipelines in support of next‐generation sequencing‐based HIV drug resistance testing: the Winnipeg Consensus Ji, Hezhao Enns, Eric Brumme, Chanson J. Parkin, Neil Howison, Mark Lee, Emma R. Capina, Rupert Marinier, Eric Avila‐Rios, Santiago Sandstrom, Paul Van Domselaar, Gary Harrigan, Richard Paredes, Roger Kantor, Rami Noguera‐Julian, Marc J Int AIDS Soc Commentary INTRODUCTION: Next‐generation sequencing (NGS) has several advantages over conventional Sanger sequencing for HIV drug resistance (HIVDR) genotyping, including detection and quantitation of low‐abundance variants bearing drug resistance mutations (DRMs). However, the high HIV genomic diversity, unprecedented large volume of data, complexity of analysis and potential for error pose significant challenges for data processing. Several NGS analysis pipelines have been developed and used in HIVDR research; however, the absence of uniformity in data processing strategies results in lack of consistency and comparability of outputs from different pipelines. To fill this gap, an international symposium on bioinformatic strategies for NGS‐based HIVDR testing was held in February 2018 in Winnipeg, Canada, convening laboratory scientists, bioinformaticians and clinicians involved in four recently developed, publicly available NGS HIVDR pipelines. The goal of this symposium was to establish a consensus on effective bioinformatic strategies for NGS data management and its use for HIVDR reporting. DISCUSSION: Essential functionalities of an NGS HIVDR pipeline were divided into five analytic blocks: (1) NGS read quality control (QC)/quality assurance (QA); (2) NGS read alignment and reference mapping; (3) HIV variant calling and variant QC; (4) NGS HIVDR reporting; and (5) extended data applications and additional considerations for data management. The consensuses reached among the participants on all major aspects of these blocks are summarized here. They encompass not only recommended data management and analysis strategies, but also detailed bioinformatic approaches that help ensure accuracy of the derived HIVDR analysis outputs for both research and potential clinical use. CONCLUSIONS: While NGS is being adopted more broadly in HIVDR testing laboratories, data processing is often a bottleneck hindering its generalized application. The proposed standardization of NGS read QC/QA, read alignment and reference mapping, variant calling and QC, HIVDR reporting and relevant data management strategies in this “Winnipeg Consensus” may serve as a starting guideline for NGS HIVDR data processing that informs the refinement of existing pipelines and those yet to be developed. Moreover, the bioinformatic strategies presented here may apply more broadly to NGS data analysis of microbes harbouring significant genomic diversity. John Wiley and Sons Inc. 2018-10-22 /pmc/articles/PMC6198166/ /pubmed/30350345 http://dx.doi.org/10.1002/jia2.25193 Text en © 2018 The Authors and Her Majesty the Queen in Right of Canada. Journal of the International AIDS Society published by John Wiley & Sons Ltd on behalf of the International AIDS Society. This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Commentary
Ji, Hezhao
Enns, Eric
Brumme, Chanson J.
Parkin, Neil
Howison, Mark
Lee, Emma R.
Capina, Rupert
Marinier, Eric
Avila‐Rios, Santiago
Sandstrom, Paul
Van Domselaar, Gary
Harrigan, Richard
Paredes, Roger
Kantor, Rami
Noguera‐Julian, Marc
Bioinformatic data processing pipelines in support of next‐generation sequencing‐based HIV drug resistance testing: the Winnipeg Consensus
title Bioinformatic data processing pipelines in support of next‐generation sequencing‐based HIV drug resistance testing: the Winnipeg Consensus
title_full Bioinformatic data processing pipelines in support of next‐generation sequencing‐based HIV drug resistance testing: the Winnipeg Consensus
title_fullStr Bioinformatic data processing pipelines in support of next‐generation sequencing‐based HIV drug resistance testing: the Winnipeg Consensus
title_full_unstemmed Bioinformatic data processing pipelines in support of next‐generation sequencing‐based HIV drug resistance testing: the Winnipeg Consensus
title_short Bioinformatic data processing pipelines in support of next‐generation sequencing‐based HIV drug resistance testing: the Winnipeg Consensus
title_sort bioinformatic data processing pipelines in support of next‐generation sequencing‐based hiv drug resistance testing: the winnipeg consensus
topic Commentary
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6198166/
https://www.ncbi.nlm.nih.gov/pubmed/30350345
http://dx.doi.org/10.1002/jia2.25193
work_keys_str_mv AT jihezhao bioinformaticdataprocessingpipelinesinsupportofnextgenerationsequencingbasedhivdrugresistancetestingthewinnipegconsensus
AT ennseric bioinformaticdataprocessingpipelinesinsupportofnextgenerationsequencingbasedhivdrugresistancetestingthewinnipegconsensus
AT brummechansonj bioinformaticdataprocessingpipelinesinsupportofnextgenerationsequencingbasedhivdrugresistancetestingthewinnipegconsensus
AT parkinneil bioinformaticdataprocessingpipelinesinsupportofnextgenerationsequencingbasedhivdrugresistancetestingthewinnipegconsensus
AT howisonmark bioinformaticdataprocessingpipelinesinsupportofnextgenerationsequencingbasedhivdrugresistancetestingthewinnipegconsensus
AT leeemmar bioinformaticdataprocessingpipelinesinsupportofnextgenerationsequencingbasedhivdrugresistancetestingthewinnipegconsensus
AT capinarupert bioinformaticdataprocessingpipelinesinsupportofnextgenerationsequencingbasedhivdrugresistancetestingthewinnipegconsensus
AT mariniereric bioinformaticdataprocessingpipelinesinsupportofnextgenerationsequencingbasedhivdrugresistancetestingthewinnipegconsensus
AT avilariossantiago bioinformaticdataprocessingpipelinesinsupportofnextgenerationsequencingbasedhivdrugresistancetestingthewinnipegconsensus
AT sandstrompaul bioinformaticdataprocessingpipelinesinsupportofnextgenerationsequencingbasedhivdrugresistancetestingthewinnipegconsensus
AT vandomselaargary bioinformaticdataprocessingpipelinesinsupportofnextgenerationsequencingbasedhivdrugresistancetestingthewinnipegconsensus
AT harriganrichard bioinformaticdataprocessingpipelinesinsupportofnextgenerationsequencingbasedhivdrugresistancetestingthewinnipegconsensus
AT paredesroger bioinformaticdataprocessingpipelinesinsupportofnextgenerationsequencingbasedhivdrugresistancetestingthewinnipegconsensus
AT kantorrami bioinformaticdataprocessingpipelinesinsupportofnextgenerationsequencingbasedhivdrugresistancetestingthewinnipegconsensus
AT noguerajulianmarc bioinformaticdataprocessingpipelinesinsupportofnextgenerationsequencingbasedhivdrugresistancetestingthewinnipegconsensus