
Detailed evaluation of cancer sequencing pipelines in different microenvironments and heterogeneity levels

The importance of next-generation sequencing (NGS) in cancer research is rising as this key technology becomes more accessible to researchers. The sequence data generated by NGS technologies must be processed by various bioinformatics algorithms within a pipeline in order to convert raw data into meaningf...


Bibliographic Details
Main authors: Kısakol, Batuhan; Sarıhan, Şahin; Ergün, Mehmet Arif; Baysan, Mehmet
Format: Online Article Text
Language: English
Published: The Scientific and Technological Research Council of Turkey 2021
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8068765/
https://www.ncbi.nlm.nih.gov/pubmed/33907494
http://dx.doi.org/10.3906/biy-2008-8
author Kısakol, Batuhan
Sarıhan, Şahin
Ergün, Mehmet Arif
Baysan, Mehmet
author_sort Kısakol, Batuhan
collection PubMed
description The importance of next-generation sequencing (NGS) in cancer research is rising as this key technology becomes more accessible to researchers. The sequence data generated by NGS technologies must be processed by various bioinformatics algorithms within a pipeline to convert raw data into meaningful information. Mapping and variant calling are the two main steps of these analysis pipelines, and many algorithms are available for each step. Therefore, detailed benchmarking of these algorithms in different scenarios is crucial for the efficient utilization of sequencing technologies. In this study, we compared the performance of twelve pipelines (three mapping and four variant discovery algorithms) with recommended settings for capturing single nucleotide variants. We observed significant discrepancies in variant calls among the tested pipelines at different heterogeneity levels in real and simulated samples, with overall high specificity and low sensitivity. In addition to evaluating the pipelines individually, we constructed pipeline combinations and tested their performance. In these analyses, we observed that certain pipelines complement each other much better than others and outperform individual pipelines. This suggests that adhering to a single pipeline is not optimal for cancer sequencing analysis and that sample heterogeneity should be considered in algorithm optimization.
format Online
Article
Text
id pubmed-8068765
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher The Scientific and Technological Research Council of Turkey
record_format MEDLINE/PubMed
spelling pubmed-8068765 2021-04-26 Detailed evaluation of cancer sequencing pipelines in different microenvironments and heterogeneity levels. Kısakol, Batuhan; Sarıhan, Şahin; Ergün, Mehmet Arif; Baysan, Mehmet. Turk J Biol. Article.
The Scientific and Technological Research Council of Turkey 2021-04-20 /pmc/articles/PMC8068765/ /pubmed/33907494 http://dx.doi.org/10.3906/biy-2008-8 Text en Copyright © 2021 The Author(s) https://creativecommons.org/licenses/by/4.0/ This article is distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use and redistribution provided that the original author and source are credited.
title Detailed evaluation of cancer sequencing pipelines in different microenvironments and heterogeneity levels
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8068765/
https://www.ncbi.nlm.nih.gov/pubmed/33907494
http://dx.doi.org/10.3906/biy-2008-8
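
Illustrative note: the abstract describes twelve pipelines built from three mappers and four variant callers, scored by sensitivity and specificity, and then combined. The Python sketch below shows one way such an evaluation could be set up. It is not the authors' code: the tool names are placeholders (this record does not name the algorithms), the variant sets are invented toy data rather than results from the study, and a simple union is only one possible combination rule.

```python
from itertools import product

# Placeholder names; this record does not say which tools were benchmarked.
MAPPERS = ["mapper_1", "mapper_2", "mapper_3"]
CALLERS = ["caller_1", "caller_2", "caller_3", "caller_4"]


def enumerate_pipelines(mappers, callers):
    """Treat every mapper/caller pairing as one pipeline."""
    return [f"{m}+{c}" for m, c in product(mappers, callers)]


def sensitivity(called, truth):
    """TP / (TP + FN): share of true variants that were called."""
    return len(called & truth) / len(truth) if truth else 0.0


def specificity(called, truth, candidates):
    """TN / (TN + FP): share of non-variant candidate sites left uncalled."""
    negatives = candidates - truth
    false_positives = called & negatives
    if not negatives:
        return 0.0
    return (len(negatives) - len(false_positives)) / len(negatives)


def combine_union(*call_sets):
    """Assumed combination rule: accept a variant if any pipeline calls it."""
    return set().union(*call_sets)


def site(pos):
    """Toy single nucleotide variant keyed as (chrom, pos, ref, alt)."""
    return ("chr1", pos, "A", "G")


# Invented toy data: 20 candidate sites, 4 of which are true variants.
candidates = {site(p) for p in range(1, 21)}
truth = {site(p) for p in (2, 5, 8, 11)}
calls_a = {site(p) for p in (2, 5, 14)}    # hypothetical pipeline A
calls_b = {site(p) for p in (8, 11, 17)}   # hypothetical pipeline B
calls_ab = combine_union(calls_a, calls_b)

if __name__ == "__main__":
    print(len(enumerate_pipelines(MAPPERS, CALLERS)), "pipelines")
    for name, calls in [("A", calls_a), ("B", calls_b), ("A|B", calls_ab)]:
        print(name, sensitivity(calls, truth),
              specificity(calls, truth, candidates))
```

In this toy case the union recovers variants that each pipeline misses on its own, at the cost of accumulating both pipelines' false positives. That is the kind of trade-off a combination study would have to measure.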