Cargando…
Comparative analyses of error handling strategies for next-generation sequencing in precision medicine
Next-generation sequencing (NGS) offers the opportunity to sequence millions and billions of DNA sequences in a short period, leading to novel applications in personalized medicine, such as cancer diagnostics or antiviral therapy. Nevertheless, sequencing technologies have different error rates, whi...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7113248/ https://www.ncbi.nlm.nih.gov/pubmed/32238883 http://dx.doi.org/10.1038/s41598-020-62675-8 |
_version_ | 1783513631084773376 |
---|---|
author | F. Löchel, Hannah Heider, Dominik |
author_facet | F. Löchel, Hannah Heider, Dominik |
author_sort | F. Löchel, Hannah |
collection | PubMed |
description | Next-generation sequencing (NGS) offers the opportunity to sequence millions and billions of DNA sequences in a short period, leading to novel applications in personalized medicine, such as cancer diagnostics or antiviral therapy. Nevertheless, sequencing technologies have different error rates, which occur during the sequencing process. If the NGS data is used for diagnostics, these sequences with errors are typically neglected or a worst-case scenario is assumed. In the current study, we focused on the impact of ambiguous bases on therapy recommendations for Human Immunodeficiency Virus 1 (HIV-1) patients. Concretely, we analyzed the treatment recommendation with entry blockers based on prediction models for co-receptor tropism. We compared three different error handling strategies that have been used in the literature, namely (i) neglection, (ii) worst-case assumption, and (iii) deconvolution with a majority vote. We could show that for two or more ambiguous positions per sequence a reliable prediction is generally no longer possible. Moreover, also the position of ambiguity plays a crucial role. Thus, we analyzed the error probability distributions of existing sequencing technologies, e.g., Illumina MiSeq or PacBio, with respect to the aforementioned error handling strategies and it turned out that neglection outperforms the other strategies in the case where no systematic errors are present. In other cases, the deconvolution strategy with the majority vote should be preferred. |
format | Online Article Text |
id | pubmed-7113248 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-71132482020-04-06 Comparative analyses of error handling strategies for next-generation sequencing in precision medicine F. Löchel, Hannah Heider, Dominik Sci Rep Article Next-generation sequencing (NGS) offers the opportunity to sequence millions and billions of DNA sequences in a short period, leading to novel applications in personalized medicine, such as cancer diagnostics or antiviral therapy. Nevertheless, sequencing technologies have different error rates, which occur during the sequencing process. If the NGS data is used for diagnostics, these sequences with errors are typically neglected or a worst-case scenario is assumed. In the current study, we focused on the impact of ambiguous bases on therapy recommendations for Human Immunodeficiency Virus 1 (HIV-1) patients. Concretely, we analyzed the treatment recommendation with entry blockers based on prediction models for co-receptor tropism. We compared three different error handling strategies that have been used in the literature, namely (i) neglection, (ii) worst-case assumption, and (iii) deconvolution with a majority vote. We could show that for two or more ambiguous positions per sequence a reliable prediction is generally no longer possible. Moreover, also the position of ambiguity plays a crucial role. Thus, we analyzed the error probability distributions of existing sequencing technologies, e.g., Illumina MiSeq or PacBio, with respect to the aforementioned error handling strategies and it turned out that neglection outperforms the other strategies in the case where no systematic errors are present. In other cases, the deconvolution strategy with the majority vote should be preferred. Nature Publishing Group UK 2020-04-01 /pmc/articles/PMC7113248/ /pubmed/32238883 http://dx.doi.org/10.1038/s41598-020-62675-8 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article F. Löchel, Hannah Heider, Dominik Comparative analyses of error handling strategies for next-generation sequencing in precision medicine |
title | Comparative analyses of error handling strategies for next-generation sequencing in precision medicine |
title_full | Comparative analyses of error handling strategies for next-generation sequencing in precision medicine |
title_fullStr | Comparative analyses of error handling strategies for next-generation sequencing in precision medicine |
title_full_unstemmed | Comparative analyses of error handling strategies for next-generation sequencing in precision medicine |
title_short | Comparative analyses of error handling strategies for next-generation sequencing in precision medicine |
title_sort | comparative analyses of error handling strategies for next-generation sequencing in precision medicine |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7113248/ https://www.ncbi.nlm.nih.gov/pubmed/32238883 http://dx.doi.org/10.1038/s41598-020-62675-8 |
work_keys_str_mv | AT flochelhannah comparativeanalysesoferrorhandlingstrategiesfornextgenerationsequencinginprecisionmedicine AT heiderdominik comparativeanalysesoferrorhandlingstrategiesfornextgenerationsequencinginprecisionmedicine |