Cargando…

Novel bioinformatics quality control metric for next-generation sequencing experiments in the clinical context

As the use of next-generation sequencing (NGS) for the Mendelian diseases diagnosis is expanding, the performance of this method has to be improved in order to achieve higher quality. Typically, performance measures are considered to be designed in the context of each application and, therefore, acc...

Descripción completa

Detalles Bibliográficos
Autores principales: Ivanov, Maxim, Ivanov, Mikhail, Kasianov, Artem, Rozhavskaya, Ekaterina, Musienko, Sergey, Baranova, Ancha, Mileyko, Vladislav
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6868350/
https://www.ncbi.nlm.nih.gov/pubmed/31511888
http://dx.doi.org/10.1093/nar/gkz775
_version_ 1783472240694657024
author Ivanov, Maxim
Ivanov, Mikhail
Kasianov, Artem
Rozhavskaya, Ekaterina
Musienko, Sergey
Baranova, Ancha
Mileyko, Vladislav
author_facet Ivanov, Maxim
Ivanov, Mikhail
Kasianov, Artem
Rozhavskaya, Ekaterina
Musienko, Sergey
Baranova, Ancha
Mileyko, Vladislav
author_sort Ivanov, Maxim
collection PubMed
description As the use of next-generation sequencing (NGS) for the Mendelian diseases diagnosis is expanding, the performance of this method has to be improved in order to achieve higher quality. Typically, performance measures are considered to be designed in the context of each application and, therefore, account for a spectrum of clinically relevant variants. We present EphaGen, a new computational methodology for bioinformatics quality control (QC). Given a single NGS dataset in BAM format and a pre-compiled VCF-file of targeted clinically relevant variants it associates this dataset with a single arbiter parameter. Intrinsically, EphaGen estimates the probability to miss any variant from the defined spectrum within a particular NGS dataset. Such performance measure virtually resembles the diagnostic sensitivity of given NGS dataset. Here we present case studies of the use of EphaGen in context of BRCA1/2 and CFTR sequencing in a series of 14 runs across 43 blood samples and 504 publically available NGS datasets. EphaGen is superior to conventional bioinformatics metrics such as coverage depth and coverage uniformity. We recommend using this software as a QC step in NGS studies in the clinical context. Availability: https://github.com/m4merg/EphaGen or https://hub.docker.com/r/m4merg/ephagen.
format Online
Article
Text
id pubmed-6868350
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-68683502019-11-27 Novel bioinformatics quality control metric for next-generation sequencing experiments in the clinical context Ivanov, Maxim Ivanov, Mikhail Kasianov, Artem Rozhavskaya, Ekaterina Musienko, Sergey Baranova, Ancha Mileyko, Vladislav Nucleic Acids Res Methods Online As the use of next-generation sequencing (NGS) for the Mendelian diseases diagnosis is expanding, the performance of this method has to be improved in order to achieve higher quality. Typically, performance measures are considered to be designed in the context of each application and, therefore, account for a spectrum of clinically relevant variants. We present EphaGen, a new computational methodology for bioinformatics quality control (QC). Given a single NGS dataset in BAM format and a pre-compiled VCF-file of targeted clinically relevant variants it associates this dataset with a single arbiter parameter. Intrinsically, EphaGen estimates the probability to miss any variant from the defined spectrum within a particular NGS dataset. Such performance measure virtually resembles the diagnostic sensitivity of given NGS dataset. Here we present case studies of the use of EphaGen in context of BRCA1/2 and CFTR sequencing in a series of 14 runs across 43 blood samples and 504 publically available NGS datasets. EphaGen is superior to conventional bioinformatics metrics such as coverage depth and coverage uniformity. We recommend using this software as a QC step in NGS studies in the clinical context. Availability: https://github.com/m4merg/EphaGen or https://hub.docker.com/r/m4merg/ephagen. Oxford University Press 2019-12-02 2019-09-12 /pmc/articles/PMC6868350/ /pubmed/31511888 http://dx.doi.org/10.1093/nar/gkz775 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Ivanov, Maxim
Ivanov, Mikhail
Kasianov, Artem
Rozhavskaya, Ekaterina
Musienko, Sergey
Baranova, Ancha
Mileyko, Vladislav
Novel bioinformatics quality control metric for next-generation sequencing experiments in the clinical context
title Novel bioinformatics quality control metric for next-generation sequencing experiments in the clinical context
title_full Novel bioinformatics quality control metric for next-generation sequencing experiments in the clinical context
title_fullStr Novel bioinformatics quality control metric for next-generation sequencing experiments in the clinical context
title_full_unstemmed Novel bioinformatics quality control metric for next-generation sequencing experiments in the clinical context
title_short Novel bioinformatics quality control metric for next-generation sequencing experiments in the clinical context
title_sort novel bioinformatics quality control metric for next-generation sequencing experiments in the clinical context
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6868350/
https://www.ncbi.nlm.nih.gov/pubmed/31511888
http://dx.doi.org/10.1093/nar/gkz775
work_keys_str_mv AT ivanovmaxim novelbioinformaticsqualitycontrolmetricfornextgenerationsequencingexperimentsintheclinicalcontext
AT ivanovmikhail novelbioinformaticsqualitycontrolmetricfornextgenerationsequencingexperimentsintheclinicalcontext
AT kasianovartem novelbioinformaticsqualitycontrolmetricfornextgenerationsequencingexperimentsintheclinicalcontext
AT rozhavskayaekaterina novelbioinformaticsqualitycontrolmetricfornextgenerationsequencingexperimentsintheclinicalcontext
AT musienkosergey novelbioinformaticsqualitycontrolmetricfornextgenerationsequencingexperimentsintheclinicalcontext
AT baranovaancha novelbioinformaticsqualitycontrolmetricfornextgenerationsequencingexperimentsintheclinicalcontext
AT mileykovladislav novelbioinformaticsqualitycontrolmetricfornextgenerationsequencingexperimentsintheclinicalcontext