Cargando…

From next-generation sequencing alignments to accurate comparison and validation of single-nucleotide variants: the pibase software

Scientists working with single-nucleotide variants (SNVs), inferred by next-generation sequencing software, often need further information regarding true variants, artifacts and sequence coverage gaps. In clinical diagnostics, e.g. SNVs must usually be validated by visual inspection or several indep...

Descripción completa

Detalles Bibliográficos
Autores principales: Forster, Michael, Forster, Peter, Elsharawy, Abdou, Hemmrich, Georg, Kreck, Benjamin, Wittig, Michael, Thomsen, Ingo, Stade, Björn, Barann, Matthias, Ellinghaus, David, Petersen, Britt-Sabina, May, Sandra, Melum, Espen, Schilhabel, Markus B., Keller, Andreas, Schreiber, Stefan, Rosenstiel, Philip, Franke, Andre
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3592472/
https://www.ncbi.nlm.nih.gov/pubmed/22965131
http://dx.doi.org/10.1093/nar/gks836
_version_ 1782262123745771520
author Forster, Michael
Forster, Peter
Elsharawy, Abdou
Hemmrich, Georg
Kreck, Benjamin
Wittig, Michael
Thomsen, Ingo
Stade, Björn
Barann, Matthias
Ellinghaus, David
Petersen, Britt-Sabina
May, Sandra
Melum, Espen
Schilhabel, Markus B.
Keller, Andreas
Schreiber, Stefan
Rosenstiel, Philip
Franke, Andre
author_facet Forster, Michael
Forster, Peter
Elsharawy, Abdou
Hemmrich, Georg
Kreck, Benjamin
Wittig, Michael
Thomsen, Ingo
Stade, Björn
Barann, Matthias
Ellinghaus, David
Petersen, Britt-Sabina
May, Sandra
Melum, Espen
Schilhabel, Markus B.
Keller, Andreas
Schreiber, Stefan
Rosenstiel, Philip
Franke, Andre
author_sort Forster, Michael
collection PubMed
description Scientists working with single-nucleotide variants (SNVs), inferred by next-generation sequencing software, often need further information regarding true variants, artifacts and sequence coverage gaps. In clinical diagnostics, e.g. SNVs must usually be validated by visual inspection or several independent SNV-callers. We here demonstrate that 0.5–60% of relevant SNVs might not be detected due to coverage gaps, or might be misidentified. Even low error rates can overwhelm the true biological signal, especially in clinical diagnostics, in research comparing healthy with affected cells, in archaeogenetic dating or in forensics. For these reasons, we have developed a package called pibase, which is applicable to diploid and haploid genome, exome or targeted enrichment data. pibase extracts details on nucleotides from alignment files at user-specified coordinates and identifies reproducible genotypes, if present. In test cases pibase identifies genotypes at 99.98% specificity, 10-fold better than other tools. pibase also provides pair-wise comparisons between healthy and affected cells using nucleotide signals (10-fold more accurately than a genotype-based approach, as we show in our case study of monozygotic twins). This comparison tool also solves the problem of detecting allelic imbalance within heterozygous SNVs in copy number variation loci, or in heterogeneous tumor sequences.
format Online
Article
Text
id pubmed-3592472
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-35924722013-03-08 From next-generation sequencing alignments to accurate comparison and validation of single-nucleotide variants: the pibase software Forster, Michael Forster, Peter Elsharawy, Abdou Hemmrich, Georg Kreck, Benjamin Wittig, Michael Thomsen, Ingo Stade, Björn Barann, Matthias Ellinghaus, David Petersen, Britt-Sabina May, Sandra Melum, Espen Schilhabel, Markus B. Keller, Andreas Schreiber, Stefan Rosenstiel, Philip Franke, Andre Nucleic Acids Res Methods Online Scientists working with single-nucleotide variants (SNVs), inferred by next-generation sequencing software, often need further information regarding true variants, artifacts and sequence coverage gaps. In clinical diagnostics, e.g. SNVs must usually be validated by visual inspection or several independent SNV-callers. We here demonstrate that 0.5–60% of relevant SNVs might not be detected due to coverage gaps, or might be misidentified. Even low error rates can overwhelm the true biological signal, especially in clinical diagnostics, in research comparing healthy with affected cells, in archaeogenetic dating or in forensics. For these reasons, we have developed a package called pibase, which is applicable to diploid and haploid genome, exome or targeted enrichment data. pibase extracts details on nucleotides from alignment files at user-specified coordinates and identifies reproducible genotypes, if present. In test cases pibase identifies genotypes at 99.98% specificity, 10-fold better than other tools. pibase also provides pair-wise comparisons between healthy and affected cells using nucleotide signals (10-fold more accurately than a genotype-based approach, as we show in our case study of monozygotic twins). This comparison tool also solves the problem of detecting allelic imbalance within heterozygous SNVs in copy number variation loci, or in heterogeneous tumor sequences. Oxford University Press 2013-01 2012-09-08 /pmc/articles/PMC3592472/ /pubmed/22965131 http://dx.doi.org/10.1093/nar/gks836 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Forster, Michael
Forster, Peter
Elsharawy, Abdou
Hemmrich, Georg
Kreck, Benjamin
Wittig, Michael
Thomsen, Ingo
Stade, Björn
Barann, Matthias
Ellinghaus, David
Petersen, Britt-Sabina
May, Sandra
Melum, Espen
Schilhabel, Markus B.
Keller, Andreas
Schreiber, Stefan
Rosenstiel, Philip
Franke, Andre
From next-generation sequencing alignments to accurate comparison and validation of single-nucleotide variants: the pibase software
title From next-generation sequencing alignments to accurate comparison and validation of single-nucleotide variants: the pibase software
title_full From next-generation sequencing alignments to accurate comparison and validation of single-nucleotide variants: the pibase software
title_fullStr From next-generation sequencing alignments to accurate comparison and validation of single-nucleotide variants: the pibase software
title_full_unstemmed From next-generation sequencing alignments to accurate comparison and validation of single-nucleotide variants: the pibase software
title_short From next-generation sequencing alignments to accurate comparison and validation of single-nucleotide variants: the pibase software
title_sort from next-generation sequencing alignments to accurate comparison and validation of single-nucleotide variants: the pibase software
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3592472/
https://www.ncbi.nlm.nih.gov/pubmed/22965131
http://dx.doi.org/10.1093/nar/gks836
work_keys_str_mv AT forstermichael fromnextgenerationsequencingalignmentstoaccuratecomparisonandvalidationofsinglenucleotidevariantsthepibasesoftware
AT forsterpeter fromnextgenerationsequencingalignmentstoaccuratecomparisonandvalidationofsinglenucleotidevariantsthepibasesoftware
AT elsharawyabdou fromnextgenerationsequencingalignmentstoaccuratecomparisonandvalidationofsinglenucleotidevariantsthepibasesoftware
AT hemmrichgeorg fromnextgenerationsequencingalignmentstoaccuratecomparisonandvalidationofsinglenucleotidevariantsthepibasesoftware
AT kreckbenjamin fromnextgenerationsequencingalignmentstoaccuratecomparisonandvalidationofsinglenucleotidevariantsthepibasesoftware
AT wittigmichael fromnextgenerationsequencingalignmentstoaccuratecomparisonandvalidationofsinglenucleotidevariantsthepibasesoftware
AT thomseningo fromnextgenerationsequencingalignmentstoaccuratecomparisonandvalidationofsinglenucleotidevariantsthepibasesoftware
AT stadebjorn fromnextgenerationsequencingalignmentstoaccuratecomparisonandvalidationofsinglenucleotidevariantsthepibasesoftware
AT barannmatthias fromnextgenerationsequencingalignmentstoaccuratecomparisonandvalidationofsinglenucleotidevariantsthepibasesoftware
AT ellinghausdavid fromnextgenerationsequencingalignmentstoaccuratecomparisonandvalidationofsinglenucleotidevariantsthepibasesoftware
AT petersenbrittsabina fromnextgenerationsequencingalignmentstoaccuratecomparisonandvalidationofsinglenucleotidevariantsthepibasesoftware
AT maysandra fromnextgenerationsequencingalignmentstoaccuratecomparisonandvalidationofsinglenucleotidevariantsthepibasesoftware
AT melumespen fromnextgenerationsequencingalignmentstoaccuratecomparisonandvalidationofsinglenucleotidevariantsthepibasesoftware
AT schilhabelmarkusb fromnextgenerationsequencingalignmentstoaccuratecomparisonandvalidationofsinglenucleotidevariantsthepibasesoftware
AT kellerandreas fromnextgenerationsequencingalignmentstoaccuratecomparisonandvalidationofsinglenucleotidevariantsthepibasesoftware
AT schreiberstefan fromnextgenerationsequencingalignmentstoaccuratecomparisonandvalidationofsinglenucleotidevariantsthepibasesoftware
AT rosenstielphilip fromnextgenerationsequencingalignmentstoaccuratecomparisonandvalidationofsinglenucleotidevariantsthepibasesoftware
AT frankeandre fromnextgenerationsequencingalignmentstoaccuratecomparisonandvalidationofsinglenucleotidevariantsthepibasesoftware