Cargando…

SimFFPE and FilterFFPE: improving structural variant calling in FFPE samples

BACKGROUND: Artifact chimeric reads are enriched in next-generation sequencing data generated from formalin-fixed paraffin-embedded (FFPE) samples. Previous work indicated that these reads are characterized by erroneous split-read support that is interpreted as evidence of structural variants. Thus,...

Descripción completa

Detalles Bibliográficos
Autores principales: Wei, Lanying, Dugas, Martin, Sandmann, Sarah
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8458033/
https://www.ncbi.nlm.nih.gov/pubmed/34553214
http://dx.doi.org/10.1093/gigascience/giab065
_version_ 1784571235532800000
author Wei, Lanying
Dugas, Martin
Sandmann, Sarah
author_facet Wei, Lanying
Dugas, Martin
Sandmann, Sarah
author_sort Wei, Lanying
collection PubMed
description BACKGROUND: Artifact chimeric reads are enriched in next-generation sequencing data generated from formalin-fixed paraffin-embedded (FFPE) samples. Previous work indicated that these reads are characterized by erroneous split-read support that is interpreted as evidence of structural variants. Thus, a large number of false-positive structural variants are detected. To our knowledge, no tool is currently available to specifically call or filter structural variants in FFPE samples. To overcome this gap, we developed 2 R packages: SimFFPE and FilterFFPE. RESULTS: SimFFPE is a read simulator, specifically designed for next-generation sequencing data from FFPE samples. A mixture of characteristic artifact chimeric reads, as well as normal reads, is generated. FilterFFPE is a filtration algorithm, removing artifact chimeric reads from sequencing data while keeping real chimeric reads. To evaluate the performance of FilterFFPE, we performed structural variant calling with 3 common tools (Delly, Lumpy, and Manta) with and without prior filtration with FilterFFPE. After applying FilterFFPE, the mean positive predictive value improved from 0.27 to 0.48 in simulated samples and from 0.11 to 0.27 in real samples, while sensitivity remained basically unchanged or even slightly increased. CONCLUSIONS: FilterFFPE improves the performance of SV calling in FFPE samples. It was validated by analysis of simulated and real data.
format Online
Article
Text
id pubmed-8458033
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-84580332021-09-23 SimFFPE and FilterFFPE: improving structural variant calling in FFPE samples Wei, Lanying Dugas, Martin Sandmann, Sarah Gigascience Technical Note BACKGROUND: Artifact chimeric reads are enriched in next-generation sequencing data generated from formalin-fixed paraffin-embedded (FFPE) samples. Previous work indicated that these reads are characterized by erroneous split-read support that is interpreted as evidence of structural variants. Thus, a large number of false-positive structural variants are detected. To our knowledge, no tool is currently available to specifically call or filter structural variants in FFPE samples. To overcome this gap, we developed 2 R packages: SimFFPE and FilterFFPE. RESULTS: SimFFPE is a read simulator, specifically designed for next-generation sequencing data from FFPE samples. A mixture of characteristic artifact chimeric reads, as well as normal reads, is generated. FilterFFPE is a filtration algorithm, removing artifact chimeric reads from sequencing data while keeping real chimeric reads. To evaluate the performance of FilterFFPE, we performed structural variant calling with 3 common tools (Delly, Lumpy, and Manta) with and without prior filtration with FilterFFPE. After applying FilterFFPE, the mean positive predictive value improved from 0.27 to 0.48 in simulated samples and from 0.11 to 0.27 in real samples, while sensitivity remained basically unchanged or even slightly increased. CONCLUSIONS: FilterFFPE improves the performance of SV calling in FFPE samples. It was validated by analysis of simulated and real data. Oxford University Press 2021-09-22 /pmc/articles/PMC8458033/ /pubmed/34553214 http://dx.doi.org/10.1093/gigascience/giab065 Text en © The Author(s) 2021. Published by Oxford University Press GigaScience. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Technical Note
Wei, Lanying
Dugas, Martin
Sandmann, Sarah
SimFFPE and FilterFFPE: improving structural variant calling in FFPE samples
title SimFFPE and FilterFFPE: improving structural variant calling in FFPE samples
title_full SimFFPE and FilterFFPE: improving structural variant calling in FFPE samples
title_fullStr SimFFPE and FilterFFPE: improving structural variant calling in FFPE samples
title_full_unstemmed SimFFPE and FilterFFPE: improving structural variant calling in FFPE samples
title_short SimFFPE and FilterFFPE: improving structural variant calling in FFPE samples
title_sort simffpe and filterffpe: improving structural variant calling in ffpe samples
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8458033/
https://www.ncbi.nlm.nih.gov/pubmed/34553214
http://dx.doi.org/10.1093/gigascience/giab065
work_keys_str_mv AT weilanying simffpeandfilterffpeimprovingstructuralvariantcallinginffpesamples
AT dugasmartin simffpeandfilterffpeimprovingstructuralvariantcallinginffpesamples
AT sandmannsarah simffpeandfilterffpeimprovingstructuralvariantcallinginffpesamples