Cargando…

Rapid and sensitive detection of genome contamination at scale with FCS-GX

Assembled genome sequences are being generated at an exponential rate. Here we present FCS-GX, part of NCBI’s Foreign Contamination Screen (FCS) tool suite, optimized to identify and remove contaminant sequences in new genomes. FCS-GX screens most genomes in 0.1–10 minutes. Testing FCS-GX on artific...

Descripción completa

Detalles Bibliográficos
Autores principales: Astashyn, Alexander, Tvedte, Eric S., Sweeney, Deacon, Sapojnikov, Victor, Bouk, Nathan, Joukov, Victor, Mozes, Eyal, Strope, Pooja K., Sylla, Pape M., Wagner, Lukas, Bidwell, Shelby L., Clark, Karen, Davis, Emily W., Smith-White, Brian, Hlavina, Wratko, Pruitt, Kim D., Schneider, Valerie A., Murphy, Terence D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10246020/
https://www.ncbi.nlm.nih.gov/pubmed/37292984
http://dx.doi.org/10.1101/2023.06.02.543519
_version_ 1785054962807472128
author Astashyn, Alexander
Tvedte, Eric S.
Sweeney, Deacon
Sapojnikov, Victor
Bouk, Nathan
Joukov, Victor
Mozes, Eyal
Strope, Pooja K.
Sylla, Pape M.
Wagner, Lukas
Bidwell, Shelby L.
Clark, Karen
Davis, Emily W.
Smith-White, Brian
Hlavina, Wratko
Pruitt, Kim D.
Schneider, Valerie A.
Murphy, Terence D.
author_facet Astashyn, Alexander
Tvedte, Eric S.
Sweeney, Deacon
Sapojnikov, Victor
Bouk, Nathan
Joukov, Victor
Mozes, Eyal
Strope, Pooja K.
Sylla, Pape M.
Wagner, Lukas
Bidwell, Shelby L.
Clark, Karen
Davis, Emily W.
Smith-White, Brian
Hlavina, Wratko
Pruitt, Kim D.
Schneider, Valerie A.
Murphy, Terence D.
author_sort Astashyn, Alexander
collection PubMed
description Assembled genome sequences are being generated at an exponential rate. Here we present FCS-GX, part of NCBI’s Foreign Contamination Screen (FCS) tool suite, optimized to identify and remove contaminant sequences in new genomes. FCS-GX screens most genomes in 0.1–10 minutes. Testing FCS-GX on artificially fragmented genomes demonstrates sensitivity >95% for diverse contaminant species and specificity >99.93%. We used FCS-GX to screen 1.6 million GenBank assemblies and identified 36.8 Gbp of contamination (0.16% of total bases), with half from 161 assemblies. We updated assemblies in NCBI RefSeq to reduce detected contamination to 0.01% of bases. FCS-GX is available at https://github.com/ncbi/fcs/.
format Online
Article
Text
id pubmed-10246020
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-102460202023-06-08 Rapid and sensitive detection of genome contamination at scale with FCS-GX Astashyn, Alexander Tvedte, Eric S. Sweeney, Deacon Sapojnikov, Victor Bouk, Nathan Joukov, Victor Mozes, Eyal Strope, Pooja K. Sylla, Pape M. Wagner, Lukas Bidwell, Shelby L. Clark, Karen Davis, Emily W. Smith-White, Brian Hlavina, Wratko Pruitt, Kim D. Schneider, Valerie A. Murphy, Terence D. bioRxiv Article Assembled genome sequences are being generated at an exponential rate. Here we present FCS-GX, part of NCBI’s Foreign Contamination Screen (FCS) tool suite, optimized to identify and remove contaminant sequences in new genomes. FCS-GX screens most genomes in 0.1–10 minutes. Testing FCS-GX on artificially fragmented genomes demonstrates sensitivity >95% for diverse contaminant species and specificity >99.93%. We used FCS-GX to screen 1.6 million GenBank assemblies and identified 36.8 Gbp of contamination (0.16% of total bases), with half from 161 assemblies. We updated assemblies in NCBI RefSeq to reduce detected contamination to 0.01% of bases. FCS-GX is available at https://github.com/ncbi/fcs/. Cold Spring Harbor Laboratory 2023-06-06 /pmc/articles/PMC10246020/ /pubmed/37292984 http://dx.doi.org/10.1101/2023.06.02.543519 Text en https://creativecommons.org/publicdomain/zero/1.0/This article is a US Government work. It is not subject to copyright under 17 USC 105 and is also made available for use under a CC0 license (https://creativecommons.org/publicdomain/zero/1.0/) .
spellingShingle Article
Astashyn, Alexander
Tvedte, Eric S.
Sweeney, Deacon
Sapojnikov, Victor
Bouk, Nathan
Joukov, Victor
Mozes, Eyal
Strope, Pooja K.
Sylla, Pape M.
Wagner, Lukas
Bidwell, Shelby L.
Clark, Karen
Davis, Emily W.
Smith-White, Brian
Hlavina, Wratko
Pruitt, Kim D.
Schneider, Valerie A.
Murphy, Terence D.
Rapid and sensitive detection of genome contamination at scale with FCS-GX
title Rapid and sensitive detection of genome contamination at scale with FCS-GX
title_full Rapid and sensitive detection of genome contamination at scale with FCS-GX
title_fullStr Rapid and sensitive detection of genome contamination at scale with FCS-GX
title_full_unstemmed Rapid and sensitive detection of genome contamination at scale with FCS-GX
title_short Rapid and sensitive detection of genome contamination at scale with FCS-GX
title_sort rapid and sensitive detection of genome contamination at scale with fcs-gx
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10246020/
https://www.ncbi.nlm.nih.gov/pubmed/37292984
http://dx.doi.org/10.1101/2023.06.02.543519
work_keys_str_mv AT astashynalexander rapidandsensitivedetectionofgenomecontaminationatscalewithfcsgx
AT tvedteerics rapidandsensitivedetectionofgenomecontaminationatscalewithfcsgx
AT sweeneydeacon rapidandsensitivedetectionofgenomecontaminationatscalewithfcsgx
AT sapojnikovvictor rapidandsensitivedetectionofgenomecontaminationatscalewithfcsgx
AT bouknathan rapidandsensitivedetectionofgenomecontaminationatscalewithfcsgx
AT joukovvictor rapidandsensitivedetectionofgenomecontaminationatscalewithfcsgx
AT mozeseyal rapidandsensitivedetectionofgenomecontaminationatscalewithfcsgx
AT stropepoojak rapidandsensitivedetectionofgenomecontaminationatscalewithfcsgx
AT syllapapem rapidandsensitivedetectionofgenomecontaminationatscalewithfcsgx
AT wagnerlukas rapidandsensitivedetectionofgenomecontaminationatscalewithfcsgx
AT bidwellshelbyl rapidandsensitivedetectionofgenomecontaminationatscalewithfcsgx
AT clarkkaren rapidandsensitivedetectionofgenomecontaminationatscalewithfcsgx
AT davisemilyw rapidandsensitivedetectionofgenomecontaminationatscalewithfcsgx
AT smithwhitebrian rapidandsensitivedetectionofgenomecontaminationatscalewithfcsgx
AT hlavinawratko rapidandsensitivedetectionofgenomecontaminationatscalewithfcsgx
AT pruittkimd rapidandsensitivedetectionofgenomecontaminationatscalewithfcsgx
AT schneidervaleriea rapidandsensitivedetectionofgenomecontaminationatscalewithfcsgx
AT murphyterenced rapidandsensitivedetectionofgenomecontaminationatscalewithfcsgx