Rapid and sensitive detection of genome contamination at scale with FCS-GX
Assembled genome sequences are being generated at an exponential rate. Here we present FCS-GX, part of NCBI’s Foreign Contamination Screen (FCS) tool suite, optimized to identify and remove contaminant sequences in new genomes. FCS-GX screens most genomes in 0.1–10 minutes. Testing FCS-GX on artific...
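Main Authors: | Astashyn, Alexander; Tvedte, Eric S.; Sweeney, Deacon; Sapojnikov, Victor; Bouk, Nathan; Joukov, Victor; Mozes, Eyal; Strope, Pooja K.; Sylla, Pape M.; Wagner, Lukas; Bidwell, Shelby L.; Clark, Karen; Davis, Emily W.; Smith-White, Brian; Hlavina, Wratko; Pruitt, Kim D.; Schneider, Valerie A.; Murphy, Terence D. |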
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Cold Spring Harbor Laboratory 2023 |
Subjects: | |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10246020/ https://www.ncbi.nlm.nih.gov/pubmed/37292984 http://dx.doi.org/10.1101/2023.06.02.543519 |
_version_ | 1785054962807472128 |
---|---|
author | Astashyn, Alexander Tvedte, Eric S. Sweeney, Deacon Sapojnikov, Victor Bouk, Nathan Joukov, Victor Mozes, Eyal Strope, Pooja K. Sylla, Pape M. Wagner, Lukas Bidwell, Shelby L. Clark, Karen Davis, Emily W. Smith-White, Brian Hlavina, Wratko Pruitt, Kim D. Schneider, Valerie A. Murphy, Terence D. |
author_facet | Astashyn, Alexander Tvedte, Eric S. Sweeney, Deacon Sapojnikov, Victor Bouk, Nathan Joukov, Victor Mozes, Eyal Strope, Pooja K. Sylla, Pape M. Wagner, Lukas Bidwell, Shelby L. Clark, Karen Davis, Emily W. Smith-White, Brian Hlavina, Wratko Pruitt, Kim D. Schneider, Valerie A. Murphy, Terence D. |
author_sort | Astashyn, Alexander |
collection | PubMed |
description | Assembled genome sequences are being generated at an exponential rate. Here we present FCS-GX, part of NCBI’s Foreign Contamination Screen (FCS) tool suite, optimized to identify and remove contaminant sequences in new genomes. FCS-GX screens most genomes in 0.1–10 minutes. Testing FCS-GX on artificially fragmented genomes demonstrates sensitivity >95% for diverse contaminant species and specificity >99.93%. We used FCS-GX to screen 1.6 million GenBank assemblies and identified 36.8 Gbp of contamination (0.16% of total bases), with half from 161 assemblies. We updated assemblies in NCBI RefSeq to reduce detected contamination to 0.01% of bases. FCS-GX is available at https://github.com/ncbi/fcs/. |
format | Online Article Text |
id | pubmed-10246020 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-10246020 2023-06-08 Rapid and sensitive detection of genome contamination at scale with FCS-GX Astashyn, Alexander Tvedte, Eric S. Sweeney, Deacon Sapojnikov, Victor Bouk, Nathan Joukov, Victor Mozes, Eyal Strope, Pooja K. Sylla, Pape M. Wagner, Lukas Bidwell, Shelby L. Clark, Karen Davis, Emily W. Smith-White, Brian Hlavina, Wratko Pruitt, Kim D. Schneider, Valerie A. Murphy, Terence D. bioRxiv Article Assembled genome sequences are being generated at an exponential rate. Here we present FCS-GX, part of NCBI’s Foreign Contamination Screen (FCS) tool suite, optimized to identify and remove contaminant sequences in new genomes. FCS-GX screens most genomes in 0.1–10 minutes. Testing FCS-GX on artificially fragmented genomes demonstrates sensitivity >95% for diverse contaminant species and specificity >99.93%. We used FCS-GX to screen 1.6 million GenBank assemblies and identified 36.8 Gbp of contamination (0.16% of total bases), with half from 161 assemblies. We updated assemblies in NCBI RefSeq to reduce detected contamination to 0.01% of bases. FCS-GX is available at https://github.com/ncbi/fcs/. Cold Spring Harbor Laboratory 2023-06-06 /pmc/articles/PMC10246020/ /pubmed/37292984 http://dx.doi.org/10.1101/2023.06.02.543519 Text en https://creativecommons.org/publicdomain/zero/1.0/ This article is a US Government work. It is not subject to copyright under 17 USC 105 and is also made available for use under a CC0 license (https://creativecommons.org/publicdomain/zero/1.0/). |
spellingShingle | Article Astashyn, Alexander Tvedte, Eric S. Sweeney, Deacon Sapojnikov, Victor Bouk, Nathan Joukov, Victor Mozes, Eyal Strope, Pooja K. Sylla, Pape M. Wagner, Lukas Bidwell, Shelby L. Clark, Karen Davis, Emily W. Smith-White, Brian Hlavina, Wratko Pruitt, Kim D. Schneider, Valerie A. Murphy, Terence D. Rapid and sensitive detection of genome contamination at scale with FCS-GX |
title | Rapid and sensitive detection of genome contamination at scale with FCS-GX |
title_full | Rapid and sensitive detection of genome contamination at scale with FCS-GX |
title_fullStr | Rapid and sensitive detection of genome contamination at scale with FCS-GX |
title_full_unstemmed | Rapid and sensitive detection of genome contamination at scale with FCS-GX |
title_short | Rapid and sensitive detection of genome contamination at scale with FCS-GX |
title_sort | rapid and sensitive detection of genome contamination at scale with fcs-gx |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10246020/ https://www.ncbi.nlm.nih.gov/pubmed/37292984 http://dx.doi.org/10.1101/2023.06.02.543519 |
work_keys_str_mv | AT astashynalexander rapidandsensitivedetectionofgenomecontaminationatscalewithfcsgx AT tvedteerics rapidandsensitivedetectionofgenomecontaminationatscalewithfcsgx AT sweeneydeacon rapidandsensitivedetectionofgenomecontaminationatscalewithfcsgx AT sapojnikovvictor rapidandsensitivedetectionofgenomecontaminationatscalewithfcsgx AT bouknathan rapidandsensitivedetectionofgenomecontaminationatscalewithfcsgx AT joukovvictor rapidandsensitivedetectionofgenomecontaminationatscalewithfcsgx AT mozeseyal rapidandsensitivedetectionofgenomecontaminationatscalewithfcsgx AT stropepoojak rapidandsensitivedetectionofgenomecontaminationatscalewithfcsgx AT syllapapem rapidandsensitivedetectionofgenomecontaminationatscalewithfcsgx AT wagnerlukas rapidandsensitivedetectionofgenomecontaminationatscalewithfcsgx AT bidwellshelbyl rapidandsensitivedetectionofgenomecontaminationatscalewithfcsgx AT clarkkaren rapidandsensitivedetectionofgenomecontaminationatscalewithfcsgx AT davisemilyw rapidandsensitivedetectionofgenomecontaminationatscalewithfcsgx AT smithwhitebrian rapidandsensitivedetectionofgenomecontaminationatscalewithfcsgx AT hlavinawratko rapidandsensitivedetectionofgenomecontaminationatscalewithfcsgx AT pruittkimd rapidandsensitivedetectionofgenomecontaminationatscalewithfcsgx AT schneidervaleriea rapidandsensitivedetectionofgenomecontaminationatscalewithfcsgx AT murphyterenced rapidandsensitivedetectionofgenomecontaminationatscalewithfcsgx |