Cargando…
LAPIS is a fast web API for massive open virus sequencing data
BACKGROUND: Recent epidemic outbreaks such as the SARS-CoV-2 pandemic and the mpox outbreak in 2022 have demonstrated the value of genomic sequencing data for tracking the origin and spread of pathogens. Laboratories around the globe generated new sequences at unprecedented speed and volume and bioi...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10240112/ https://www.ncbi.nlm.nih.gov/pubmed/37277732 http://dx.doi.org/10.1186/s12859-023-05364-3 |
_version_ | 1785053674626613248 |
---|---|
author | Chen, Chaoran Taepper, Alexander Engelniederhammer, Fabian Kellerer, Jonas Roemer, Cornelius Stadler, Tanja |
author_facet | Chen, Chaoran Taepper, Alexander Engelniederhammer, Fabian Kellerer, Jonas Roemer, Cornelius Stadler, Tanja |
author_sort | Chen, Chaoran |
collection | PubMed |
description | BACKGROUND: Recent epidemic outbreaks such as the SARS-CoV-2 pandemic and the mpox outbreak in 2022 have demonstrated the value of genomic sequencing data for tracking the origin and spread of pathogens. Laboratories around the globe generated new sequences at unprecedented speed and volume and bioinformaticians developed new tools and dashboards to analyze this wealth of data. However, a major challenge that remains is the lack of simple and efficient approaches for accessing and processing sequencing data. RESULTS: The Lightweight API for Sequences (LAPIS) facilitates rapid retrieval and analysis of genomic sequencing data through a REST API. It supports complex mutation- and metadata-based queries and can perform aggregation operations on massive datasets. LAPIS is optimized for typical questions relevant to genomic epidemiology. Using a newly-developed in-memory database engine, it has a high speed and throughput: between 25 January and 4 February 2023, the SARS-CoV-2 instance of LAPIS, which contains 14.5 million sequences, processed over 20 million requests with a mean response time of 411 ms and a median response time of 1 ms. LAPIS is the core engine behind our dashboards on genspectrum.org and we currently maintain public LAPIS instances for SARS-CoV-2 and mpox. CONCLUSIONS: Powered by an optimized database engine and available through a web API, LAPIS enhances the accessibility of genomic sequencing data. It is designed to serve as a common backend for dashboards and analyses with the potential to be integrated into common database platforms such as GenBank. |
format | Online Article Text |
id | pubmed-10240112 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-102401122023-06-06 LAPIS is a fast web API for massive open virus sequencing data Chen, Chaoran Taepper, Alexander Engelniederhammer, Fabian Kellerer, Jonas Roemer, Cornelius Stadler, Tanja BMC Bioinformatics Software BACKGROUND: Recent epidemic outbreaks such as the SARS-CoV-2 pandemic and the mpox outbreak in 2022 have demonstrated the value of genomic sequencing data for tracking the origin and spread of pathogens. Laboratories around the globe generated new sequences at unprecedented speed and volume and bioinformaticians developed new tools and dashboards to analyze this wealth of data. However, a major challenge that remains is the lack of simple and efficient approaches for accessing and processing sequencing data. RESULTS: The Lightweight API for Sequences (LAPIS) facilitates rapid retrieval and analysis of genomic sequencing data through a REST API. It supports complex mutation- and metadata-based queries and can perform aggregation operations on massive datasets. LAPIS is optimized for typical questions relevant to genomic epidemiology. Using a newly-developed in-memory database engine, it has a high speed and throughput: between 25 January and 4 February 2023, the SARS-CoV-2 instance of LAPIS, which contains 14.5 million sequences, processed over 20 million requests with a mean response time of 411 ms and a median response time of 1 ms. LAPIS is the core engine behind our dashboards on genspectrum.org and we currently maintain public LAPIS instances for SARS-CoV-2 and mpox. CONCLUSIONS: Powered by an optimized database engine and available through a web API, LAPIS enhances the accessibility of genomic sequencing data. It is designed to serve as a common backend for dashboards and analyses with the potential to be integrated into common database platforms such as GenBank. BioMed Central 2023-06-05 /pmc/articles/PMC10240112/ /pubmed/37277732 http://dx.doi.org/10.1186/s12859-023-05364-3 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Software Chen, Chaoran Taepper, Alexander Engelniederhammer, Fabian Kellerer, Jonas Roemer, Cornelius Stadler, Tanja LAPIS is a fast web API for massive open virus sequencing data |
title | LAPIS is a fast web API for massive open virus sequencing data |
title_full | LAPIS is a fast web API for massive open virus sequencing data |
title_fullStr | LAPIS is a fast web API for massive open virus sequencing data |
title_full_unstemmed | LAPIS is a fast web API for massive open virus sequencing data |
title_short | LAPIS is a fast web API for massive open virus sequencing data |
title_sort | lapis is a fast web api for massive open virus sequencing data |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10240112/ https://www.ncbi.nlm.nih.gov/pubmed/37277732 http://dx.doi.org/10.1186/s12859-023-05364-3 |
work_keys_str_mv | AT chenchaoran lapisisafastwebapiformassiveopenvirussequencingdata AT taepperalexander lapisisafastwebapiformassiveopenvirussequencingdata AT engelniederhammerfabian lapisisafastwebapiformassiveopenvirussequencingdata AT kellererjonas lapisisafastwebapiformassiveopenvirussequencingdata AT roemercornelius lapisisafastwebapiformassiveopenvirussequencingdata AT stadlertanja lapisisafastwebapiformassiveopenvirussequencingdata |