Cargando…
Data stream dataset of SARS-CoV-2 genome
As of May 25, 2020, the novel coronavirus disease (called COVID-19) spread to more than 185 countries/regions with more than 348,000 deaths and more than 5,550,000 confirmed cases. In the bioinformatics area, one of the crucial points is the analysis of the virus nucleotide sequences using approache...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7306612/ https://www.ncbi.nlm.nih.gov/pubmed/32596428 http://dx.doi.org/10.1016/j.dib.2020.105829 |
_version_ | 1783548690786418688 |
---|---|
author | Barbosa, Raquel de M. Fernandes, Marcelo A.C. |
author_facet | Barbosa, Raquel de M. Fernandes, Marcelo A.C. |
author_sort | Barbosa, Raquel de M. |
collection | PubMed |
description | As of May 25, 2020, the novel coronavirus disease (called COVID-19) spread to more than 185 countries/regions with more than 348,000 deaths and more than 5,550,000 confirmed cases. In the bioinformatics area, one of the crucial points is the analysis of the virus nucleotide sequences using approaches such as data stream techniques and algorithms. However, to make feasible this approach, it is necessary to transform the nucleotide sequences string to numerical stream representation. Thus, the dataset provides four kinds of data stream representation (DSR) of SARS-CoV-2 virus nucleotide sequences. The dataset provides the DSR of 1557 instances of SARS-CoV-2 virus, 11540 other instances of other viruses from the Virus-Host DB dataset, and three instances of Riboviria viruses from NCBI (Betacoronavirus RaTG13, bat-SL-CoVZC45, and bat-SL-CoVZXC21). |
format | Online Article Text |
id | pubmed-7306612 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-73066122020-06-25 Data stream dataset of SARS-CoV-2 genome Barbosa, Raquel de M. Fernandes, Marcelo A.C. Data Brief Biochemistry, Genetics and Molecular Biology As of May 25, 2020, the novel coronavirus disease (called COVID-19) spread to more than 185 countries/regions with more than 348,000 deaths and more than 5,550,000 confirmed cases. In the bioinformatics area, one of the crucial points is the analysis of the virus nucleotide sequences using approaches such as data stream techniques and algorithms. However, to make feasible this approach, it is necessary to transform the nucleotide sequences string to numerical stream representation. Thus, the dataset provides four kinds of data stream representation (DSR) of SARS-CoV-2 virus nucleotide sequences. The dataset provides the DSR of 1557 instances of SARS-CoV-2 virus, 11540 other instances of other viruses from the Virus-Host DB dataset, and three instances of Riboviria viruses from NCBI (Betacoronavirus RaTG13, bat-SL-CoVZC45, and bat-SL-CoVZXC21). Elsevier 2020-06-10 /pmc/articles/PMC7306612/ /pubmed/32596428 http://dx.doi.org/10.1016/j.dib.2020.105829 Text en © 2020 The Author(s) http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Biochemistry, Genetics and Molecular Biology Barbosa, Raquel de M. Fernandes, Marcelo A.C. Data stream dataset of SARS-CoV-2 genome |
title | Data stream dataset of SARS-CoV-2 genome |
title_full | Data stream dataset of SARS-CoV-2 genome |
title_fullStr | Data stream dataset of SARS-CoV-2 genome |
title_full_unstemmed | Data stream dataset of SARS-CoV-2 genome |
title_short | Data stream dataset of SARS-CoV-2 genome |
title_sort | data stream dataset of sars-cov-2 genome |
topic | Biochemistry, Genetics and Molecular Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7306612/ https://www.ncbi.nlm.nih.gov/pubmed/32596428 http://dx.doi.org/10.1016/j.dib.2020.105829 |
work_keys_str_mv | AT barbosaraqueldem datastreamdatasetofsarscov2genome AT fernandesmarceloac datastreamdatasetofsarscov2genome |