Cargando…
RCOVID19: Recurrence-based SARS-CoV-2 features using chaos game representation
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is responsible for the COVID-19 pandemic. It was first detected in China and was rapidly spread to other countries. Several thousands of whole genome sequences of SARS-CoV-2 have been reported and it is important to compare them and identi...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7411429/ https://www.ncbi.nlm.nih.gov/pubmed/32835040 http://dx.doi.org/10.1016/j.dib.2020.106144 |
_version_ | 1783568377352028160 |
---|---|
author | Olyaee, Mohammad Hossein Pirgazi, Jamshid Khalifeh, Khosrow Khanteymoori, Alireza |
author_facet | Olyaee, Mohammad Hossein Pirgazi, Jamshid Khalifeh, Khosrow Khanteymoori, Alireza |
author_sort | Olyaee, Mohammad Hossein |
collection | PubMed |
description | Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is responsible for the COVID-19 pandemic. It was first detected in China and was rapidly spread to other countries. Several thousands of whole genome sequences of SARS-CoV-2 have been reported and it is important to compare them and identify distinctive evolutionary/mutant markers. Utilizing chaos game representation (CGR) as well as recurrence quantification analysis (RQA) as a powerful nonlinear analysis technique, we proposed an effective process to extract several valuable features from genomic sequences of SARS-CoV-2. The represented features enable us to compare genomic sequences with different lengths. The provided dataset involves totally 18 RQA-based features for 4496 instances of SARS-CoV-2. |
format | Online Article Text |
id | pubmed-7411429 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-74114292020-08-07 RCOVID19: Recurrence-based SARS-CoV-2 features using chaos game representation Olyaee, Mohammad Hossein Pirgazi, Jamshid Khalifeh, Khosrow Khanteymoori, Alireza Data Brief Biochemistry, Genetics and Molecular Biology Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is responsible for the COVID-19 pandemic. It was first detected in China and was rapidly spread to other countries. Several thousands of whole genome sequences of SARS-CoV-2 have been reported and it is important to compare them and identify distinctive evolutionary/mutant markers. Utilizing chaos game representation (CGR) as well as recurrence quantification analysis (RQA) as a powerful nonlinear analysis technique, we proposed an effective process to extract several valuable features from genomic sequences of SARS-CoV-2. The represented features enable us to compare genomic sequences with different lengths. The provided dataset involves totally 18 RQA-based features for 4496 instances of SARS-CoV-2. Elsevier 2020-08-07 /pmc/articles/PMC7411429/ /pubmed/32835040 http://dx.doi.org/10.1016/j.dib.2020.106144 Text en © 2020 The Author(s) http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Biochemistry, Genetics and Molecular Biology Olyaee, Mohammad Hossein Pirgazi, Jamshid Khalifeh, Khosrow Khanteymoori, Alireza RCOVID19: Recurrence-based SARS-CoV-2 features using chaos game representation |
title | RCOVID19: Recurrence-based SARS-CoV-2 features using chaos game representation |
title_full | RCOVID19: Recurrence-based SARS-CoV-2 features using chaos game representation |
title_fullStr | RCOVID19: Recurrence-based SARS-CoV-2 features using chaos game representation |
title_full_unstemmed | RCOVID19: Recurrence-based SARS-CoV-2 features using chaos game representation |
title_short | RCOVID19: Recurrence-based SARS-CoV-2 features using chaos game representation |
title_sort | rcovid19: recurrence-based sars-cov-2 features using chaos game representation |
topic | Biochemistry, Genetics and Molecular Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7411429/ https://www.ncbi.nlm.nih.gov/pubmed/32835040 http://dx.doi.org/10.1016/j.dib.2020.106144 |
work_keys_str_mv | AT olyaeemohammadhossein rcovid19recurrencebasedsarscov2featuresusingchaosgamerepresentation AT pirgazijamshid rcovid19recurrencebasedsarscov2featuresusingchaosgamerepresentation AT khalifehkhosrow rcovid19recurrencebasedsarscov2featuresusingchaosgamerepresentation AT khanteymoorialireza rcovid19recurrencebasedsarscov2featuresusingchaosgamerepresentation |