Cargando…

RCOVID19: Recurrence-based SARS-CoV-2 features using chaos game representation

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is responsible for the COVID-19 pandemic. It was first detected in China and was rapidly spread to other countries. Several thousands of whole genome sequences of SARS-CoV-2 have been reported and it is important to compare them and identi...

Descripción completa

Detalles Bibliográficos
Autores principales: Olyaee, Mohammad Hossein, Pirgazi, Jamshid, Khalifeh, Khosrow, Khanteymoori, Alireza
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7411429/
https://www.ncbi.nlm.nih.gov/pubmed/32835040
http://dx.doi.org/10.1016/j.dib.2020.106144
_version_ 1783568377352028160
author Olyaee, Mohammad Hossein
Pirgazi, Jamshid
Khalifeh, Khosrow
Khanteymoori, Alireza
author_facet Olyaee, Mohammad Hossein
Pirgazi, Jamshid
Khalifeh, Khosrow
Khanteymoori, Alireza
author_sort Olyaee, Mohammad Hossein
collection PubMed
description Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is responsible for the COVID-19 pandemic. It was first detected in China and was rapidly spread to other countries. Several thousands of whole genome sequences of SARS-CoV-2 have been reported and it is important to compare them and identify distinctive evolutionary/mutant markers. Utilizing chaos game representation (CGR) as well as recurrence quantification analysis (RQA) as a powerful nonlinear analysis technique, we proposed an effective process to extract several valuable features from genomic sequences of SARS-CoV-2. The represented features enable us to compare genomic sequences with different lengths. The provided dataset involves totally 18 RQA-based features for 4496 instances of SARS-CoV-2.
format Online
Article
Text
id pubmed-7411429
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-74114292020-08-07 RCOVID19: Recurrence-based SARS-CoV-2 features using chaos game representation Olyaee, Mohammad Hossein Pirgazi, Jamshid Khalifeh, Khosrow Khanteymoori, Alireza Data Brief Biochemistry, Genetics and Molecular Biology Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is responsible for the COVID-19 pandemic. It was first detected in China and was rapidly spread to other countries. Several thousands of whole genome sequences of SARS-CoV-2 have been reported and it is important to compare them and identify distinctive evolutionary/mutant markers. Utilizing chaos game representation (CGR) as well as recurrence quantification analysis (RQA) as a powerful nonlinear analysis technique, we proposed an effective process to extract several valuable features from genomic sequences of SARS-CoV-2. The represented features enable us to compare genomic sequences with different lengths. The provided dataset involves totally 18 RQA-based features for 4496 instances of SARS-CoV-2. Elsevier 2020-08-07 /pmc/articles/PMC7411429/ /pubmed/32835040 http://dx.doi.org/10.1016/j.dib.2020.106144 Text en © 2020 The Author(s) http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Biochemistry, Genetics and Molecular Biology
Olyaee, Mohammad Hossein
Pirgazi, Jamshid
Khalifeh, Khosrow
Khanteymoori, Alireza
RCOVID19: Recurrence-based SARS-CoV-2 features using chaos game representation
title RCOVID19: Recurrence-based SARS-CoV-2 features using chaos game representation
title_full RCOVID19: Recurrence-based SARS-CoV-2 features using chaos game representation
title_fullStr RCOVID19: Recurrence-based SARS-CoV-2 features using chaos game representation
title_full_unstemmed RCOVID19: Recurrence-based SARS-CoV-2 features using chaos game representation
title_short RCOVID19: Recurrence-based SARS-CoV-2 features using chaos game representation
title_sort rcovid19: recurrence-based sars-cov-2 features using chaos game representation
topic Biochemistry, Genetics and Molecular Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7411429/
https://www.ncbi.nlm.nih.gov/pubmed/32835040
http://dx.doi.org/10.1016/j.dib.2020.106144
work_keys_str_mv AT olyaeemohammadhossein rcovid19recurrencebasedsarscov2featuresusingchaosgamerepresentation
AT pirgazijamshid rcovid19recurrencebasedsarscov2featuresusingchaosgamerepresentation
AT khalifehkhosrow rcovid19recurrencebasedsarscov2featuresusingchaosgamerepresentation
AT khanteymoorialireza rcovid19recurrencebasedsarscov2featuresusingchaosgamerepresentation