Cargando…

Data on the solution and processing time reached when constructing a phylogenetic tree using a quantum-inspired computer

Phylogenetic trees provide insight into the evolutionary trajectories of species and molecules. However, because (2n-5)! Phylogenetic trees can be constructed from a dataset containing n sequences, but this method of phylogenetic tree construction is not ideal from the viewpoint of a combinatorial e...

Descripción completa

Detalles Bibliográficos
Autores principales: Onodera, Wataru, Hara, Nobuyuki, Aoki, Shiho, Asahi, Toru, Sawamura, Naoya
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9978462/
https://www.ncbi.nlm.nih.gov/pubmed/36875213
http://dx.doi.org/10.1016/j.dib.2023.108970
_version_ 1784899526233948160
author Onodera, Wataru
Hara, Nobuyuki
Aoki, Shiho
Asahi, Toru
Sawamura, Naoya
author_facet Onodera, Wataru
Hara, Nobuyuki
Aoki, Shiho
Asahi, Toru
Sawamura, Naoya
author_sort Onodera, Wataru
collection PubMed
description Phylogenetic trees provide insight into the evolutionary trajectories of species and molecules. However, because (2n-5)! Phylogenetic trees can be constructed from a dataset containing n sequences, but this method of phylogenetic tree construction is not ideal from the viewpoint of a combinatorial explosion to determine the optimal tree using brute force. Therefore, we developed a method for constructing a phylogenetic tree using a Fujitsu Digital Annealer, a quantum-inspired computer that solves combinatorial optimization problems at a high speed. Specifically, phylogenetic trees are generated by repeating the process of partitioning a set of sequences into two parts (i.e., the graph-cut problem). Here, the optimality of the solution (normalized cut value) obtained by the proposed method was compared with the existing methods using simulated and real data. The simulation dataset contained 32–3200 sequences, and the average branch length according to a normal distribution or the Yule model ranged from 0.125 to 0.750, covering a wide range of sequence diversity. In addition, the statistical information of the dataset is described in terms of two indices: transitivity and average p-distance. As phylogenetic tree construction methods are expected to continue to improve, we believe that this dataset can be used as a reference for comparison and confirmation of the validity of the results. Further interpretation of these analyses is explained in W. Onodera, N. Hara, S. Aoki, T. Asahi, N. Sawamura, Phylogenetic tree reconstruction via graph cut presented using a quantum-inspired computer, Mol. Phylogenet. Evol. 178 (2023) 107636.
format Online
Article
Text
id pubmed-9978462
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-99784622023-03-03 Data on the solution and processing time reached when constructing a phylogenetic tree using a quantum-inspired computer Onodera, Wataru Hara, Nobuyuki Aoki, Shiho Asahi, Toru Sawamura, Naoya Data Brief Data Article Phylogenetic trees provide insight into the evolutionary trajectories of species and molecules. However, because (2n-5)! Phylogenetic trees can be constructed from a dataset containing n sequences, but this method of phylogenetic tree construction is not ideal from the viewpoint of a combinatorial explosion to determine the optimal tree using brute force. Therefore, we developed a method for constructing a phylogenetic tree using a Fujitsu Digital Annealer, a quantum-inspired computer that solves combinatorial optimization problems at a high speed. Specifically, phylogenetic trees are generated by repeating the process of partitioning a set of sequences into two parts (i.e., the graph-cut problem). Here, the optimality of the solution (normalized cut value) obtained by the proposed method was compared with the existing methods using simulated and real data. The simulation dataset contained 32–3200 sequences, and the average branch length according to a normal distribution or the Yule model ranged from 0.125 to 0.750, covering a wide range of sequence diversity. In addition, the statistical information of the dataset is described in terms of two indices: transitivity and average p-distance. As phylogenetic tree construction methods are expected to continue to improve, we believe that this dataset can be used as a reference for comparison and confirmation of the validity of the results. Further interpretation of these analyses is explained in W. Onodera, N. Hara, S. Aoki, T. Asahi, N. Sawamura, Phylogenetic tree reconstruction via graph cut presented using a quantum-inspired computer, Mol. Phylogenet. Evol. 178 (2023) 107636. Elsevier 2023-02-13 /pmc/articles/PMC9978462/ /pubmed/36875213 http://dx.doi.org/10.1016/j.dib.2023.108970 Text en © 2023 The Author(s) https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Data Article
Onodera, Wataru
Hara, Nobuyuki
Aoki, Shiho
Asahi, Toru
Sawamura, Naoya
Data on the solution and processing time reached when constructing a phylogenetic tree using a quantum-inspired computer
title Data on the solution and processing time reached when constructing a phylogenetic tree using a quantum-inspired computer
title_full Data on the solution and processing time reached when constructing a phylogenetic tree using a quantum-inspired computer
title_fullStr Data on the solution and processing time reached when constructing a phylogenetic tree using a quantum-inspired computer
title_full_unstemmed Data on the solution and processing time reached when constructing a phylogenetic tree using a quantum-inspired computer
title_short Data on the solution and processing time reached when constructing a phylogenetic tree using a quantum-inspired computer
title_sort data on the solution and processing time reached when constructing a phylogenetic tree using a quantum-inspired computer
topic Data Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9978462/
https://www.ncbi.nlm.nih.gov/pubmed/36875213
http://dx.doi.org/10.1016/j.dib.2023.108970
work_keys_str_mv AT onoderawataru dataonthesolutionandprocessingtimereachedwhenconstructingaphylogenetictreeusingaquantuminspiredcomputer
AT haranobuyuki dataonthesolutionandprocessingtimereachedwhenconstructingaphylogenetictreeusingaquantuminspiredcomputer
AT aokishiho dataonthesolutionandprocessingtimereachedwhenconstructingaphylogenetictreeusingaquantuminspiredcomputer
AT asahitoru dataonthesolutionandprocessingtimereachedwhenconstructingaphylogenetictreeusingaquantuminspiredcomputer
AT sawamuranaoya dataonthesolutionandprocessingtimereachedwhenconstructingaphylogenetictreeusingaquantuminspiredcomputer