Cargando…
Learning to Unknot
We introduce natural language processing into the study of knot theory, as made natural by the braid word representation of knots. We study the UNKNOT problem of determining whether or not a given knot is the unknot. After describing an algorithm to randomly generate $N$-crossing braids and their kn...
Autores principales: | , , , |
---|---|
Lenguaje: | eng |
Publicado: |
2020
|
Materias: | |
Acceso en línea: | https://dx.doi.org/10.1088/2632-2153/abe91f http://cds.cern.ch/record/2744999 |
_version_ | 1780968670532468736 |
---|---|
author | Gukov, Sergei Halverson, James Ruehle, Fabian Sułkowski, Piotr |
author_facet | Gukov, Sergei Halverson, James Ruehle, Fabian Sułkowski, Piotr |
author_sort | Gukov, Sergei |
collection | CERN |
description | We introduce natural language processing into the study of knot theory, as made natural by the braid word representation of knots. We study the UNKNOT problem of determining whether or not a given knot is the unknot. After describing an algorithm to randomly generate $N$-crossing braids and their knot closures and discussing the induced prior on the distribution of knots, we apply binary classification to the UNKNOT decision problem. We find that the Reformer and shared-QK Transformer network architectures outperform fully-connected networks, though all perform well. Perhaps surprisingly, we find that accuracy increases with the length of the braid word, and that the networks learn a direct correlation between the confidence of their predictions and the degree of the Jones polynomial. Finally, we utilize reinforcement learning (RL) to find sequences of Markov moves and braid relations that simplify knots and can identify unknots by explicitly giving the sequence of unknotting actions. Trust region policy optimization (TRPO) performs consistently well for a wide range of crossing numbers and thoroughly outperformed other RL algorithms and random walkers. Studying these actions, we find that braid relations are more useful in simplifying to the unknot than one of the Markov moves. |
id | cern-2744999 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2020 |
record_format | invenio |
spelling | cern-27449992021-12-08T16:20:14Zdoi:10.1088/2632-2153/abe91fhttp://cds.cern.ch/record/2744999engGukov, SergeiHalverson, JamesRuehle, FabianSułkowski, PiotrLearning to Unknothep-thParticle Physics - Theorycs.LGComputing and Computersmath.GTMathematical Physics and MathematicsWe introduce natural language processing into the study of knot theory, as made natural by the braid word representation of knots. We study the UNKNOT problem of determining whether or not a given knot is the unknot. After describing an algorithm to randomly generate $N$-crossing braids and their knot closures and discussing the induced prior on the distribution of knots, we apply binary classification to the UNKNOT decision problem. We find that the Reformer and shared-QK Transformer network architectures outperform fully-connected networks, though all perform well. Perhaps surprisingly, we find that accuracy increases with the length of the braid word, and that the networks learn a direct correlation between the confidence of their predictions and the degree of the Jones polynomial. Finally, we utilize reinforcement learning (RL) to find sequences of Markov moves and braid relations that simplify knots and can identify unknots by explicitly giving the sequence of unknotting actions. Trust region policy optimization (TRPO) performs consistently well for a wide range of crossing numbers and thoroughly outperformed other RL algorithms and random walkers. Studying these actions, we find that braid relations are more useful in simplifying to the unknot than one of the Markov moves.We introduce natural language processing into the study of knot theory, as made natural by the braid word representation of knots. We study the UNKNOT problem of determining whether or not a given knot is the unknot. After describing an algorithm to randomly generate N-crossing braids and their knot closures and discussing the induced prior on the distribution of knots, we apply binary classification to the UNKNOT decision problem. We find that the Reformer and shared-QK Transformer network architectures outperform fully-connected networks, though all perform at 95% accuracy. Perhaps surprisingly, we find that accuracy increases with the length of the braid word, and that the networks learn a direct correlation between the confidence of their predictions and the degree of the Jones polynomial. Finally, we utilize reinforcement learning (RL) to find sequences of Markov moves and braid relations that simplify knots and can identify unknots by explicitly giving the sequence of unknotting actions. Trust region policy optimization (TRPO) performs consistently well, reducing 80% of the unknots with up to 96 crossings we tested to the empty braid word, and thoroughly outperformed other RL algorithms and random walkers. Studying these actions, we find that braid relations are more useful in simplifying to the unknot than one of the Markov moves.arXiv:2010.16263CALT-2020-046CERN-TH-2020-179oai:cds.cern.ch:27449992020-10-28 |
spellingShingle | hep-th Particle Physics - Theory cs.LG Computing and Computers math.GT Mathematical Physics and Mathematics Gukov, Sergei Halverson, James Ruehle, Fabian Sułkowski, Piotr Learning to Unknot |
title | Learning to Unknot |
title_full | Learning to Unknot |
title_fullStr | Learning to Unknot |
title_full_unstemmed | Learning to Unknot |
title_short | Learning to Unknot |
title_sort | learning to unknot |
topic | hep-th Particle Physics - Theory cs.LG Computing and Computers math.GT Mathematical Physics and Mathematics |
url | https://dx.doi.org/10.1088/2632-2153/abe91f http://cds.cern.ch/record/2744999 |
work_keys_str_mv | AT gukovsergei learningtounknot AT halversonjames learningtounknot AT ruehlefabian learningtounknot AT sułkowskipiotr learningtounknot |