Cargando…

Learning to Unknot

We introduce natural language processing into the study of knot theory, as made natural by the braid word representation of knots. We study the UNKNOT problem of determining whether or not a given knot is the unknot. After describing an algorithm to randomly generate $N$-crossing braids and their kn...

Descripción completa

Detalles Bibliográficos
Autores principales: Gukov, Sergei, Halverson, James, Ruehle, Fabian, Sułkowski, Piotr
Lenguaje:eng
Publicado: 2020
Materias:
Acceso en línea:https://dx.doi.org/10.1088/2632-2153/abe91f
http://cds.cern.ch/record/2744999
_version_ 1780968670532468736
author Gukov, Sergei
Halverson, James
Ruehle, Fabian
Sułkowski, Piotr
author_facet Gukov, Sergei
Halverson, James
Ruehle, Fabian
Sułkowski, Piotr
author_sort Gukov, Sergei
collection CERN
description We introduce natural language processing into the study of knot theory, as made natural by the braid word representation of knots. We study the UNKNOT problem of determining whether or not a given knot is the unknot. After describing an algorithm to randomly generate $N$-crossing braids and their knot closures and discussing the induced prior on the distribution of knots, we apply binary classification to the UNKNOT decision problem. We find that the Reformer and shared-QK Transformer network architectures outperform fully-connected networks, though all perform well. Perhaps surprisingly, we find that accuracy increases with the length of the braid word, and that the networks learn a direct correlation between the confidence of their predictions and the degree of the Jones polynomial. Finally, we utilize reinforcement learning (RL) to find sequences of Markov moves and braid relations that simplify knots and can identify unknots by explicitly giving the sequence of unknotting actions. Trust region policy optimization (TRPO) performs consistently well for a wide range of crossing numbers and thoroughly outperformed other RL algorithms and random walkers. Studying these actions, we find that braid relations are more useful in simplifying to the unknot than one of the Markov moves.
id cern-2744999
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2020
record_format invenio
spelling cern-27449992021-12-08T16:20:14Zdoi:10.1088/2632-2153/abe91fhttp://cds.cern.ch/record/2744999engGukov, SergeiHalverson, JamesRuehle, FabianSułkowski, PiotrLearning to Unknothep-thParticle Physics - Theorycs.LGComputing and Computersmath.GTMathematical Physics and MathematicsWe introduce natural language processing into the study of knot theory, as made natural by the braid word representation of knots. We study the UNKNOT problem of determining whether or not a given knot is the unknot. After describing an algorithm to randomly generate $N$-crossing braids and their knot closures and discussing the induced prior on the distribution of knots, we apply binary classification to the UNKNOT decision problem. We find that the Reformer and shared-QK Transformer network architectures outperform fully-connected networks, though all perform well. Perhaps surprisingly, we find that accuracy increases with the length of the braid word, and that the networks learn a direct correlation between the confidence of their predictions and the degree of the Jones polynomial. Finally, we utilize reinforcement learning (RL) to find sequences of Markov moves and braid relations that simplify knots and can identify unknots by explicitly giving the sequence of unknotting actions. Trust region policy optimization (TRPO) performs consistently well for a wide range of crossing numbers and thoroughly outperformed other RL algorithms and random walkers. Studying these actions, we find that braid relations are more useful in simplifying to the unknot than one of the Markov moves.We introduce natural language processing into the study of knot theory, as made natural by the braid word representation of knots. We study the UNKNOT problem of determining whether or not a given knot is the unknot. After describing an algorithm to randomly generate N-crossing braids and their knot closures and discussing the induced prior on the distribution of knots, we apply binary classification to the UNKNOT decision problem. We find that the Reformer and shared-QK Transformer network architectures outperform fully-connected networks, though all perform at 95% accuracy. Perhaps surprisingly, we find that accuracy increases with the length of the braid word, and that the networks learn a direct correlation between the confidence of their predictions and the degree of the Jones polynomial. Finally, we utilize reinforcement learning (RL) to find sequences of Markov moves and braid relations that simplify knots and can identify unknots by explicitly giving the sequence of unknotting actions. Trust region policy optimization (TRPO) performs consistently well, reducing 80% of the unknots with up to 96 crossings we tested to the empty braid word, and thoroughly outperformed other RL algorithms and random walkers. Studying these actions, we find that braid relations are more useful in simplifying to the unknot than one of the Markov moves.arXiv:2010.16263CALT-2020-046CERN-TH-2020-179oai:cds.cern.ch:27449992020-10-28
spellingShingle hep-th
Particle Physics - Theory
cs.LG
Computing and Computers
math.GT
Mathematical Physics and Mathematics
Gukov, Sergei
Halverson, James
Ruehle, Fabian
Sułkowski, Piotr
Learning to Unknot
title Learning to Unknot
title_full Learning to Unknot
title_fullStr Learning to Unknot
title_full_unstemmed Learning to Unknot
title_short Learning to Unknot
title_sort learning to unknot
topic hep-th
Particle Physics - Theory
cs.LG
Computing and Computers
math.GT
Mathematical Physics and Mathematics
url https://dx.doi.org/10.1088/2632-2153/abe91f
http://cds.cern.ch/record/2744999
work_keys_str_mv AT gukovsergei learningtounknot
AT halversonjames learningtounknot
AT ruehlefabian learningtounknot
AT sułkowskipiotr learningtounknot