Sam Blackwell

The unknotting number, hard unknot diagrams, and reinforcement learning

Sep 13, 2024

Taylor Applebaum, Sam Blackwell, Alex Davies, Thomas Edlich, András Juhász, Marc Lackenby, Nenad Tomašev, Daniel Zheng

Abstract: We have developed a reinforcement learning agent that often finds a minimal sequence of unknotting crossing changes for a knot diagram with up to 200 crossings, hence giving an upper bound on the unknotting number. We have used this to determine the unknotting number of 57k knots. We took diagrams of connected sums of such knots with oppositely signed signatures, where the summands were overlaid. The agent has found examples where several of the crossing changes in an unknotting collection of crossings result in hyperbolic knots. Based on this, we have shown that, given knots $K$ and $K'$ that satisfy some mild assumptions, there is a diagram of their connected sum and $u(K) + u(K')$ unknotting crossings such that changing any one of them results in a prime knot. As a by-product, we have obtained a dataset of 2.6 million distinct hard unknot diagrams, most of them with fewer than 35 crossings. Assuming the additivity of the unknotting number, we have determined the unknotting number of 43 knots with at most 12 crossings for which the unknotting number was previously unknown.

* 29 pages, 17 figures

Finding Increasingly Large Extremal Graphs with AlphaZero and Tabu Search

Nov 06, 2023

Abbas Mehrabian, Ankit Anand, Hyunjik Kim, Nicolas Sonnerat, Matej Balog, Gheorghe Comanici, Tudor Berariu, Andrew Lee, Anian Ruoss, Anna Bulanova(+9 more)

Abstract: This work studies a central extremal graph theory problem inspired by a 1975 conjecture of Erdős, which aims to find graphs with a given size (number of nodes) that maximize the number of edges without having 3- or 4-cycles. We formulate this problem as a sequential decision-making problem and compare AlphaZero, a neural network-guided tree search, with tabu search, a heuristic local search method. Using either method, by introducing a curriculum -- jump-starting the search for larger graphs using good graphs found at smaller sizes -- we improve the state-of-the-art lower bounds for several sizes. We also propose a flexible graph-generation environment and a permutation-invariant network architecture for learning to search in the space of graphs.
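
To make the objective in this abstract concrete, here is a minimal Python sketch of the constraint it describes: building a graph edge by edge while never creating a 3- or 4-cycle. It is a plain randomized greedy baseline, not the AlphaZero or tabu search methods from the paper, and all function names (`within_distance`, `greedy_graph`, `has_3_or_4_cycle`) are hypothetical names chosen for this illustration. It relies on the fact that, in a graph with no 3- or 4-cycles, adding an edge between two non-adjacent nodes creates such a cycle exactly when the nodes are already within distance 3 of each other.

```python
import random
from itertools import combinations


def within_distance(adj, u, v, limit):
    """Return True if v can be reached from u in at most `limit` steps (BFS)."""
    frontier, seen = {u}, {u}
    for _ in range(limit):
        frontier = {w for x in frontier for w in adj[x]} - seen
        if v in frontier:
            return True
        seen |= frontier
    return False


def greedy_graph(n, seed=0):
    """Greedy baseline (an illustration, not the paper's method).

    Visits node pairs in a seeded random order and adds an edge whenever
    doing so cannot close a cycle of length 3 or 4.
    """
    pairs = list(combinations(range(n), 2))
    random.Random(seed).shuffle(pairs)
    adj = {i: set() for i in range(n)}
    edges = []
    for u, v in pairs:
        # An existing path of length 2 or 3 between u and v would become
        # a 3- or 4-cycle once the edge (u, v) is added.
        if not within_distance(adj, u, v, 3):
            adj[u].add(v)
            adj[v].add(u)
            edges.append((u, v))
    return adj, edges


def has_3_or_4_cycle(adj):
    """Independent check used to validate a finished graph."""
    for u, v in combinations(adj, 2):
        common = adj[u] & adj[v]
        # Two common neighbours form a 4-cycle; one common neighbour
        # plus the edge (u, v) forms a triangle.
        if len(common) >= 2 or (common and v in adj[u]):
            return True
    return False


if __name__ == "__main__":
    adj, edges = greedy_graph(20)
    print(len(edges), "edges; short cycle present:", has_3_or_4_cycle(adj))
```

The edge count such a greedy pass reaches is only a lower bound on the extremal value for that number of nodes; search methods like those compared in the abstract aim to push that bound higher.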

* Accepted at MATH AI workshop at NeurIPS 2023, First three authors contributed equally, Last two authors have equal senior contribution

Learning to Decode the Surface Code with a Recurrent, Transformer-Based Neural Network

Oct 09, 2023

Johannes Bausch, Andrew W Senior, Francisco J H Heras, Thomas Edlich, Alex Davies, Michael Newman, Cody Jones, Kevin Satzinger, Murphy Yuezhen Niu, Sam Blackwell(+8 more)

Abstract: Quantum error-correction is a prerequisite for reliable quantum computation. Towards this goal, we present a recurrent, transformer-based neural network which learns to decode the surface code, the leading quantum error-correction code. Our decoder outperforms state-of-the-art algorithmic decoders on real-world data from Google's Sycamore quantum processor for distance 3 and 5 surface codes. On distances up to 11, the decoder maintains its advantage on simulated data with realistic noise including cross-talk, leakage, and analog readout signals, and sustains its accuracy far beyond the 25 cycles it was trained on. Our work illustrates the ability of machine learning to go beyond human-designed algorithms by learning from data directly, highlighting machine learning as a strong contender for decoding in quantum computers.

A Deep Learning Approach for Characterizing Major Galaxy Mergers

Feb 09, 2021

Skanda Koppula, Victor Bapst, Marc Huertas-Company, Sam Blackwell, Agnieszka Grabska-Barwinska, Sander Dieleman, Andrea Huber, Natasha Antropova, Mikolaj Binkowski, Hannah Openshaw(+8 more)

Abstract: Fine-grained estimation of galaxy merger stages from observations is a key problem for validating our current theoretical understanding of galaxy formation. To this end, we demonstrate a CNN-based regression model that is able to predict, for the first time, using a single image, the merger stage relative to the first perigee passage with a median error of 38.3 million years (Myrs) over a period of 400 Myrs. This model uses no specific dynamical modeling and learns only from simulated merger events. We show that our model provides reasonable estimates on real observations, approximately matching prior estimates provided by detailed dynamical modeling. We provide a preliminary interpretability analysis of our models, and demonstrate first steps toward calibrated uncertainty estimation.

* Third Workshop on Machine Learning and the Physical Sciences (NeurIPS 2020), Vancouver, Canada

Deep learning to achieve clinically applicable segmentation of head and neck anatomy for radiotherapy

Sep 12, 2018

Stanislav Nikolov, Sam Blackwell, Ruheena Mendes, Jeffrey De Fauw, Clemens Meyer, Cían Hughes, Harry Askham, Bernardino Romera-Paredes, Alan Karthikesalingam, Carlton Chu(+13 more)

Abstract: Over half a million individuals are diagnosed with head and neck cancer each year worldwide. Radiotherapy is an important curative treatment for this disease, but it requires manually intensive delineation of radiosensitive organs at risk (OARs). This planning process can delay treatment commencement. While auto-segmentation algorithms offer a potentially time-saving solution, the challenges in defining, quantifying and achieving expert performance remain. Adopting a deep learning approach, we demonstrate a 3D U-Net architecture that achieves performance similar to experts in delineating a wide range of head and neck OARs. The model was trained on a dataset of 663 deidentified computed tomography (CT) scans acquired in routine clinical practice and segmented according to consensus OAR definitions. We demonstrate its generalisability through application to an independent test set of 24 CT scans available from The Cancer Imaging Archive collected at multiple international sites previously unseen to the model, each segmented by two independent experts and consisting of 21 OARs commonly segmented in clinical practice. With appropriate validation studies and regulatory approvals, this system could improve the effectiveness of radiotherapy pathways.

Massively Parallel Methods for Deep Reinforcement Learning

Jul 16, 2015

Arun Nair, Praveen Srinivasan, Sam Blackwell, Cagdas Alcicek, Rory Fearon, Alessandro De Maria, Vedavyas Panneershelvam, Mustafa Suleyman, Charles Beattie, Stig Petersen(+4 more)

Abstract: We present the first massively distributed architecture for deep reinforcement learning. This architecture uses four main components: parallel actors that generate new behaviour; parallel learners that are trained from stored experience; a distributed neural network to represent the value function or behaviour policy; and a distributed store of experience. We used our architecture to implement the Deep Q-Network algorithm (DQN). Our distributed algorithm was applied to 49 Atari 2600 games from the Arcade Learning Environment, using identical hyperparameters. Our performance surpassed non-distributed DQN in 41 of the 49 games and also reduced the wall-time required to achieve these results by an order of magnitude on most games.
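
The four roles named in this abstract (actors, learners, a shared value function, and an experience store) can be illustrated with a toy, single-process Python sketch. Everything here is an assumption made for illustration: the five-state chain environment, the tabular Q-values standing in for the neural network, the plain list standing in for the distributed store, and the names `Actor`, `learn`, and `train`. Nothing in it is distributed or parallel, and it is not the paper's implementation; it only shows how the roles interact through shared parameters and stored experience.

```python
import random

N_STATES, GOAL, GAMMA, LR = 5, 4, 0.9, 0.5


def step(state, action):
    """Toy chain: action 1 moves right, action 0 moves left; reaching GOAL pays 1."""
    nxt = min(state + 1, GOAL) if action == 1 else max(state - 1, 0)
    done = nxt == GOAL
    return nxt, (1.0 if done else 0.0), done


class Actor:
    """Generates behaviour from the shared Q-values and records it in the store."""

    def __init__(self, rng):
        self.rng, self.state = rng, 0

    def act(self, q, store, epsilon=0.5):
        if self.rng.random() < epsilon:
            action = self.rng.randrange(2)  # explore
        else:
            # Greedy choice; ties resolve to action 1 because it is listed first.
            action = max((1, 0), key=lambda a: q[self.state][a])
        nxt, reward, done = step(self.state, action)
        store.append((self.state, action, reward, nxt, done))
        self.state = 0 if done else nxt


def learn(q, store, rng, batch_size=32):
    """Learner: sample stored experience and apply the Q-learning update."""
    for s, a, r, s2, done in rng.sample(store, min(batch_size, len(store))):
        target = r if done else r + GAMMA * max(q[s2])
        q[s][a] += LR * (target - q[s][a])


def train(rounds=2000, n_actors=4, seed=0):
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]  # shared value function (tabular stand-in)
    store = []                                 # shared experience store
    actors = [Actor(rng) for _ in range(n_actors)]
    for _ in range(rounds):
        for actor in actors:  # sequential here; parallel in a real system
            actor.act(q, store)
        learn(q, store, rng)
    return q


if __name__ == "__main__":
    for s, (left, right) in enumerate(train()):
        print(s, round(left, 3), round(right, 3))
```

In a genuinely distributed setting, the loop over actors and the learner step would run concurrently on separate machines, and the Q-values and store would be served over the network; the sequential loop above only mimics that data flow.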

* Presented at the Deep Learning Workshop, International Conference on Machine Learning, Lille, France, 2015
