Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Peter Nugent

FAIR Universe HiggsML Uncertainty Challenge Competition

Oct 03, 2024

Wahid Bhimji, Paolo Calafiura, Ragansu Chakkappai, Yuan-Tang Chou, Sascha Diefenbacher, Jordan Dudley, Steven Farrell, Aishik Ghosh, Isabelle Guyon, Chris Harris(+12 more)

Figure 1 for FAIR Universe HiggsML Uncertainty Challenge Competition

Figure 2 for FAIR Universe HiggsML Uncertainty Challenge Competition

Figure 3 for FAIR Universe HiggsML Uncertainty Challenge Competition

Figure 4 for FAIR Universe HiggsML Uncertainty Challenge Competition

Abstract:The FAIR Universe -- HiggsML Uncertainty Challenge focuses on measuring the physics properties of elementary particles with imperfect simulators due to differences in modelling systematic errors. Additionally, the challenge is leveraging a large-compute-scale AI platform for sharing datasets, training models, and hosting machine learning competitions. Our challenge brings together the physics and machine learning communities to advance our understanding and methodologies in handling systematic (epistemic) uncertainties within AI techniques.

* Whitepaper for the FAIR Universe HiggsML Uncertainty Challenge Competition, available : https://fair-universe.lbl.gov

Via

Access Paper or Ask Questions

Identifying Transients in the Dark Energy Survey using Convolutional Neural Networks

Mar 18, 2022

Venkitesh Ayyar, Robert Knop Jr., Autumn Awbrey, Alexis Andersen, Peter Nugent

Figure 1 for Identifying Transients in the Dark Energy Survey using Convolutional Neural Networks

Figure 2 for Identifying Transients in the Dark Energy Survey using Convolutional Neural Networks

Figure 3 for Identifying Transients in the Dark Energy Survey using Convolutional Neural Networks

Figure 4 for Identifying Transients in the Dark Energy Survey using Convolutional Neural Networks

Abstract:The ability to discover new transients via image differencing without direct human intervention is an important task in observational astronomy. For these kind of image classification problems, machine Learning techniques such as Convolutional Neural Networks (CNNs) have shown remarkable success. In this work, we present the results of an automated transient identification on images with CNNs for an extant dataset from the Dark Energy Survey Supernova program (DES-SN), whose main focus was on using Type Ia supernovae for cosmology. By performing an architecture search of CNNs, we identify networks that efficiently select non-artifacts (e.g. supernovae, variable stars, AGN, etc.) from artifacts (image defects, mis-subtractions, etc.), achieving the efficiency of previous work performed with random Forests, without the need to expend any effort in feature identification. The CNNs also help us identify a subset of mislabeled images. Performing a relabeling of the images in this subset, the resulting classification with CNNs is significantly better than previous results.

* 14 pages, 13 figures

Via

Access Paper or Ask Questions

Spatial Graph Attention and Curiosity-driven Policy for Antiviral Drug Discovery

Jun 20, 2021

Yulun Wu, Nicholas Choma, Andrew Chen, Mikaela Cashman, Érica T. Prates, Manesh Shah, Verónica G. Melesse Vergara, Austin Clyde, Thomas S. Brettin, Wibe A. de Jong(+6 more)

Figure 1 for Spatial Graph Attention and Curiosity-driven Policy for Antiviral Drug Discovery

Figure 2 for Spatial Graph Attention and Curiosity-driven Policy for Antiviral Drug Discovery

Figure 3 for Spatial Graph Attention and Curiosity-driven Policy for Antiviral Drug Discovery

Figure 4 for Spatial Graph Attention and Curiosity-driven Policy for Antiviral Drug Discovery

Abstract:We developed Distilled Graph Attention Policy Networks (DGAPNs), a curiosity-driven reinforcement learning model to generate novel graph-structured chemical representations that optimize user-defined objectives by efficiently navigating a physically constrained domain. The framework is examined on the task of generating molecules that are designed to bind, noncovalently, to functional sites of SARS-CoV-2 proteins. We present a spatial Graph Attention Network (sGAT) that leverages self-attention over both node and edge attributes as well as encoding spatial structure -- this capability is of considerable interest in areas such as molecular and synthetic biology and drug discovery. An attentional policy network is then introduced to learn decision rules for a dynamic, fragment-based chemical environment, and state-of-the-art policy gradient techniques are employed to train the network with enhanced stability. Exploration is efficiently encouraged by incorporating innovation reward bonuses learned and proposed by random network distillation. In experiments, our framework achieved outstanding results compared to state-of-the-art algorithms, while increasing the diversity of proposed molecules and reducing the complexity of paths to chemical synthesis.

Via

Access Paper or Ask Questions

The Case for Strong Scaling in Deep Learning: Training Large 3D CNNs with Hybrid Parallelism

Jul 25, 2020

Yosuke Oyama, Naoya Maruyama, Nikoli Dryden, Erin McCarthy, Peter Harrington, Jan Balewski, Satoshi Matsuoka, Peter Nugent, Brian Van Essen

Abstract:We present scalable hybrid-parallel algorithms for training large-scale 3D convolutional neural networks. Deep learning-based emerging scientific workflows often require model training with large, high-dimensional samples, which can make training much more costly and even infeasible due to excessive memory usage. We solve these challenges by extensively applying hybrid parallelism throughout the end-to-end training pipeline, including both computations and I/O. Our hybrid-parallel algorithm extends the standard data parallelism with spatial parallelism, which partitions a single sample in the spatial domain, realizing strong scaling beyond the mini-batch dimension with a larger aggregated memory capacity. We evaluate our proposed training algorithms with two challenging 3D CNNs, CosmoFlow and 3D U-Net. Our comprehensive performance studies show that good weak and strong scaling can be achieved for both networks using up 2K GPUs. More importantly, we enable training of CosmoFlow with much larger samples than previously possible, realizing an order-of-magnitude improvement in prediction accuracy.

* 12 pages, 10 figures

Via

Access Paper or Ask Questions