Abstract:The rapid advancement of machine learning solutions has often coincided with the production of a public test dataset. Such datasets reduce the largest barrier to entry for tackling a problem -- procuring data -- while also providing a benchmark for comparing different solutions. Furthermore, large datasets have been used to train high-performing feature finders which are then applied to problems beyond those initially defined. In order to encourage rapid development in the analysis of data collected using liquid argon time projection chambers, a class of particle detectors used in high energy physics experiments, we have produced PILArNet, the first 2D and 3D open dataset to be used for a couple of key analysis tasks. The initial dataset presented in this paper contains 300,000 samples simulated and recorded in three different volume sizes. The dataset is stored efficiently in a sparse 2D and 3D matrix format with auxiliary information about the simulated particles in the volume, and is made available for public research use. In this paper we describe the dataset, the tasks, and the method used to procure the sample.
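The abstract mentions a sparse 2D and 3D matrix format for the event data. As a hedged illustration only, the sketch below shows one common way such sparse volumetric data is handled in Python: non-zero voxels stored as coordinate/value pairs and scattered into a dense array on demand. The array contents, voxel count, and helper name are invented for illustration and are not the actual PILArNet file schema.

```python
# Illustrative sketch of a sparse 3D voxel representation (not the PILArNet schema).
import numpy as np

N_VOXELS = 192  # assumed cubic volume edge length (placeholder value)

# Sparse form: only non-zero voxels are kept, as (x, y, z) indices plus an energy value.
coords = np.array([[10, 42, 97], [11, 42, 97], [11, 43, 98]], dtype=np.int32)
values = np.array([0.7, 1.3, 0.4], dtype=np.float32)

def to_dense(coords, values, size=N_VOXELS):
    """Scatter sparse voxels into a dense 3D array (memory-hungry; suitable for small crops)."""
    dense = np.zeros((size, size, size), dtype=np.float32)
    dense[coords[:, 0], coords[:, 1], coords[:, 2]] = values
    return dense

dense = to_dense(coords, values)
print(dense.sum(), len(values))  # total deposited energy and number of occupied voxels
```

Storing only the occupied voxels is what makes the format efficient: liquid argon TPC events are mostly empty space, so the sparse form is orders of magnitude smaller than the equivalent dense volume.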
Abstract:Mapping all the neurons in the brain requires automatic reconstruction of entire cells from volume electron microscopy data. The flood-filling network (FFN) architecture achieves leading performance on this task, but training the network is computationally very expensive. In order to reduce the training time, we implemented synchronous, data-parallel distributed training using the Horovod framework on top of the published FFN code. We demonstrated the scaling of FFN training up to 1024 Intel Knights Landing (KNL) nodes at the Argonne Leadership Computing Facility. We investigated the training accuracy with different optimizers, learning rates, and optional warm-up periods. We found that square-root scaling of the learning rate works best beyond 16 nodes, in contrast to smaller node counts, where linear learning rate scaling with warm-up performs best. Our distributed training reaches 95% accuracy in approximately 4.5 hours on 1024 KNL nodes using the Adam optimizer.
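For readers unfamiliar with Horovod-based synchronous data-parallel training, the sketch below shows the standard pattern together with the two learning-rate scaling rules the abstract compares. It is a minimal sketch, not the authors' code: the model and loss are placeholders standing in for the FFN graph, and the base learning rate, warm-up length, and 16-node threshold are taken from the abstract or assumed for illustration.

```python
# Hedged sketch: synchronous data-parallel training with Horovod on TensorFlow 1.x.
# The graph below is a placeholder, not the actual FFN model.
import tensorflow as tf
import horovod.tensorflow as hvd

hvd.init()  # one training process per node/worker

BASE_LR = 0.001       # assumed single-worker learning rate (placeholder)
WARMUP_STEPS = 1000   # assumed warm-up length (placeholder)

def scaled_learning_rate(global_step):
    """Linear scaling with warm-up at small worker counts, square-root scaling beyond 16."""
    n = hvd.size()
    if n <= 16:
        target = BASE_LR * n  # linear scaling
        warm = tf.minimum(tf.cast(global_step, tf.float32) / WARMUP_STEPS, 1.0)
        return BASE_LR + (target - BASE_LR) * warm
    return BASE_LR * n ** 0.5  # square-root scaling, no warm-up

# Placeholder model standing in for the FFN.
x = tf.placeholder(tf.float32, [None, 64])
y = tf.placeholder(tf.float32, [None, 1])
logits = tf.layers.dense(x, 1)
loss = tf.losses.sigmoid_cross_entropy(y, logits)

global_step = tf.train.get_or_create_global_step()
opt = tf.train.AdamOptimizer(scaled_learning_rate(global_step))
opt = hvd.DistributedOptimizer(opt)  # all-reduces gradients across workers each step
train_op = opt.minimize(loss, global_step=global_step)

# Broadcast initial weights from rank 0 so every worker starts from identical parameters.
hooks = [hvd.BroadcastGlobalVariablesHook(0)]
with tf.train.MonitoredTrainingSession(hooks=hooks) as sess:
    pass  # feed batches of training data and run train_op here
```

Because every worker processes a different shard of data and gradients are averaged synchronously, the effective batch size grows with the number of nodes; the learning-rate scaling rule compensates for that growth, which is why its form matters at 1024 nodes.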