Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:DLBricks: Composable Benchmark Generation to Reduce Deep Learning Benchmarking Effort on CPUs

Nov 20, 2019

Cheng Li, Abdul Dakkak, Jinjun Xiong, Wen-mei Hwu

Figure 1 for DLBricks: Composable Benchmark Generation to Reduce Deep Learning Benchmarking Effort on CPUs

Figure 2 for DLBricks: Composable Benchmark Generation to Reduce Deep Learning Benchmarking Effort on CPUs

Figure 3 for DLBricks: Composable Benchmark Generation to Reduce Deep Learning Benchmarking Effort on CPUs

Figure 4 for DLBricks: Composable Benchmark Generation to Reduce Deep Learning Benchmarking Effort on CPUs

Share this with someone who'll enjoy it:

Abstract:The past few years have seen a surge of applying Deep Learning (DL) models for a wide array of tasks such as image classification, object detection, machine translation, etc. While DL models provide an opportunity to solve otherwise intractable tasks, their adoption relies on them being optimized to meet latency and resource requirements. Benchmarking is a key step in this process but has been hampered in part due to the lack of representative and up-to-date benchmarking suites. This is exacerbated by the fast-evolving pace of DL models. This paper proposes DLBricks, a composable benchmark generation design that reduces the effort of developing, maintaining, and running DL benchmarks on CPUs. DLBricks decomposes DL models into a set of unique runnable networks and constructs the original model's performance using the performance of the generated benchmarks. DLBricks leverages two key observations: DL layers are the performance building blocks of DL models and layers are extensively repeated within and across DL models. Since benchmarks are generated automatically and the benchmarking time is minimized, DLBricks can keep up-to-date with the latest proposed models, relieving the pressure of selecting representative DL models. Moreover, DLBricks allows users to represent proprietary models within benchmark suites. We evaluate DLBricks using $50$ MXNet models spanning $5$ DL tasks on $4$ representative CPU systems. We show that DLBricks provides an accurate performance estimate for the DL models and reduces the benchmarking time across systems (e.g. within $95\%$ accuracy and up to $4.4\times$ benchmarking time speedup on Amazon EC2 c5.xlarge).

View paper on

Share this with someone who'll enjoy it:

Title:DLBricks: Composable Benchmark Generation to Reduce Deep Learning Benchmarking Effort on CPUs

Paper and Code