Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:ESPnet-ONNX: Bridging a Gap Between Research and Production

Sep 20, 2022

Masao Someki, Yosuke Higuchi, Tomoki Hayashi, Shinji Watanabe

Figure 1 for ESPnet-ONNX: Bridging a Gap Between Research and Production

Figure 2 for ESPnet-ONNX: Bridging a Gap Between Research and Production

Figure 3 for ESPnet-ONNX: Bridging a Gap Between Research and Production

Figure 4 for ESPnet-ONNX: Bridging a Gap Between Research and Production

Share this with someone who'll enjoy it:

Abstract:In the field of deep learning, researchers often focus on inventing novel neural network models and improving benchmarks. In contrast, application developers are interested in making models suitable for actual products, which involves optimizing a model for faster inference and adapting a model to various platforms (e.g., C++ and Python). In this work, to fill the gap between the two, we establish an effective procedure for optimizing a PyTorch-based research-oriented model for deployment, taking ESPnet, a widely used toolkit for speech processing, as an instance. We introduce different techniques to ESPnet, including converting a model into an ONNX format, fusing nodes in a graph, and quantizing parameters, which lead to approximately 1.3-2$\times$ speedup in various tasks (i.e., ASR, TTS, speech translation, and spoken language understanding) while keeping its performance without any additional training. Our ESPnet-ONNX will be publicly available at https://github.com/espnet/espnet_onnx

* Accepted to APSIPA ASC 2022

View paper on

Share this with someone who'll enjoy it:

Title:ESPnet-ONNX: Bridging a Gap Between Research and Production

Paper and Code