Title: Dragon-Alpha&cu32: A Java-based Tensor Computing Framework With its High-Performance CUDA Library

May 15, 2023

Zhiyi Zhang, Pengfei Zhang, Qi Wang

Abstract: Java is a powerful language, but its capabilities have probably not been fully exploited in deep learning. Python-based frameworks (PyTorch, TensorFlow, etc.) are undoubtedly the mainstream compared with Java-based ones, owing to their ease of use, flexibility, and richer ecosystem. Dragon-Alpha is a Java-based tensor computing framework designed for ease of use, scalability, and high performance, aiming to resolve Java's dilemma in deep learning and make the language more effective there. Dragon-Alpha exposes APIs at several levels and can be used as a deep learning framework through its user-friendly high-level APIs. Building on its multi-layer architecture and Java's big-data ecosystem, it has the potential to aggregate computing power across heterogeneous platforms and devices. It provides asynchronous APIs to improve parallelism, and a highly optimized CUDA library, cu32, which adopts unique convolution/deconvolution operators for small feature maps. Experiments show that, compared to PyTorch&cuDNN, Dragon-Alpha&cu32 costs less time and memory (75.38% to 97.32% and 29.2% to 66.4%, respectively) when training typical neural networks (AlexNet, VGG, GoogleNet, ResNet) on CIFAR-10.
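
The abstract mentions asynchronous APIs as a way to improve parallelism. As a rough illustration of that general idea only, the sketch below uses the standard `java.util.concurrent.CompletableFuture` class to launch two independent array operations without blocking and to wait only where the combined result is needed. The class and method names (`AsyncOpsSketch`, `addAsync`, `scaleAsync`, `run`) are hypothetical and are not taken from Dragon-Alpha; the framework's actual API may look quite different.

```java
import java.util.Arrays;
import java.util.concurrent.CompletableFuture;

// Hypothetical sketch: these names are illustrative, not Dragon-Alpha's API.
class AsyncOpsSketch {
    // Starts an element-wise add on a worker thread and returns immediately.
    static CompletableFuture<float[]> addAsync(float[] a, float[] b) {
        return CompletableFuture.supplyAsync(() -> {
            float[] out = new float[a.length];
            for (int i = 0; i < a.length; i++) out[i] = a[i] + b[i];
            return out;
        });
    }

    // Starts an element-wise scale on a worker thread and returns immediately.
    static CompletableFuture<float[]> scaleAsync(float[] a, float k) {
        return CompletableFuture.supplyAsync(() -> {
            float[] out = new float[a.length];
            for (int i = 0; i < a.length; i++) out[i] = a[i] * k;
            return out;
        });
    }

    static float[] run() {
        float[] x = {1f, 2f, 3f};
        float[] y = {4f, 5f, 6f};
        // Both operations are submitted before either result is requested,
        // so they may execute concurrently.
        CompletableFuture<float[]> sum = addAsync(x, y);
        CompletableFuture<float[]> scaled = scaleAsync(x, 2f);
        // Multiply the two results element-wise once both are ready;
        // join() is the only point where the caller blocks.
        return sum.thenCombine(scaled, (s, t) -> {
            float[] out = new float[s.length];
            for (int i = 0; i < s.length; i++) out[i] = s[i] * t[i];
            return out;
        }).join();
    }

    public static void main(String[] args) {
        System.out.println(Arrays.toString(run()));
    }
}
```

The design point is that the caller synchronizes only at the result it actually needs, which leaves the runtime free to overlap the independent operations that precede it.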

* 7 pages. About: deep learning, deep neural networks (DNNs), system architecture, software engineering. The code of Alpha&cu32 and the experimental data can be downloaded from https://github.com/GilgameshXYZ123/Dragon-Alpha
