Tie Zhang

Communication-Efficient Distributed Learning with Local Immediate Error Compensation

Feb 19, 2024

Yifei Cheng, Li Shen, Linli Xu, Xun Qian, Shiwei Wu, Yiming Zhou, Tie Zhang, Dacheng Tao, Enhong Chen

Abstract: Gradient compression with error compensation has attracted significant attention as a way to reduce the heavy communication overhead in distributed learning. However, existing compression methods either perform only unidirectional compression in each iteration, which incurs a higher communication cost, or perform bidirectional compression, which slows convergence. In this work, we propose the Local Immediate Error Compensated SGD (LIEC-SGD) optimization algorithm, which breaks these bottlenecks by combining bidirectional compression with carefully designed compensation approaches. Specifically, the bidirectional compression technique reduces the communication cost, and the compensation technique applies the local compression error to the model update immediately while maintaining only the global error variable on the server throughout the iterations to boost its efficacy. Theoretically, we prove that LIEC-SGD is superior to previous works in either the convergence rate or the communication cost, which indicates that LIEC-SGD inherits the advantages of both unidirectional and bidirectional compression. Finally, experiments on training deep neural networks validate the effectiveness of the proposed LIEC-SGD algorithm.
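The abstract does not give the update rules, so the following is only a minimal sketch of generic error-compensated top-k compression (classic error feedback on a single worker), not LIEC-SGD itself. The names `top_k` and `compress_with_feedback` and the choice of a top-k compressor are assumptions made for illustration.

```python
from typing import List, Tuple


def top_k(vec: List[float], k: int) -> List[float]:
    """Keep the k largest-magnitude entries of vec and zero out the rest."""
    order = sorted(range(len(vec)), key=lambda i: abs(vec[i]), reverse=True)
    kept = set(order[:k])
    return [v if i in kept else 0.0 for i, v in enumerate(vec)]


def compress_with_feedback(
    grad: List[float], error: List[float], k: int
) -> Tuple[List[float], List[float]]:
    """Generic error feedback (illustrative, not the paper's scheme).

    The residual left over from the previous compression is added back
    before compressing, and whatever is dropped this time is stored as
    the new residual.
    """
    corrected = [g + e for g, e in zip(grad, error)]
    sent = top_k(corrected, k)
    new_error = [c - s for c, s in zip(corrected, sent)]
    return sent, new_error


# Two steps with k=1: the entry dropped in step 1 is carried into step 2.
sent1, err1 = compress_with_feedback([1.0, -3.0, 0.5], [0.0, 0.0, 0.0], k=1)
sent2, err2 = compress_with_feedback([1.0, 0.0, 0.5], err1, k=1)
```

In this sketch only `sent1` and `sent2` would be communicated; the residuals stay local. How LIEC-SGD splits compensation between workers and server is described in the paper, not here.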

Time-Optimal Path Tracking for Industrial Robots: A Dynamic Model-Free Reinforcement Learning Approach

Aug 03, 2019

Jiadong Xiao, Lin Li, Tie Zhang, Yanbiao Zou

Abstract: In pursuit of the time-optimal path tracking (TOPT) trajectory of a robot manipulator along a preset path, a previously identified robot dynamic model is usually used to obtain the optimal trajectory required for perfect tracking. However, due to the inevitable model-plant mismatch, there may be a large error between the measured torques and the torques calculated by the dynamic model, which causes the obtained trajectory to be suboptimal or even infeasible because it exceeds the given limits. This paper presents a TOPT-oriented SARSA algorithm (TOPTO-SARSA) and a two-step method for finding the time-optimal motion and ensuring its feasibility. First, TOPTO-SARSA is used to find a safe trajectory that satisfies the kinematic constraints through interaction between the reinforcement learning agent and a kinematic model. Second, TOPTO-SARSA is used to find the optimal trajectory through interaction between the agent and the real world, ensuring that the measured torques satisfy the given limits at the last interaction. The effectiveness of the proposed algorithm has been verified through experiments on a 6-DOF robot manipulator.

* 8 pages, 9 figures
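For readers unfamiliar with SARSA, the textbook tabular update that TOPTO-SARSA presumably builds on can be sketched as follows. This is the standard on-policy rule only; the state and action encoding, the default learning rate and discount, and the function name `sarsa_update` are assumptions, and none of the TOPT-specific modifications from the paper are reproduced.

```python
from collections import defaultdict
from typing import DefaultDict, Hashable, Tuple

QTable = DefaultDict[Tuple[Hashable, Hashable], float]


def sarsa_update(
    q: QTable,
    state: Hashable,
    action: Hashable,
    reward: float,
    next_state: Hashable,
    next_action: Hashable,
    alpha: float = 0.1,
    gamma: float = 0.9,
) -> float:
    """Standard SARSA step: Q(s,a) += alpha * (r + gamma * Q(s',a') - Q(s,a)).

    The target uses the action actually chosen in the next state, which is
    what makes the method on-policy.
    """
    target = reward + gamma * q[(next_state, next_action)]
    q[(state, action)] += alpha * (target - q[(state, action)])
    return q[(state, action)]


# Toy usage with hypothetical state and action labels.
q: QTable = defaultdict(float)
q[("s1", "a1")] = 1.0
updated = sarsa_update(q, "s0", "a0", 1.0, "s1", "a1", alpha=0.5, gamma=0.5)
```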

Reinforcement Learning for Robotic Time-optimal Path Tracking Using Prior Knowledge

Jun 30, 2019

Jiadong Xiao, Lin Li, Yanbiao Zou, Tie Zhang

Abstract: Time-optimal path tracking, as a significant tool for industrial robots, has attracted the attention of numerous researchers. In most time-optimal path tracking problems, the actuator torque constraints are assumed to be conservative, which ignores the motor characteristics; i.e., the actuator torque constraints are velocity-dependent, and the relationship between torque and velocity is piecewise linear. Because accounting for the motor characteristics makes the problem harder to solve, this study proposes an improved Q-learning algorithm for robotic time-optimal path tracking that uses prior knowledge. After considering the limitations of the Q-learning algorithm, an improved action-value function is proposed to improve the convergence rate. The proposed algorithms use the idea of reward and penalty, rewarding actions that satisfy the constraint conditions and penalizing actions that violate them, to finally obtain a time-optimal trajectory that satisfies the constraint conditions. The effectiveness of the algorithms is verified by experiments.

* 27 pages, 14 figures, 4 tables
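The reward-and-penalty idea with a velocity-dependent, piecewise-linear torque limit can be illustrated with a small sketch. Everything below is hypothetical: the limit shape (flat up to a rated speed, then falling linearly to zero), all numeric values, and the function names are assumptions, and the update shown is plain tabular Q-learning rather than the improved action-value function proposed in the paper.

```python
from typing import Dict, Hashable, Sequence, Tuple


def torque_limit(velocity: float, rated_speed: float = 1.0,
                 max_speed: float = 2.0, peak_torque: float = 10.0) -> float:
    """Hypothetical piecewise-linear limit: constant, then linear fall-off."""
    v = abs(velocity)
    if v <= rated_speed:
        return peak_torque
    if v >= max_speed:
        return 0.0
    return peak_torque * (max_speed - v) / (max_speed - rated_speed)


def shaped_reward(torque: float, velocity: float,
                  step_reward: float = 1.0, penalty: float = -10.0) -> float:
    """Reward actions inside the torque limit, penalize those outside it."""
    return step_reward if abs(torque) <= torque_limit(velocity) else penalty


def q_update(q: Dict[Tuple[Hashable, Hashable], float], s: Hashable,
             a: Hashable, r: float, s_next: Hashable,
             actions: Sequence[Hashable],
             alpha: float = 0.1, gamma: float = 0.9) -> float:
    """Plain Q-learning step: bootstrap from the best next action."""
    best_next = max(q.get((s_next, b), 0.0) for b in actions)
    old = q.get((s, a), 0.0)
    q[(s, a)] = old + alpha * (r + gamma * best_next - old)
    return q[(s, a)]


# An action that exceeds the limit at velocity 1.5 is penalized.
q: Dict[Tuple[Hashable, Hashable], float] = {}
r = shaped_reward(torque=6.0, velocity=1.5)
value = q_update(q, "s0", "a0", r, "s1", ["a0", "a1"], alpha=0.5, gamma=0.5)
```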

ISIC 2018-A Method for Lesion Segmentation

Jul 21, 2018

Hongdiao Wen, Rongjian Xu, Tie Zhang

Abstract: Our team participates in Task 1 of the challenge, Lesion Boundary Segmentation, and uses a combined network: one component, named updcnn net, is designed by ourselves, and the other is an improved VGG 16-layer net. Updcnn net is trained on reduced-size images, while the VGG 16-layer net uses large images. Image enhancement is used to obtain a richer data set. We use boxes in the VGG 16-layer net for local attention regularization to fine-tune the loss function, which can increase the amount of training data and also make the model more robust. In testing, the combined model is applied jointly and achieves good results.
