Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Xiaowei Xing

The Adaptive Dynamic Programming Toolbox

Dec 29, 2020

Xiaowei Xing, Dong Eui Chang

Figure 1 for The Adaptive Dynamic Programming Toolbox

Figure 2 for The Adaptive Dynamic Programming Toolbox

Figure 3 for The Adaptive Dynamic Programming Toolbox

Figure 4 for The Adaptive Dynamic Programming Toolbox

Abstract:The paper develops the Adaptive Dynamic Programming Toolbox (ADPT), which solves optimal control problems for continuous-time nonlinear systems. Based on the adaptive dynamic programming technique, the ADPT computes optimal feedback controls from the system dynamics in the model-based working mode, or from measurements of trajectories of the system in the model-free working mode without the requirement of knowledge of the system model. Multiple options are provided such that the ADPT can accommodate various customized circumstances. Compared to other popular software toolboxes for optimal control, the ADPT enjoys its computational precision and speed, which is illustrated with its applications to a satellite attitude control problem.

Via

Access Paper or Ask Questions

Deep Reinforcement Learning Based Robot Arm Manipulation with Efficient Training Data through Simulation

Sep 06, 2019

Xiaowei Xing, Dong Eui Chang

Figure 1 for Deep Reinforcement Learning Based Robot Arm Manipulation with Efficient Training Data through Simulation

Figure 2 for Deep Reinforcement Learning Based Robot Arm Manipulation with Efficient Training Data through Simulation

Figure 3 for Deep Reinforcement Learning Based Robot Arm Manipulation with Efficient Training Data through Simulation

Figure 4 for Deep Reinforcement Learning Based Robot Arm Manipulation with Efficient Training Data through Simulation

Abstract:Deep reinforcement learning trains neural networks using experiences sampled from the replay buffer, which is commonly updated at each time step. In this paper, we propose a method to update the replay buffer adaptively and selectively to train a robot arm to accomplish a suction task in simulation. The response time of the agent is thoroughly taken into account. The state transitions that remain stuck at the boundary of constraint are not stored. The policy trained with our method works better than the one with the common replay buffer update method. The result is demonstrated both by simulation and by experiment with a real robot arm.

* Appearing in The 19th International Conference on Control, Automation and Systems, Jeju, Korea, 2019

Via

Access Paper or Ask Questions