Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ashish Malik

Unlearning via Sparse Representations

Nov 26, 2023

Vedant Shah, Frederik Träuble, Ashish Malik, Hugo Larochelle, Michael Mozer, Sanjeev Arora, Yoshua Bengio, Anirudh Goyal

Abstract:Machine \emph{unlearning}, which involves erasing knowledge about a \emph{forget set} from a trained model, can prove to be costly and infeasible by existing techniques. We propose a nearly compute-free zero-shot unlearning technique based on a discrete representational bottleneck. We show that the proposed technique efficiently unlearns the forget set and incurs negligible damage to the model's performance on the rest of the data set. We evaluate the proposed technique on the problem of \textit{class unlearning} using three datasets: CIFAR-10, CIFAR-100, and LACUNA-100. We compare the proposed technique to SCRUB, a state-of-the-art approach which uses knowledge distillation for unlearning. Across all three datasets, the proposed technique performs as well as, if not better than SCRUB while incurring almost no computational cost.

Via

Access Paper or Ask Questions

Learning Dynamic Bipedal Walking Across Stepping Stones

May 03, 2022

Helei Duan, Ashish Malik, Mohitvishnu S. Gadde, Jeremy Dao, Alan Fern, Jonathan Hurst

Figure 1 for Learning Dynamic Bipedal Walking Across Stepping Stones

Figure 2 for Learning Dynamic Bipedal Walking Across Stepping Stones

Figure 3 for Learning Dynamic Bipedal Walking Across Stepping Stones

Figure 4 for Learning Dynamic Bipedal Walking Across Stepping Stones

Abstract:In this work, we propose a learning approach for 3D dynamic bipedal walking when footsteps are constrained to stepping stones. While recent work has shown progress on this problem, real-world demonstrations have been limited to relatively simple open-loop, perception-free scenarios. Our main contribution is a more advanced learning approach that enables real-world demonstrations, using the Cassie robot, of closed-loop dynamic walking over moderately difficult stepping-stone patterns. Our approach first uses reinforcement learning (RL) in simulation to train a controller that maps footstep commands onto joint actions without any reference motion information. We then learn a model of that controller's capabilities, which enables prediction of feasible footsteps given the robot's current dynamic state. The resulting controller and model are then integrated with a real-time overhead camera system for detecting stepping stone locations. For evaluation, we develop a benchmark set of stepping stone patterns, which are used to test performance in both simulation and the real world. Overall, we demonstrate that sim-to-real learning is extremely promising for enabling dynamic locomotion over stepping stones. We also identify challenges remaining that motivate important future research directions.

* Video will be uploaded later

Via

Access Paper or Ask Questions

Sim-to-Real Learning of Footstep-Constrained Bipedal Dynamic Walking

Mar 15, 2022

Helei Duan, Ashish Malik, Jeremy Dao, Aseem Saxena, Kevin Green, Jonah Siekmann, Alan Fern, Jonathan Hurst

Figure 1 for Sim-to-Real Learning of Footstep-Constrained Bipedal Dynamic Walking

Figure 2 for Sim-to-Real Learning of Footstep-Constrained Bipedal Dynamic Walking

Figure 3 for Sim-to-Real Learning of Footstep-Constrained Bipedal Dynamic Walking

Figure 4 for Sim-to-Real Learning of Footstep-Constrained Bipedal Dynamic Walking

Abstract:Recently, work on reinforcement learning (RL) for bipedal robots has successfully learned controllers for a variety of dynamic gaits with robust sim-to-real demonstrations. In order to maintain balance, the learned controllers have full freedom of where to place the feet, resulting in highly robust gaits. In the real world however, the environment will often impose constraints on the feasible footstep locations, typically identified by perception systems. Unfortunately, most demonstrated RL controllers on bipedal robots do not allow for specifying and responding to such constraints. This missing control interface greatly limits the real-world application of current RL controllers. In this paper, we aim to maintain the robust and dynamic nature of learned gaits while also respecting footstep constraints imposed externally. We develop an RL formulation for training dynamic gait controllers that can respond to specified touchdown locations. We then successfully demonstrate simulation and sim-to-real performance on the bipedal robot Cassie. In addition, we use supervised learning to induce a transition model for accurately predicting the next touchdown locations that the controller can achieve given the robot's proprioceptive observations. This model paves the way for integrating the learned controller into a full-order robot locomotion planner that robustly satisfies both balance and environmental constraints.

* Accepted at ICRA 2022. Video will be uploaded later

Via

Access Paper or Ask Questions

Zero-shot generalization using cascaded system-representations

Dec 11, 2019

Ashish Malik

Figure 1 for Zero-shot generalization using cascaded system-representations

Figure 2 for Zero-shot generalization using cascaded system-representations

Figure 3 for Zero-shot generalization using cascaded system-representations

Figure 4 for Zero-shot generalization using cascaded system-representations

Abstract:This paper proposes a new framework named CASNET to learn control policies that generalize over similar robot types with different morphologies. The proposed framework leverages the structural similarities in robots to learn general-purpose system-representations. These representations can then be used with the choice of learning algorithms to learn policies that generalize over different robots. The learned policies can be used to design general-purpose robot-controllers that are applicable to a wide variety of robots. We demonstrate the effectiveness of the proposed framework by learning control policies for two separate domains: planer manipulation and legged locomotion. The policy learned for planer manipulation is capable of controlling planer manipulators with varying degrees of freedom and link-lengths. For legged locomotion, the learned policy generalizes over different morphologies of the crawling robots. These policies perform on-par with the expert policies trained for individual robot models and achieves zero-shot generalization on models unseen during training, establishing that the final performance of the general policy is bottlenecked by the learning algorithm rather than the proposed framework.

Via

Access Paper or Ask Questions