Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Haichuan Gao

Learning Invariable Semantical Representation from Language for Extensible Policy Generalization

Jan 26, 2022

Yihan Li, Jinsheng Ren, Tianrun Xu, Tianren Zhang, Haichuan Gao, Feng Chen

Figure 1 for Learning Invariable Semantical Representation from Language for Extensible Policy Generalization

Figure 2 for Learning Invariable Semantical Representation from Language for Extensible Policy Generalization

Figure 3 for Learning Invariable Semantical Representation from Language for Extensible Policy Generalization

Figure 4 for Learning Invariable Semantical Representation from Language for Extensible Policy Generalization

Abstract:Recently, incorporating natural language instructions into reinforcement learning (RL) to learn semantically meaningful representations and foster generalization has caught many concerns. However, the semantical information in language instructions is usually entangled with task-specific state information, which hampers the learning of semantically invariant and reusable representations. In this paper, we propose a method to learn such representations called element randomization, which extracts task-relevant but environment-agnostic semantics from instructions using a set of environments with randomized elements, e.g., topological structures or textures, yet the same language instruction. We theoretically prove the feasibility of learning semantically invariant representations through randomization. In practice, we accordingly develop a hierarchy of policies, where a high-level policy is designed to modulate the behavior of a goal-conditioned low-level policy by proposing subgoals as semantically invariant representations. Experiments on challenging long-horizon tasks show that (1) our low-level policy reliably generalizes to tasks against environment changes; (2) our hierarchical policy exhibits extensible generalization in unseen new tasks that can be decomposed into several solvable sub-tasks; and (3) by storing and replaying language trajectories as succinct policy representations, the agent can complete tasks in a one-shot fashion, i.e., once one successful trajectory has been attained.

Via

Access Paper or Ask Questions

CRIL: Continual Robot Imitation Learning via Generative and Prediction Model

Jul 02, 2021

Chongkai Gao, Haichuan Gao, Shangqi Guo, Tianren Zhang, Feng Chen

Figure 1 for CRIL: Continual Robot Imitation Learning via Generative and Prediction Model

Figure 2 for CRIL: Continual Robot Imitation Learning via Generative and Prediction Model

Figure 3 for CRIL: Continual Robot Imitation Learning via Generative and Prediction Model

Figure 4 for CRIL: Continual Robot Imitation Learning via Generative and Prediction Model

Abstract:Imitation learning (IL) algorithms have shown promising results for robots to learn skills from expert demonstrations. However, they need multi-task demonstrations to be provided at once for acquiring diverse skills, which is difficult in real world. In this work we study how to realize continual imitation learning ability that empowers robots to continually learn new tasks one by one, thus reducing the burden of multi-task IL and accelerating the process of new task learning at the same time. We propose a novel trajectory generation model that employs both a generative adversarial network and a dynamics-aware prediction model to generate pseudo trajectories from all learned tasks in the new task learning process. Our experiments on both simulation and real-world manipulation tasks demonstrate the effectiveness of our method.

Via

Access Paper or Ask Questions