Dacheng Tao

JD Explore Academy, JD.com, China

Stability and Generalization for Distributed SGDA

Nov 14, 2024
Via arXiv

Prompt Tuning with Diffusion for Few-Shot Pre-trained Policy Generalization

Nov 02, 2024

Task-Aware Harmony Multi-Task Decision Transformer for Offline Reinforcement Learning

Nov 02, 2024

Communication Learning in Multi-Agent Systems from Graph Modeling Perspective

Nov 01, 2024

Offline Behavior Distillation

Oct 30, 2024

Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging

Oct 29, 2024

Foundation Models for Remote Sensing and Earth Observation: A Survey

Oct 22, 2024

Solving Continual Offline RL through Selective Weights Activation on Aligned Spaces

Oct 21, 2024

SurgeryV2: Bridging the Gap Between Model Merging and Multi-Task Learning with Deep Representation Surgery

Oct 18, 2024

Learning from Imperfect Data: Towards Efficient Knowledge Distillation of Autoregressive Language Models for Text-to-SQL

Oct 15, 2024