Zhejian Yang

A Survey on Post-training of Large Language Models

Mar 08, 2025
Via arXiv

Towards Robust Multi-UAV Collaboration: MARL with Noise-Resilient Communication and Attention Mechanisms

Mar 04, 2025
Via arXiv

Solving Continual Offline RL through Selective Weights Activation on Aligned Spaces

Oct 21, 2024
Via arXiv

Continual Diffuser (CoD): Mastering Continual Offline Reinforcement Learning with Experience Rehearsal

Sep 04, 2024
Via arXiv