Picture for Yang Yu

Yang Yu

Tsinghua University

Controlling Large Language Model with Latent Actions

Add code
Mar 27, 2025
Viaarxiv icon

NeoRL-2: Near Real-World Benchmarks for Offline Reinforcement Learning with Extended Realistic Scenarios

Add code
Mar 25, 2025
Viaarxiv icon

VA-AR: Learning Velocity-Aware Action Representations with Mixture of Window Attention

Add code
Mar 14, 2025
Viaarxiv icon

Using Subgraph GNNs for Node Classification:an Overlooked Potential Approach

Add code
Mar 09, 2025
Viaarxiv icon

MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation Task using Discrete Visual Representations

Add code
Mar 06, 2025
Figure 1 for MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation Task using Discrete Visual Representations
Figure 2 for MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation Task using Discrete Visual Representations
Figure 3 for MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation Task using Discrete Visual Representations
Figure 4 for MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation Task using Discrete Visual Representations
Viaarxiv icon

InfiR : Crafting Effective Small Language Models and Multimodal Small Language Models in Reasoning

Add code
Feb 17, 2025
Viaarxiv icon

Behavior-Regularized Diffusion Policy Optimization for Offline Reinforcement Learning

Add code
Feb 07, 2025
Viaarxiv icon

POINTS1.5: Building a Vision-Language Model towards Real World Applications

Add code
Dec 11, 2024
Figure 1 for POINTS1.5: Building a Vision-Language Model towards Real World Applications
Figure 2 for POINTS1.5: Building a Vision-Language Model towards Real World Applications
Figure 3 for POINTS1.5: Building a Vision-Language Model towards Real World Applications
Figure 4 for POINTS1.5: Building a Vision-Language Model towards Real World Applications
Viaarxiv icon

Universal and Context-Independent Triggers for Precise Control of LLM Outputs

Add code
Nov 22, 2024
Viaarxiv icon

Stable Continual Reinforcement Learning via Diffusion-based Trajectory Replay

Add code
Nov 16, 2024
Figure 1 for Stable Continual Reinforcement Learning via Diffusion-based Trajectory Replay
Figure 2 for Stable Continual Reinforcement Learning via Diffusion-based Trajectory Replay
Figure 3 for Stable Continual Reinforcement Learning via Diffusion-based Trajectory Replay
Figure 4 for Stable Continual Reinforcement Learning via Diffusion-based Trajectory Replay
Viaarxiv icon