Yang Yu

Tsinghua University

MedHorizon: Towards Long-context Medical Video Understanding in the Wild

May 07, 2026
Via arXiv

Anticipation-VLA: Solving Long-Horizon Embodied Tasks via Anticipation-based Subgoal Generation

May 03, 2026
Via arXiv

Adversarial Imitation Learning with General Function Approximation: Theoretical Analysis and Practical Algorithms

May 03, 2026
Via arXiv

Poster: ClawdGo: Endogenous Security Awareness Training for Autonomous AI Agents

Apr 27, 2026
Via arXiv

On Benchmark Hacking in ML Contests: Modeling, Insights and Design

Apr 24, 2026
Via arXiv

Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Apr 14, 2026
Via arXiv

Adapting 2D Multi-Modal Large Language Model for 3D CT Image Analysis

Apr 11, 2026
Via arXiv

ReinVBC: A Model-based Reinforcement Learning Approach to Vehicle Braking Controller

Apr 06, 2026
Via arXiv

VLGOR: Visual-Language Knowledge Guided Offline Reinforcement Learning for Generalizable Agents

Mar 24, 2026
Via arXiv

Off-Policy Value-Based Reinforcement Learning for Large Language Models

Mar 24, 2026
Via arXiv