Picture for Yang Jin

Yang Jin

Learning from Next-Frame Prediction: Autoregressive Video Modeling Encodes Effective Representations

Add code
Dec 24, 2025
Viaarxiv icon

ImplicitRDP: An End-to-End Visual-Force Diffusion Policy with Structural Slow-Fast Learning

Add code
Dec 11, 2025
Viaarxiv icon

ARMADA: Autonomous Online Failure Detection and Human Shared Control Empower Scalable Real-world Deployment and Adaptation

Add code
Oct 02, 2025
Viaarxiv icon

SIME: Enhancing Policy Self-Improvement with Modal-level Exploration

Add code
May 02, 2025
Viaarxiv icon

Pyramidal Flow Matching for Efficient Video Generative Modeling

Add code
Oct 08, 2024
Viaarxiv icon

Boosting Gaze Object Prediction via Pixel-level Supervision from Vision Foundation Model

Add code
Aug 02, 2024
Viaarxiv icon

RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance

Add code
May 23, 2024
Viaarxiv icon

DiffGen: Robot Demonstration Generation via Differentiable Physics Simulation, Differentiable Rendering, and Vision-Language Model

Add code
May 12, 2024
Figure 1 for DiffGen: Robot Demonstration Generation via Differentiable Physics Simulation, Differentiable Rendering, and Vision-Language Model
Figure 2 for DiffGen: Robot Demonstration Generation via Differentiable Physics Simulation, Differentiable Rendering, and Vision-Language Model
Figure 3 for DiffGen: Robot Demonstration Generation via Differentiable Physics Simulation, Differentiable Rendering, and Vision-Language Model
Figure 4 for DiffGen: Robot Demonstration Generation via Differentiable Physics Simulation, Differentiable Rendering, and Vision-Language Model
Viaarxiv icon

Harder Tasks Need More Experts: Dynamic Routing in MoE Models

Add code
Mar 12, 2024
Viaarxiv icon

TransGOP: Transformer-Based Gaze Object Prediction

Add code
Feb 21, 2024
Figure 1 for TransGOP: Transformer-Based Gaze Object Prediction
Figure 2 for TransGOP: Transformer-Based Gaze Object Prediction
Figure 3 for TransGOP: Transformer-Based Gaze Object Prediction
Figure 4 for TransGOP: Transformer-Based Gaze Object Prediction
Viaarxiv icon