Jiangmiao Pang

ChangingGrounding: 3D Visual Grounding in Changing Scenes

Oct 16, 2025
Via arXiv

Towards Adaptable Humanoid Control via Adaptive Motion Tracking

Oct 16, 2025
Via arXiv

Expertise need not monopolize: Action-Specialized Mixture of Experts for Vision-Language-Action Learning

Oct 16, 2025
Via arXiv

MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning

Sep 26, 2025
Via arXiv

A Vision-Language-Action-Critic Model for Robotic Real-World Reinforcement Learning

Sep 19, 2025
Via arXiv

SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Sep 11, 2025
Via arXiv

F1: A Vision-Language-Action Model Bridging Understanding and Generation to Actions

Sep 09, 2025
Via arXiv

Discrete Diffusion VLA: Bringing Discrete Diffusion to Action Decoding in Vision-Language-Action Policies

Aug 27, 2025
Via arXiv

MeshCoder: LLM-Powered Structured Mesh Code Generation from Point Clouds

Aug 20, 2025
Via arXiv

VFlowOpt: A Token Pruning Framework for LMMs with Visual Information Flow-Guided Optimization

Aug 07, 2025
Via arXiv