Picture for Jian Tang

Jian Tang

Baidu

MeshMimic: Geometry-Aware Humanoid Motion Learning through 3D Scene Reconstruction

Add code
Feb 17, 2026
Viaarxiv icon

RoboAug: One Annotation to Hundreds of Scenes via Region-Contrastive Data Augmentation for Robotic Manipulation

Add code
Feb 15, 2026
Viaarxiv icon

CRAFT: Adapting VLA Models to Contact-rich Manipulation via Force-aware Curriculum Fine-tuning

Add code
Feb 13, 2026
Viaarxiv icon

TC-IDM: Grounding Video Generation for Executable Zero-shot Robot Motion

Add code
Jan 26, 2026
Viaarxiv icon

Wow, wo, val! A Comprehensive Embodied World Model Evaluation Turing Test

Add code
Jan 07, 2026
Viaarxiv icon

RoboMIND 2.0: A Multimodal, Bimanual Mobile Manipulation Dataset for Generalizable Embodied Intelligence

Add code
Dec 31, 2025
Viaarxiv icon

Real-world Reinforcement Learning from Suboptimal Interventions

Add code
Dec 30, 2025
Viaarxiv icon

Contextual Biasing for LLM-Based ASR with Hotword Retrieval and Reinforcement Learning

Add code
Dec 26, 2025
Figure 1 for Contextual Biasing for LLM-Based ASR with Hotword Retrieval and Reinforcement Learning
Figure 2 for Contextual Biasing for LLM-Based ASR with Hotword Retrieval and Reinforcement Learning
Figure 3 for Contextual Biasing for LLM-Based ASR with Hotword Retrieval and Reinforcement Learning
Figure 4 for Contextual Biasing for LLM-Based ASR with Hotword Retrieval and Reinforcement Learning
Viaarxiv icon

MLA: A Multisensory Language-Action Model for Multimodal Understanding and Forecasting in Robotic Manipulation

Add code
Sep 30, 2025
Figure 1 for MLA: A Multisensory Language-Action Model for Multimodal Understanding and Forecasting in Robotic Manipulation
Figure 2 for MLA: A Multisensory Language-Action Model for Multimodal Understanding and Forecasting in Robotic Manipulation
Figure 3 for MLA: A Multisensory Language-Action Model for Multimodal Understanding and Forecasting in Robotic Manipulation
Figure 4 for MLA: A Multisensory Language-Action Model for Multimodal Understanding and Forecasting in Robotic Manipulation
Viaarxiv icon

WoW: Towards a World omniscient World model Through Embodied Interaction

Add code
Sep 26, 2025
Viaarxiv icon