Picture for Xiaolong Li

Xiaolong Li

Sherman

3D-Layout-R1: Structured Reasoning for Language-Instructed Spatial Editing

Add code
Mar 23, 2026
Viaarxiv icon

TokenCom: Vision-Language Model for Multimodal and Multitask Token Communications

Add code
Feb 28, 2026
Viaarxiv icon

Rigidity-Based Multi-Finger Coordination for Precise In-Hand Manipulation of Force-Sensitive Objects

Add code
Feb 15, 2026
Viaarxiv icon

Think Fast and Slow: Step-Level Cognitive Depth Adaptation for LLM Agents

Add code
Feb 13, 2026
Viaarxiv icon

The Script is All You Need: An Agentic Framework for Long-Horizon Dialogue-to-Cinematic Video Generation

Add code
Jan 27, 2026
Viaarxiv icon

HY-Motion 1.0: Scaling Flow Matching Models for Text-To-Motion Generation

Add code
Dec 29, 2025
Viaarxiv icon

Too Good to be Bad: On the Failure of LLMs to Role-Play Villains

Add code
Nov 12, 2025
Viaarxiv icon

BatonVoice: An Operationalist Framework for Enhancing Controllable Speech Synthesis with Linguistic Intelligence from LLMs

Add code
Sep 30, 2025
Viaarxiv icon

3D Aware Region Prompted Vision Language Model

Add code
Sep 16, 2025
Figure 1 for 3D Aware Region Prompted Vision Language Model
Figure 2 for 3D Aware Region Prompted Vision Language Model
Figure 3 for 3D Aware Region Prompted Vision Language Model
Figure 4 for 3D Aware Region Prompted Vision Language Model
Viaarxiv icon

RLVMR: Reinforcement Learning with Verifiable Meta-Reasoning Rewards for Robust Long-Horizon Agents

Add code
Jul 30, 2025
Figure 1 for RLVMR: Reinforcement Learning with Verifiable Meta-Reasoning Rewards for Robust Long-Horizon Agents
Figure 2 for RLVMR: Reinforcement Learning with Verifiable Meta-Reasoning Rewards for Robust Long-Horizon Agents
Figure 3 for RLVMR: Reinforcement Learning with Verifiable Meta-Reasoning Rewards for Robust Long-Horizon Agents
Figure 4 for RLVMR: Reinforcement Learning with Verifiable Meta-Reasoning Rewards for Robust Long-Horizon Agents
Viaarxiv icon