Picture for Qi Wang

Qi Wang

Lattice

From Bounding Boxes to Visual Reasoning: An On-Policy Data Annotation Tool for Vision-Language Models

Add code
Jun 17, 2026
Viaarxiv icon

Automated jailbreak attack targeting multiple defense strategies

Add code
Jun 15, 2026
Viaarxiv icon

An Embodied Simulation Platform, Benchmark, and Data-Efficient Augmentation Framework for Wet-Lab Robotics

Add code
Jun 11, 2026
Viaarxiv icon

TRACE: A Unified Rollout Budget Allocation Framework for Efficient Agentic Reinforcement Learning

Add code
Jun 09, 2026
Viaarxiv icon

BiNSGPS: Geometry Problem Solving via Bidirectional Neuro-Symbolic Interaction

Add code
Jun 03, 2026
Viaarxiv icon

Policy and World Modeling Co-Training for Language Agents

Add code
Jun 01, 2026
Viaarxiv icon

RLVR without Ineffective Samples: Group Prioritized Off-Policy Optimization for LLM Reasoning

Add code
May 31, 2026
Viaarxiv icon

Efficient On-policy Visual-RL via Stochastic Decoupled Policy Gradient

Add code
May 26, 2026
Viaarxiv icon

StreamChar: Long-Horizon Streaming Character Audio-Video Generation with Decoupled Orchestration

Add code
May 25, 2026
Viaarxiv icon

OctCGS: Octree-Contextual Gaussian Splatting with Explicit Multi-Order Propagation Modeling for Channel Knowledge Map Construction

Add code
May 21, 2026
Viaarxiv icon