Picture for Qi Wang

Qi Wang

Lattice

TGRPO :Fine-tuning Vision-Language-Action Model via Trajectory-wise Group Relative Policy Optimization

Add code
Jun 11, 2025
Viaarxiv icon

DynTok: Dynamic Compression of Visual Tokens for Efficient and Effective Video Understanding

Add code
Jun 04, 2025
Viaarxiv icon

What Makes a Good Reasoning Chain? Uncovering Structural Patterns in Long Chain-of-Thought Reasoning

Add code
May 28, 2025
Viaarxiv icon

TUNA: Comprehensive Fine-grained Temporal Understanding Evaluation on Dense Dynamic Videos

Add code
May 26, 2025
Viaarxiv icon

Modality Curation: Building Universal Embeddings for Advanced Multimodal Information Retrieval

Add code
May 26, 2025
Viaarxiv icon

ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World

Add code
May 25, 2025
Viaarxiv icon

Guiding the Experts: Semantic Priors for Efficient and Focused MoE Routing

Add code
May 24, 2025
Viaarxiv icon

LogicCat: A Chain-of-Thought Text-to-SQL Benchmark for Multi-Domain Reasoning Challenges

Add code
May 24, 2025
Viaarxiv icon

DualComp: End-to-End Learning of a Unified Dual-Modality Lossless Compressor

Add code
May 22, 2025
Viaarxiv icon

FLARE: Robot Learning with Implicit World Modeling

Add code
May 21, 2025
Viaarxiv icon