Picture for Yuyin Zhou

Yuyin Zhou

Kestrel: Grounding Self-Refinement for LVLM Hallucination Mitigation

Add code
Mar 17, 2026
Viaarxiv icon

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Add code
Mar 17, 2026
Viaarxiv icon

VecGlypher: Unified Vector Glyph Generation with Language Models

Add code
Feb 25, 2026
Viaarxiv icon

EntropyPrune: Matrix Entropy Guided Visual Token Pruning for Multimodal Large Language Models

Add code
Feb 19, 2026
Viaarxiv icon

What if Agents Could Imagine? Reinforcing Open-Vocabulary HOI Comprehension through Generation

Add code
Feb 12, 2026
Viaarxiv icon

OpenVision 3: A Family of Unified Visual Encoder for Both Understanding and Generation

Add code
Jan 21, 2026
Viaarxiv icon

Controllable Layered Image Generation for Real-World Editing

Add code
Jan 21, 2026
Viaarxiv icon

All You Need Are Random Visual Tokens? Demystifying Token Pruning in VLLMs

Add code
Dec 08, 2025
Viaarxiv icon

MedVLSynther: Synthesizing High-Quality Visual Question Answering from Medical Documents with Generator-Verifier LMMs

Add code
Oct 29, 2025
Viaarxiv icon

GauSSmart: Enhanced 3D Reconstruction through 2D Foundation Models and Geometric Filtering

Add code
Oct 16, 2025
Viaarxiv icon