Yong Jae Lee

Personal AI Agent for Camera Roll VQA

Jun 03, 2026
Via arXiv

MAOAM: Unified Object and Material Selection with Vision-Language Models

Jun 02, 2026
Via arXiv

Latent Recurrent Transformer: Architecture Exploration, Training Strategies, and Scaling Behavior

May 26, 2026
Via arXiv

Your Embedding Model is SMARTer Than You Think

May 24, 2026
Via arXiv

From Plans to Pixels: Learning to Plan and Orchestrate for Open-Ended Image Editing

May 14, 2026
Via arXiv

Exploration and Exploitation Errors Are Measurable for Language Model Agents

Apr 14, 2026
Via arXiv

MuRF: Unlocking the Multi-Scale Potential of Vision Foundation Models

Mar 26, 2026
Via arXiv

Unified Spatio-Temporal Token Scoring for Efficient Video VLMs

Mar 18, 2026
Via arXiv

Spatially Grounded Long-Horizon Task Planning in the Wild

Mar 13, 2026
Via arXiv

Reasoning-Augmented Representations for Multimodal Retrieval

Feb 06, 2026
Via arXiv