Kim-Hui Yap

S-Agent: Spatial Tool-Use Elicits Reasoning for Spatial Intelligence

Jun 18, 2026

Demystifying Data Organization for Enhanced LLM Training

May 28, 2026

NTIRE 2026 Challenge on Bitstream-Corrupted Video Restoration: Methods and Results

Apr 09, 2026

Mixture-of-Modality-Experts with Holistic Token Learning for Fine-Grained Multimodal Visual Analytics in Driver Action Recognition

Apr 07, 2026

Accelerating Diffusion-based Video Editing via Heterogeneous Caching: Beyond Full Computing at Sampled Denoising Timestep

Mar 25, 2026

DAOS: A Multimodal In-cabin Behavior Monitoring with Driver Action-Object Synergy Dataset

Jan 17, 2026

MuseAgent-1: Interactive Grounded Multimodal Understanding of Music Scores and Performance Audio

Jan 17, 2026

Building Egocentric Procedural AI Assistant: Methods, Benchmarks, and Challenges

Nov 17, 2025

Towards Blind Bitstream-corrupted Video Recovery via a Visual Foundation Model-driven Framework

Jul 30, 2025

SSH-Net: A Self-Supervised and Hybrid Network for Noisy Image Watermark Removal

May 08, 2025