Li Zhu

StepGuard: Guarding Web Navigation via Single-Step Calibration

Jun 16, 2026
Via arXiv

MultiDx: A Multi-Source Knowledge Integration Framework towards Diagnostic Reasoning

Apr 27, 2026
Via arXiv

AdapTime: Enabling Adaptive Temporal Reasoning in Large Language Models

Apr 27, 2026
Via arXiv

Can Video Diffusion Models Predict Past Frames? Bidirectional Cycle Consistency for Reversible Interpolation

Apr 02, 2026
Via arXiv

Look, Compare and Draw: Differential Query Transformer for Automatic Oil Painting

Mar 29, 2026
Via arXiv

Process Over Outcome: Cultivating Forensic Reasoning for Generalizable Multimodal Manipulation Detection

Mar 02, 2026
Via arXiv

Semantic-Deviation-Anchored Multi-Branch Fusion for Unsupervised Anomaly Detection and Localization in Unstructured Conveyor-Belt Coal Scenes

Feb 07, 2026
Via arXiv

Enhancing Conversational Agents via Task-Oriented Adversarial Memory Adaptation

Jan 29, 2026
Via arXiv

MMDeepResearch-Bench: A Benchmark for Multimodal Deep Research Agents

Jan 18, 2026
Via arXiv

Wavelet-Driven Masked Multiscale Reconstruction for PPG Foundation Models

Jan 18, 2026
Via arXiv