Yihang Chen

I3DM: Implicit 3D-aware Memory Retrieval and Injection for Consistent Video Scene Generation

Mar 24, 2026
Via arXiv

Fingerprinting Deep Neural Networks for Ownership Protection: An Analytical Approach

Mar 22, 2026
Via arXiv

Memento-Skills: Let Agents Design Agents

Mar 19, 2026
Via arXiv

TAME: A Trustworthy Test-Time Evolution of Agent Memory with Systematic Benchmarking

Feb 03, 2026
Via arXiv

Text is All You Need for Vision-Language Model Jailbreaking

Jan 31, 2026
Via arXiv

Advancing Open-source World Models

Jan 28, 2026
Via arXiv

Amplifying Prominent Representations in Multimodal Learning via Variational Dirichlet Process

Oct 23, 2025
Via arXiv

TalkCuts: A Large-Scale Dataset for Multi-Shot Human Speech Video Generation

Oct 08, 2025
Via arXiv

PerPilot: Personalizing VLM-based Mobile Agents via Memory and Exploration

Aug 25, 2025
Via arXiv

DiffCAP: Diffusion-based Cumulative Adversarial Purification for Vision Language Models

Jun 04, 2025
Via arXiv