Picture for Huilin Deng

Huilin Deng

IIB-LPO: Latent Policy Optimization via Iterative Information Bottleneck

Add code
Jan 09, 2026
Viaarxiv icon

Boosting the Generalization and Reasoning of Vision Language Models with Curriculum Reinforcement Learning

Add code
Mar 10, 2025
Figure 1 for Boosting the Generalization and Reasoning of Vision Language Models with Curriculum Reinforcement Learning
Figure 2 for Boosting the Generalization and Reasoning of Vision Language Models with Curriculum Reinforcement Learning
Figure 3 for Boosting the Generalization and Reasoning of Vision Language Models with Curriculum Reinforcement Learning
Figure 4 for Boosting the Generalization and Reasoning of Vision Language Models with Curriculum Reinforcement Learning
Viaarxiv icon

VMAD: Visual-enhanced Multimodal Large Language Model for Zero-Shot Anomaly Detection

Add code
Sep 30, 2024
Viaarxiv icon