Chi-Pin Huang

Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning

Jan 14, 2026

TA-Prompting: Enhancing Video Large Language Models for Dense Video Captioning via Temporal Anchors

Jan 06, 2026

Continual Personalization for Diffusion Models

Oct 02, 2025

ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning

Jul 22, 2025

VideoMage: Multi-Subject and Motion Customization of Text-to-Video Diffusion Models

Mar 27, 2025

MotionMatcher: Motion Customization of Text-to-Video Diffusion Models via Motion Feature Matching

Feb 18, 2025

Select and Distill: Selective Dual-Teacher Knowledge Transfer for Continual Learning on Vision-Language Models

Mar 14, 2024

Receler: Reliable Concept Erasing of Text-to-Image Diffusion Models via Lightweight Erasers

Nov 29, 2023