Picture for Yi-Hsuan Tsai

Yi-Hsuan Tsai

What Changed and What Could Have Changed? State-Change Counterfactuals for Procedure-Aware Video Representation Learning

Add code
Mar 27, 2025
Viaarxiv icon

uLayout: Unified Room Layout Estimation for Perspective and Panoramic Images

Add code
Mar 27, 2025
Viaarxiv icon

Exemplar Masking for Multimodal Incremental Learning

Add code
Dec 12, 2024
Viaarxiv icon

Delve into Visual Contrastive Decoding for Hallucination Mitigation of Large Vision-Language Models

Add code
Dec 09, 2024
Viaarxiv icon

Ranking-aware adapter for text-driven image ordering with CLIP

Add code
Dec 09, 2024
Viaarxiv icon

Learning Frame-Wise Emotion Intensity for Audio-Driven Talking-Head Generation

Add code
Sep 29, 2024
Figure 1 for Learning Frame-Wise Emotion Intensity for Audio-Driven Talking-Head Generation
Figure 2 for Learning Frame-Wise Emotion Intensity for Audio-Driven Talking-Head Generation
Figure 3 for Learning Frame-Wise Emotion Intensity for Audio-Driven Talking-Head Generation
Figure 4 for Learning Frame-Wise Emotion Intensity for Audio-Driven Talking-Head Generation
Viaarxiv icon

Self-training Room Layout Estimation via Geometry-aware Ray-casting

Add code
Jul 21, 2024
Figure 1 for Self-training Room Layout Estimation via Geometry-aware Ray-casting
Figure 2 for Self-training Room Layout Estimation via Geometry-aware Ray-casting
Figure 3 for Self-training Room Layout Estimation via Geometry-aware Ray-casting
Figure 4 for Self-training Room Layout Estimation via Geometry-aware Ray-casting
Viaarxiv icon

Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts

Add code
Jul 10, 2024
Figure 1 for Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts
Figure 2 for Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts
Figure 3 for Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts
Figure 4 for Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts
Viaarxiv icon

Gaga: Group Any Gaussians via 3D-aware Memory Bank

Add code
Apr 11, 2024
Figure 1 for Gaga: Group Any Gaussians via 3D-aware Memory Bank
Figure 2 for Gaga: Group Any Gaussians via 3D-aware Memory Bank
Figure 3 for Gaga: Group Any Gaussians via 3D-aware Memory Bank
Figure 4 for Gaga: Group Any Gaussians via 3D-aware Memory Bank
Viaarxiv icon

PTT: Point-Trajectory Transformer for Efficient Temporal 3D Object Detection

Add code
Dec 13, 2023
Viaarxiv icon