Picture for Chung-Ching Lin

Chung-Ching Lin

GenXD: Generating Any 3D and 4D Scenes

Add code
Nov 05, 2024
Figure 1 for GenXD: Generating Any 3D and 4D Scenes
Figure 2 for GenXD: Generating Any 3D and 4D Scenes
Figure 3 for GenXD: Generating Any 3D and 4D Scenes
Figure 4 for GenXD: Generating Any 3D and 4D Scenes
Viaarxiv icon

SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation

Add code
Oct 30, 2024
Figure 1 for SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation
Figure 2 for SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation
Figure 3 for SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation
Figure 4 for SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation
Viaarxiv icon

MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities

Add code
Aug 01, 2024
Figure 1 for MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities
Figure 2 for MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities
Figure 3 for MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities
Figure 4 for MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities
Viaarxiv icon

IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation

Add code
Jul 15, 2024
Figure 1 for IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation
Figure 2 for IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation
Figure 3 for IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation
Figure 4 for IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation
Viaarxiv icon

Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation

Add code
Jun 11, 2024
Viaarxiv icon

A Unified Gaussian Process for Branching and Nested Hyperparameter Optimization

Add code
Jan 19, 2024
Viaarxiv icon

MM-Narrator: Narrating Long-form Videos with Multimodal In-Context Learning

Add code
Nov 29, 2023
Viaarxiv icon

MM-VID: Advancing Video Understanding with GPT-4V

Add code
Oct 30, 2023
Viaarxiv icon

Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation

Add code
Oct 12, 2023
Viaarxiv icon

The Dawn of LMMs: Preliminary Explorations with GPT-4V(ision)

Add code
Oct 11, 2023
Viaarxiv icon