Zhijie Lin

How Far is Video Generation from World Model: A Physical Law Perspective

Nov 04, 2024

LVD-2M: A Long-take Video Dataset with Temporally Dense Captions

Oct 14, 2024

Loong: Generating Minute-level Long Videos with Autoregressive Language Models

Oct 03, 2024

LoCo: Low-Bit Communication Adaptor for Large-scale Model Training

Jul 05, 2024

PLLaVA : Parameter-free LLaVA Extension from Images to Videos for Video Dense Captioning

Apr 29, 2024

MagicVideo-V2: Multi-Stage High-Aesthetic Video Generation

Jan 09, 2024

ChatAnything: Facetime Chat with LLM-Enhanced Personas

Nov 12, 2023

Towards Garment Sewing Pattern Reconstruction from a Single Image

Nov 07, 2023

BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs

Jul 17, 2023

DATE: Domain Adaptive Product Seeker for E-commerce

Apr 07, 2023