Guang Shi

Context Unrolling in Omni Models

Apr 23, 2026

Seedance 2.0: Advancing Video Generation for World Complexity

Apr 15, 2026

Seedance 1.5 pro: A Native Audio-Visual Joint Generation Foundation Model

Dec 23, 2025

Virtual Width Networks

Nov 17, 2025

Depth Anything 3: Recovering the Visual Space from Any Views

Nov 13, 2025

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

Nov 12, 2025

Understanding Transformer from the Perspective of Associative Memory

May 26, 2025

Emerging Properties in Unified Multimodal Pretraining

May 20, 2025

Seed1.5-VL Technical Report

May 11, 2025

UI-TARS: Pioneering Automated GUI Interaction with Native Agents

Jan 21, 2025