Xinyuan Chen

GMG: A Video Prediction Method Based on Global Focus and Motion Guided

Mar 14, 2025

MouseGPT: A Large-scale Vision-Language Model for Mouse Behavior Analysis

Mar 13, 2025

TimeStep Master: Asymmetrical Mixture of Timestep LoRA Experts for Versatile and Efficient Diffusion Models in Vision

Mar 10, 2025

An Egocentric Vision-Language Model based Portable Real-time Smart Assistant

Mar 06, 2025

Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models

Jan 14, 2025

Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model

Dec 30, 2024

VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models

Nov 20, 2024

Cinemo: Consistent and Controllable Image Animation with Motion Diffusion Models

Jul 23, 2024

Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion

Jun 05, 2024

4Diffusion: Multi-view Video Diffusion Model for 4D Generation

May 31, 2024