Wei Zhai

University of Science and Technology of China, China; JD Explore Academy, JD.com, China

VanGogh: A Unified Multimodal Diffusion-based Framework for Video Colorization

Jan 16, 2025
Via arXiv

Deep Learning-Based Feature Fusion for Emotion Analysis and Suicide Risk Differentiation in Chinese Psychological Support Hotlines

Jan 15, 2025
Via arXiv

Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning

Dec 12, 2024
Via arXiv

Event-Based Tracking Any Point with Motion-Augmented Temporal Consistency

Dec 02, 2024
Via arXiv

GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance Grounding

Nov 29, 2024
Via arXiv

Leverage Task Context for Object Affordance Ranking

Nov 25, 2024
Via arXiv

Improved Video VAE for Latent Video Diffusion Model

Nov 10, 2024
Via arXiv

EF-3DGS: Event-Aided Free-Trajectory 3D Gaussian Splatting

Oct 20, 2024
Via arXiv

Visual-Geometric Collaborative Guidance for Affordance Learning

Oct 15, 2024
Via arXiv

MMAR: Towards Lossless Multi-Modal Auto-Regressive Probabilistic Modeling

Oct 15, 2024
Via arXiv