Picture for Zheng-Jun Zha

Zheng-Jun Zha

University of Science and Technology of China

HERO: Human Reaction Generation from Videos

Add code
Mar 11, 2025
Viaarxiv icon

Pixel to Gaussian: Ultra-Fast Continuous Super-Resolution with 2D Gaussian Modeling

Add code
Mar 09, 2025
Viaarxiv icon

Get In Video: Add Anything You Want to the Video

Add code
Mar 08, 2025
Viaarxiv icon

WeGen: A Unified Model for Interactive Multimodal Generation as We Chat

Add code
Mar 03, 2025
Viaarxiv icon

Directing Mamba to Complex Textures: An Efficient Texture-Aware State Space Model for Image Restoration

Add code
Jan 27, 2025
Figure 1 for Directing Mamba to Complex Textures: An Efficient Texture-Aware State Space Model for Image Restoration
Figure 2 for Directing Mamba to Complex Textures: An Efficient Texture-Aware State Space Model for Image Restoration
Figure 3 for Directing Mamba to Complex Textures: An Efficient Texture-Aware State Space Model for Image Restoration
Figure 4 for Directing Mamba to Complex Textures: An Efficient Texture-Aware State Space Model for Image Restoration
Viaarxiv icon

VanGogh: A Unified Multimodal Diffusion-based Framework for Video Colorization

Add code
Jan 16, 2025
Viaarxiv icon

RAIN: Real-time Animation of Infinite Video Stream

Add code
Dec 27, 2024
Figure 1 for RAIN: Real-time Animation of Infinite Video Stream
Figure 2 for RAIN: Real-time Animation of Infinite Video Stream
Figure 3 for RAIN: Real-time Animation of Infinite Video Stream
Figure 4 for RAIN: Real-time Animation of Infinite Video Stream
Viaarxiv icon

SafeCFG: Redirecting Harmful Classifier-Free Guidance for Safe Generation

Add code
Dec 20, 2024
Figure 1 for SafeCFG: Redirecting Harmful Classifier-Free Guidance for Safe Generation
Figure 2 for SafeCFG: Redirecting Harmful Classifier-Free Guidance for Safe Generation
Figure 3 for SafeCFG: Redirecting Harmful Classifier-Free Guidance for Safe Generation
Figure 4 for SafeCFG: Redirecting Harmful Classifier-Free Guidance for Safe Generation
Viaarxiv icon

Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning

Add code
Dec 12, 2024
Viaarxiv icon

GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance Grounding

Add code
Nov 29, 2024
Viaarxiv icon