Zhibo Chen

OmniCaptioner: One Captioner to Rule Them All

Apr 09, 2025
Via arXiv

Q-Adapt: Adapting LMM for Visual Quality Assessment with Progressive Instruction Tuning

Apr 02, 2025
Via arXiv

Hybrid Agents for Image Restoration

Mar 13, 2025
Via arXiv

StreamMind: Unlocking Full Frame Rate Streaming Video Dialogue through Event-Gated Cognition

Mar 08, 2025
Via arXiv

Q&C: When Quantization Meets Cache in Efficient Image Generation

Mar 04, 2025
Via arXiv

InternVQA: Advancing Compressed Video Quality Assessment with Distilling Large Foundation Model

Feb 26, 2025
Via arXiv

AR4D: Autoregressive 4D Generation from Monocular Videos

Jan 03, 2025
Via arXiv

Semantics Disentanglement and Composition for Versatile Codec toward both Human-eye Perception and Machine Vision Task

Dec 24, 2024
Via arXiv

GSemSplat: Generalizable Semantic 3D Gaussian Splatting from Uncalibrated Image Pairs

Dec 22, 2024
Via arXiv

TIV-Diffusion: Towards Object-Centric Movement for Text-driven Image to Video Generation

Dec 13, 2024
Via arXiv