Zhibo Chen

AR4D: Autoregressive 4D Generation from Monocular Videos

Jan 03, 2025
Via arXiv

Semantics Disentanglement and Composition for Versatile Codec toward both Human-eye Perception and Machine Vision Task

Dec 24, 2024
Via arXiv

GSemSplat: Generalizable Semantic 3D Gaussian Splatting from Uncalibrated Image Pairs

Dec 22, 2024
Via arXiv

TIV-Diffusion: Towards Object-Centric Movement for Text-driven Image to Video Generation

Dec 13, 2024
Via arXiv

Light Field Image Quality Assessment With Auxiliary Learning Based on Depthwise and Anglewise Separable Convolutions

Dec 10, 2024
Via arXiv

UniMIC: Towards Universal Multi-modality Perceptual Image Compression

Dec 09, 2024
Via arXiv

LossAgent: Towards Any Optimization Objectives for Image Processing with LLM Agents

Dec 05, 2024
Via arXiv

Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement

Nov 15, 2024
Via arXiv

Towards Defining an Efficient and Expandable File Format for AI-Generated Contents

Oct 15, 2024
Via arXiv

Compositional 3D-aware Video Generation with LLM Director

Aug 31, 2024
Via arXiv