Zheng-Jun Zha

University of Science and Technology of China

Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning

Dec 12, 2024

GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance Grounding

Nov 29, 2024

Leverage Task Context for Object Affordance Ranking

Nov 25, 2024

Unveiling Hidden Details: A RAW Data-Enhanced Paradigm for Real-World Super-Resolution

Nov 21, 2024

$\text{S}^{3}$Mamba: Arbitrary-Scale Super-Resolution via Scaleable State Space Model

Nov 16, 2024

Improved Video VAE for Latent Video Diffusion Model

Nov 10, 2024

EF-3DGS: Event-Aided Free-Trajectory 3D Gaussian Splatting

Oct 20, 2024

MMAR: Towards Lossless Multi-Modal Auto-Regressive Probabilistic Modeling

Oct 15, 2024

Visual-Geometric Collaborative Guidance for Affordance Learning

Oct 15, 2024

ForgeryGPT: Multimodal Large Language Model For Explainable Image Forgery Detection and Localization

Oct 14, 2024