Picture for Jing Zhang

Jing Zhang

The University of Sydney, Australia

Residual Diffusion Bridge Model for Image Restoration

Add code
Oct 27, 2025
Viaarxiv icon

PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model

Add code
Oct 16, 2025
Viaarxiv icon

Next-Generation AI-Native Wireless Communications: MCMC-Based Receiver Architectures for Unified Processing

Add code
Oct 02, 2025
Viaarxiv icon

Unlocking Vision-Language Models for Video Anomaly Detection via Fine-Grained Prompting

Add code
Oct 02, 2025
Viaarxiv icon

Beyond Video-to-SFX: Video to Audio Synthesis with Environmentally Aware Speech

Add code
Sep 19, 2025
Figure 1 for Beyond Video-to-SFX: Video to Audio Synthesis with Environmentally Aware Speech
Figure 2 for Beyond Video-to-SFX: Video to Audio Synthesis with Environmentally Aware Speech
Figure 3 for Beyond Video-to-SFX: Video to Audio Synthesis with Environmentally Aware Speech
Figure 4 for Beyond Video-to-SFX: Video to Audio Synthesis with Environmentally Aware Speech
Viaarxiv icon

FMGS-Avatar: Mesh-Guided 2D Gaussian Splatting with Foundation Model Priors for 3D Monocular Avatar Reconstruction

Add code
Sep 18, 2025
Viaarxiv icon

TFANet: Three-Stage Image-Text Feature Alignment Network for Robust Referring Image Segmentation

Add code
Sep 16, 2025
Viaarxiv icon

Patch Progression Masked Autoencoder with Fusion CNN Network for Classifying Evolution Between Two Pairs of 2D OCT Slices

Add code
Aug 27, 2025
Viaarxiv icon

Handling Students Dropouts in an LLM-driven Interactive Online Course Using Language Models

Add code
Aug 24, 2025
Viaarxiv icon

Structural Energy-Guided Sampling for View-Consistent Text-to-3D

Add code
Aug 23, 2025
Viaarxiv icon