Picture for Zhucun Xue

Zhucun Xue

Omni-AdaVideoRAG: Omni-Contextual Adaptive Retrieval-Augmented for Efficient Long Video Understanding

Add code
Jun 16, 2025
Viaarxiv icon

UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions

Add code
Jun 16, 2025
Viaarxiv icon

Image Inversion: A Survey from GANs to Diffusion and Beyond

Add code
Feb 17, 2025
Viaarxiv icon

Improving Autoregressive Visual Generation with Cluster-Oriented Token Prediction

Add code
Jan 01, 2025
Figure 1 for Improving Autoregressive Visual Generation with Cluster-Oriented Token Prediction
Figure 2 for Improving Autoregressive Visual Generation with Cluster-Oriented Token Prediction
Figure 3 for Improving Autoregressive Visual Generation with Cluster-Oriented Token Prediction
Figure 4 for Improving Autoregressive Visual Generation with Cluster-Oriented Token Prediction
Viaarxiv icon

EMOv2: Pushing 5M Vision Model Frontier

Add code
Dec 09, 2024
Figure 1 for EMOv2: Pushing 5M Vision Model Frontier
Figure 2 for EMOv2: Pushing 5M Vision Model Frontier
Figure 3 for EMOv2: Pushing 5M Vision Model Frontier
Figure 4 for EMOv2: Pushing 5M Vision Model Frontier
Viaarxiv icon

ADer: A Comprehensive Benchmark for Multi-class Visual Anomaly Detection

Add code
Jun 06, 2024
Viaarxiv icon

Learning Feature Inversion for Multi-class Anomaly Detection under General-purpose COCO-AD Benchmark

Add code
Apr 16, 2024
Viaarxiv icon

Exploring Grounding Potential of VQA-oriented GPT-4V for Zero-shot Anomaly Detection

Add code
Nov 05, 2023
Viaarxiv icon

Reference Twice: A Simple and Unified Baseline for Few-Shot Instance Segmentation

Add code
Jan 10, 2023
Viaarxiv icon

Rethinking Mobile Block for Efficient Neural Models

Add code
Jan 10, 2023
Viaarxiv icon