Fan Zhang

University of Bristol

Agglomerating Large Vision Encoders via Distillation for VFSS Segmentation

Apr 03, 2025
Via arXiv

VISTA: Unsupervised 2D Temporal Dependency Representations for Time Series Anomaly Detection

Apr 03, 2025
Via arXiv

VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness

Mar 27, 2025
Via arXiv

Large Language Model Agent: A Survey on Methodology, Applications and Challenges

Mar 27, 2025
Via arXiv

GIViC: Generative Implicit Video Compression

Mar 25, 2025
Via arXiv

GAIR: Improving Multimodal Geo-Foundation Model with Geo-Aligned Implicit Representations

Mar 20, 2025
Via arXiv

Unified Enhancement of the Generalization and Robustness of Language Models via Bi-Stage Optimization

Mar 19, 2025
Via arXiv

CCDP: Composition of Conditional Diffusion Policies with Guided Sampling

Mar 19, 2025
Via arXiv

C2D-ISR: Optimizing Attention-based Image Super-resolution from Continuous to Discrete Scales

Mar 17, 2025
Via arXiv

Code-Driven Inductive Synthesis: Enhancing Reasoning Abilities of Large Language Models with Sequences

Mar 17, 2025
Via arXiv