Fan Zhang

University of Bristol

SAM3-LiteText: An Anatomical Study of the SAM3 Text Encoder for Efficient Vision-Language Segmentation

Feb 12, 2026
Via arXiv

SpotAgent: Grounding Visual Geo-localization in Large Vision-Language Models through Agentic Reasoning

Feb 11, 2026
Via arXiv

The CLEF-2026 FinMMEval Lab: Multilingual and Multimodal Evaluation of Financial AI Systems

Feb 11, 2026
Via arXiv

Constructing Industrial-Scale Optimization Modeling Benchmark

Feb 11, 2026
Via arXiv

FreqLens: Interpretable Frequency Attribution for Time Series Forecasting

Feb 09, 2026
Via arXiv

GeoLanG: Geometry-Aware Language-Guided Grasping with Unified RGB-D Multimodal Learning

Feb 04, 2026
Via arXiv

Ebisu: Benchmarking Large Language Models in Japanese Finance

Feb 01, 2026
Via arXiv

EEO-TFV: Escape-Explore Optimizer for Web-Scale Time-Series Forecasting and Vision Analysis

Jan 30, 2026
Via arXiv

A Unified SPD Token Transformer Framework for EEG Classification: Systematic Comparison of Geometric Embeddings

Jan 29, 2026
Via arXiv

iFAN Ecosystem: A Unified AI, Digital Twin, Cyber-Physical Security, and Robotics Environment for Advanced Nuclear Simulation and Operations

Jan 27, 2026
Via arXiv