Shiming Xiang

VisBrowse-Bench: Benchmarking Visual-Native Search for Multimodal Browsing Agents

Mar 17, 2026, via arXiv

MeTok: An Efficient Meteorological Tokenization with Hyper-Aligned Group Learning for Precipitation Nowcasting

Mar 14, 2026, via arXiv

PA-Net: Precipitation-Adaptive Mixture-of-Experts for Long-Tail Rainfall Nowcasting

Mar 14, 2026, via arXiv

Beyond Sequential Distance: Inter-Modal Distance Invariant Position Encoding

Mar 11, 2026, via arXiv

SeaVIS: Sound-Enhanced Association for Online Audio-Visual Instance Segmentation

Mar 02, 2026, via arXiv

HVR-Met: A Hypothesis-Verification-Replanning Agentic System for Extreme Weather Diagnosis

Mar 01, 2026, via arXiv

CC-VQA: Conflict- and Correlation-Aware Method for Mitigating Knowledge Conflict in Knowledge-Based Visual Question Answering

Feb 27, 2026, via arXiv

InfEngine: A Self-Verifying and Self-Optimizing Intelligent Engine for Infrared Radiation Computing

Feb 22, 2026, via arXiv

Beyond Next-Token Alignment: Distilling Multimodal Large Language Models via Token Interactions

Feb 10, 2026, via arXiv

Enhanced Graph Transformer with Serialized Graph Tokens

Feb 09, 2026, via arXiv