Picture for Feng Wang

Feng Wang

ViT-5: Vision Transformers for The Mid-2020s

Add code
Feb 08, 2026
Viaarxiv icon

WorldEdit: Towards Open-World Image Editing with a Knowledge-Informed Benchmark

Add code
Feb 06, 2026
Viaarxiv icon

CoPE: Clipped RoPE as A Scalable Free Lunch for Long Context LLMs

Add code
Feb 05, 2026
Viaarxiv icon

Mitigating Hallucination in Financial Retrieval-Augmented Generation via Fine-Grained Knowledge Verification

Add code
Feb 05, 2026
Viaarxiv icon

VTok: A Unified Video Tokenizer with Decoupled Spatial-Temporal Latents

Add code
Feb 04, 2026
Viaarxiv icon

Spiral RoPE: Rotate Your Rotary Positional Embeddings in the 2D Plane

Add code
Feb 03, 2026
Viaarxiv icon

Kimi K2.5: Visual Agentic Intelligence

Add code
Feb 02, 2026
Viaarxiv icon

A Generalist Foundation Model for Total-body PET/CT Enables Diagnostic Reporting and System-wide Metabolic Profiling

Add code
Jan 19, 2026
Viaarxiv icon

SRAW-Attack: Space-Reweighted Adversarial Warping Attack for SAR Target Recognition

Add code
Jan 15, 2026
Viaarxiv icon

GeoReason: Aligning Thinking And Answering In Remote Sensing Vision-Language Models Via Logical Consistency Reinforcement Learning

Add code
Jan 08, 2026
Viaarxiv icon