Ge Zhang

EA-Agent: A Structured Multi-Step Reasoning Agent for Entity Alignment

Apr 13, 2026, via arXiv

Do LLMs Know Tool Irrelevance? Demystifying Structural Alignment Bias in Tool Invocations

Apr 13, 2026, via arXiv

In-Place Test-Time Training

Apr 07, 2026, via arXiv

$OneMillion-Bench: How Far are Language Agents from Human Experts?

Mar 09, 2026, via arXiv

Search More, Think Less: Rethinking Long-Horizon Agentic Search for Efficiency and Generalization

Feb 26, 2026, via arXiv

WorldTravel: A Realistic Multimodal Travel-Planning Benchmark with Tightly Coupled Constraints

Feb 09, 2026, via arXiv

The Optimal Token Baseline: Variance Reduction for Long-Horizon LLM-RL

Feb 06, 2026, via arXiv

BABE: Biology Arena BEnchmark

Feb 05, 2026, via arXiv

Context Forcing: Consistent Autoregressive Video Generation with Long Context

Feb 05, 2026, via arXiv

ConceptMoE: Adaptive Token-to-Concept Compression for Implicit Compute Allocation

Jan 29, 2026, via arXiv