Qika Lin

OdysseyArena: Benchmarking Large Language Models For Long-Horizon, Active and Inductive Interactions

Feb 05, 2026
Via arXiv

TIDE: Trajectory-based Diagnostic Evaluation of Test-Time Improvement in LLM Agents

Feb 03, 2026
Via arXiv

FlowSteer: Interactive Agentic Workflow Orchestration via End-to-End Reinforcement Learning

Feb 02, 2026
Via arXiv

From Latent Signals to Reflection Behavior: Tracing Meta-Cognitive Activation Trajectory in R1-Style LLMs

Feb 02, 2026
Via arXiv

S3-CoT: Self-Sampled Succinct Reasoning Enables Efficient Chain-of-Thought LLMs

Feb 02, 2026
Via arXiv

Towards Efficient and Robust Linguistic Emotion Diagnosis for Mental Health via Multi-Agent Instruction Refinement

Jan 20, 2026
Via arXiv

$A^3$-Bench: Benchmarking Memory-Driven Scientific Reasoning via Anchor and Attractor Activation

Jan 14, 2026
Via arXiv

MAXS: Meta-Adaptive Exploration with LLM Agents

Jan 14, 2026
Via arXiv

A Foundation Model for Chest X-ray Interpretation with Grounded Reasoning via Online Reinforcement Learning

Sep 04, 2025
Via arXiv

Graph-R1: Towards Agentic GraphRAG Framework via End-to-end Reinforcement Learning

Jul 29, 2025
Via arXiv