Picture for Jun Xu

Jun Xu

Nankai University

When Personalization Misleads: Understanding and Mitigating Hallucinations in Personalized LLMs

Add code
Jan 16, 2026
Viaarxiv icon

MatchTIR: Fine-Grained Supervision for Tool-Integrated Reasoning via Bipartite Matching

Add code
Jan 15, 2026
Viaarxiv icon

GeoRA: Geometry-Aware Low-Rank Adaptation for RLVR

Add code
Jan 14, 2026
Viaarxiv icon

Long-term Task-oriented Agent: Proactive Long-term Intent Maintenance in Dynamic Environments

Add code
Jan 14, 2026
Viaarxiv icon

Efficient Paths and Dense Rewards: Probabilistic Flow Reasoning for Large Language Models

Add code
Jan 14, 2026
Viaarxiv icon

UserLM-R1: Modeling Human Reasoning in User Language Models with Multi-Reward Reinforcement Learning

Add code
Jan 14, 2026
Viaarxiv icon

Fine-Mem: Fine-Grained Feedback Alignment for Long-Horizon Memory Management

Add code
Jan 13, 2026
Viaarxiv icon

Silence the Judge: Reinforcement Learning with Self-Verifier via Latent Geometric Clustering

Add code
Jan 13, 2026
Viaarxiv icon

Rotation-Robust Regression with Convolutional Model Trees

Add code
Jan 08, 2026
Viaarxiv icon

Sandwich Reasoning: An Answer-Reasoning-Answer Approach for Low-Latency Query Correction

Add code
Jan 07, 2026
Viaarxiv icon