An Zhang

The Emergence of Abstract Thought in Large Language Models Beyond Any Language

Jun 11, 2025

On Reasoning Strength Planning in Large Reasoning Models

Jun 10, 2025

RSafe: Incentivizing proactive reasoning to build robust and adaptive LLM safeguards

Jun 09, 2025

AlphaSteer: Learning Refusal Steering with Principled Null-Space Constraint

Jun 08, 2025

AgentRecBench: Benchmarking LLM Agent-based Personalized Recommender Systems

May 26, 2025

MPO: Multilingual Safety Alignment via Reward Gap Optimization

May 22, 2025

Search and Refine During Think: Autonomous Retrieval-Augmented Reasoning of LLMs

May 16, 2025

AlphaFuse: Learn ID Embeddings for Sequential Recommendation in Null Space of Language Embeddings

Apr 29, 2025

AdaSteer: Your Aligned LLM is Inherently an Adaptive Jailbreak Defender

Apr 13, 2025

SafeMLRM: Demystifying Safety in Multi-modal Large Reasoning Models

Apr 09, 2025