Huaijun Li

Rewarding Curse: Analyze and Mitigate Reward Modeling Issues for LLM Reasoning

Mar 07, 2025
Via arXiv

SimuCourt: Building Judicial Decision-Making Agents with Real-world Judgement Documents

Mar 05, 2024
Via arXiv

Cutting Off the Head Ends the Conflict: A Mechanism for Interpreting and Mitigating Knowledge Conflicts in Language Models

Feb 28, 2024
Via arXiv