Picture for Jian Xie

Jian Xie

Technical Report: Enhancing LLM Reasoning with Reward-guided Tree Search

Add code
Nov 18, 2024
Viaarxiv icon

AAAR-1.0: Assessing AI's Potential to Assist Research

Add code
Oct 29, 2024
Viaarxiv icon

Revealing the Barriers of Language Agents in Planning

Add code
Oct 16, 2024
Figure 1 for Revealing the Barriers of Language Agents in Planning
Figure 2 for Revealing the Barriers of Language Agents in Planning
Figure 3 for Revealing the Barriers of Language Agents in Planning
Figure 4 for Revealing the Barriers of Language Agents in Planning
Viaarxiv icon

Boosting Deductive Reasoning with Step Signals In RLHF

Add code
Oct 12, 2024
Viaarxiv icon

Uncertainty-aware Reward Model: Teaching Reward Models to Know What is Unknown

Add code
Oct 01, 2024
Viaarxiv icon

Sibyl: Simple yet Effective Agent Framework for Complex Real-world Reasoning

Add code
Jul 16, 2024
Viaarxiv icon

Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities

Add code
Jul 10, 2024
Figure 1 for Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities
Figure 2 for Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities
Figure 3 for Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities
Figure 4 for Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities
Viaarxiv icon

3D-Properties: Identifying Challenges in DPO and Charting a Path Forward

Add code
Jun 11, 2024
Figure 1 for 3D-Properties: Identifying Challenges in DPO and Charting a Path Forward
Figure 2 for 3D-Properties: Identifying Challenges in DPO and Charting a Path Forward
Figure 3 for 3D-Properties: Identifying Challenges in DPO and Charting a Path Forward
Figure 4 for 3D-Properties: Identifying Challenges in DPO and Charting a Path Forward
Viaarxiv icon

SPO: Multi-Dimensional Preference Sequential Alignment With Implicit Reward Modeling

Add code
May 21, 2024
Viaarxiv icon

From Persona to Personalization: A Survey on Role-Playing Language Agents

Add code
Apr 28, 2024
Viaarxiv icon