Jian Xie

Exploring the Inquiry-Diagnosis Relationship with Advanced Patient Simulators

Jan 16, 2025

Baichuan4-Finance Technical Report

Dec 17, 2024

Technical Report: Enhancing LLM Reasoning with Reward-guided Tree Search

Nov 18, 2024

AAAR-1.0: Assessing AI's Potential to Assist Research

Oct 29, 2024

Revealing the Barriers of Language Agents in Planning

Oct 16, 2024

Boosting Deductive Reasoning with Step Signals In RLHF

Oct 12, 2024

Uncertainty-aware Reward Model: Teaching Reward Models to Know What is Unknown

Oct 01, 2024

Sibyl: Simple yet Effective Agent Framework for Complex Real-world Reasoning

Jul 16, 2024

Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities

Jul 10, 2024

3D-Properties: Identifying Challenges in DPO and Charting a Path Forward

Jun 11, 2024