Xuanjing Huang

Self-Consistency of the Internal Reward Models Improves Self-Rewarding Language Models

Feb 13, 2025
Via arXiv

Multi-Agent Simulator Drives Language Models for Legal Intensive Interaction

Feb 08, 2025
Via arXiv

Predicting Large Language Model Capabilities on Closed-Book QA Tasks Using Only Information Available Prior to Training

Feb 06, 2025
Via arXiv

Toward Relative Positional Encoding in Spiking Transformers

Jan 28, 2025
Via arXiv

Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework

Jan 26, 2025
Via arXiv

Dendritic Localized Learning: Toward Biologically Plausible Algorithm

Jan 17, 2025
Via arXiv

ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use

Jan 07, 2025
Via arXiv

TL-Training: A Task-Feature-Based Framework for Training Large Language Models in Tool Use

Dec 20, 2024
Via arXiv

EvoWiki: Evaluating LLMs on Evolving Knowledge

Dec 18, 2024
Via arXiv

Scaling of Search and Learning: A Roadmap to Reproduce o1 from Reinforcement Learning Perspective

Dec 18, 2024
Via arXiv