Picture for Haotian Xu

Haotian Xu

Probabilistic Uncertain Reward Model: A Natural Generalization of Bradley-Terry Reward Model

Add code
Mar 28, 2025
Viaarxiv icon

JPDS-NN: Reinforcement Learning-Based Dynamic Task Allocation for Agricultural Vehicle Routing Optimization

Add code
Mar 04, 2025
Viaarxiv icon

Ordered Genetic Algorithm for Entrance Dependent Vehicle Routing Problem in Farms

Add code
Feb 26, 2025
Viaarxiv icon

From System 1 to System 2: A Survey of Reasoning Large Language Models

Add code
Feb 25, 2025
Viaarxiv icon

Training Large Language Models to be Better Rule Followers

Add code
Feb 17, 2025
Viaarxiv icon

VaiBot: Shuttle Between the Instructions and Parameters

Add code
Feb 04, 2025
Viaarxiv icon

RedStar: Does Scaling Long-CoT Data Unlock Better Slow-Reasoning Systems?

Add code
Jan 20, 2025
Viaarxiv icon

Interpretable Contrastive Monte Carlo Tree Search Reasoning

Add code
Oct 02, 2024
Figure 1 for Interpretable Contrastive Monte Carlo Tree Search Reasoning
Figure 2 for Interpretable Contrastive Monte Carlo Tree Search Reasoning
Figure 3 for Interpretable Contrastive Monte Carlo Tree Search Reasoning
Figure 4 for Interpretable Contrastive Monte Carlo Tree Search Reasoning
Viaarxiv icon

Inference for Large Scale Regression Models with Dependent Errors

Add code
Sep 08, 2024
Figure 1 for Inference for Large Scale Regression Models with Dependent Errors
Figure 2 for Inference for Large Scale Regression Models with Dependent Errors
Figure 3 for Inference for Large Scale Regression Models with Dependent Errors
Figure 4 for Inference for Large Scale Regression Models with Dependent Errors
Viaarxiv icon

Navi2Gaze: Leveraging Foundation Models for Navigation and Target Gazing

Add code
Jul 12, 2024
Figure 1 for Navi2Gaze: Leveraging Foundation Models for Navigation and Target Gazing
Figure 2 for Navi2Gaze: Leveraging Foundation Models for Navigation and Target Gazing
Figure 3 for Navi2Gaze: Leveraging Foundation Models for Navigation and Target Gazing
Figure 4 for Navi2Gaze: Leveraging Foundation Models for Navigation and Target Gazing
Viaarxiv icon