Picture for Keming Lu

Keming Lu

additional authors not shown

AutoLogi: Automated Generation of Logic Puzzles for Evaluating Reasoning Abilities of Large Language Models

Add code
Feb 24, 2025
Viaarxiv icon

Qwen2.5 Technical Report

Add code
Dec 19, 2024
Viaarxiv icon

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Add code
Dec 10, 2024
Figure 1 for ProcessBench: Identifying Process Errors in Mathematical Reasoning
Figure 2 for ProcessBench: Identifying Process Errors in Mathematical Reasoning
Figure 3 for ProcessBench: Identifying Process Errors in Mathematical Reasoning
Figure 4 for ProcessBench: Identifying Process Errors in Mathematical Reasoning
Viaarxiv icon

Aligning Large Language Models via Self-Steering Optimization

Add code
Oct 22, 2024
Viaarxiv icon

A Unified View of Delta Parameter Editing in Post-Trained Large-Scale Models

Add code
Oct 17, 2024
Figure 1 for A Unified View of Delta Parameter Editing in Post-Trained Large-Scale Models
Figure 2 for A Unified View of Delta Parameter Editing in Post-Trained Large-Scale Models
Figure 3 for A Unified View of Delta Parameter Editing in Post-Trained Large-Scale Models
Figure 4 for A Unified View of Delta Parameter Editing in Post-Trained Large-Scale Models
Viaarxiv icon

Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement

Add code
Sep 18, 2024
Viaarxiv icon

Towards a Unified View of Preference Learning for Large Language Models: A Survey

Add code
Sep 04, 2024
Figure 1 for Towards a Unified View of Preference Learning for Large Language Models: A Survey
Figure 2 for Towards a Unified View of Preference Learning for Large Language Models: A Survey
Figure 3 for Towards a Unified View of Preference Learning for Large Language Models: A Survey
Figure 4 for Towards a Unified View of Preference Learning for Large Language Models: A Survey
Viaarxiv icon

Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language Model

Add code
Aug 20, 2024
Figure 1 for Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language Model
Figure 2 for Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language Model
Figure 3 for Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language Model
Figure 4 for Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language Model
Viaarxiv icon

Qwen2 Technical Report

Add code
Jul 16, 2024
Figure 1 for Qwen2 Technical Report
Figure 2 for Qwen2 Technical Report
Figure 3 for Qwen2 Technical Report
Figure 4 for Qwen2 Technical Report
Viaarxiv icon

LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Feedback

Add code
Jun 30, 2024
Figure 1 for LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Feedback
Figure 2 for LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Feedback
Figure 3 for LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Feedback
Figure 4 for LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Feedback
Viaarxiv icon