Picture for Bowen Yu

Bowen Yu

Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering

Add code
Nov 18, 2024
Viaarxiv icon

P-MMEval: A Parallel Multilingual Multitask Benchmark for Consistent Evaluation of LLMs

Add code
Nov 14, 2024
Viaarxiv icon

Language Models can Self-Lengthen to Generate Long Texts

Add code
Oct 31, 2024
Figure 1 for Language Models can Self-Lengthen to Generate Long Texts
Figure 2 for Language Models can Self-Lengthen to Generate Long Texts
Figure 3 for Language Models can Self-Lengthen to Generate Long Texts
Figure 4 for Language Models can Self-Lengthen to Generate Long Texts
Viaarxiv icon

Transferable Post-training via Inverse Value Learning

Add code
Oct 28, 2024
Figure 1 for Transferable Post-training via Inverse Value Learning
Figure 2 for Transferable Post-training via Inverse Value Learning
Figure 3 for Transferable Post-training via Inverse Value Learning
Figure 4 for Transferable Post-training via Inverse Value Learning
Viaarxiv icon

Aligning Large Language Models via Self-Steering Optimization

Add code
Oct 22, 2024
Viaarxiv icon

A Unified View of Delta Parameter Editing in Post-Trained Large-Scale Models

Add code
Oct 17, 2024
Figure 1 for A Unified View of Delta Parameter Editing in Post-Trained Large-Scale Models
Figure 2 for A Unified View of Delta Parameter Editing in Post-Trained Large-Scale Models
Figure 3 for A Unified View of Delta Parameter Editing in Post-Trained Large-Scale Models
Figure 4 for A Unified View of Delta Parameter Editing in Post-Trained Large-Scale Models
Viaarxiv icon

Rethinking Data Selection at Scale: Random Selection is Almost All You Need

Add code
Oct 12, 2024
Viaarxiv icon

Qwen2.5-Coder Technical Report

Add code
Sep 18, 2024
Viaarxiv icon

Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement

Add code
Sep 18, 2024
Viaarxiv icon

Towards a Unified View of Preference Learning for Large Language Models: A Survey

Add code
Sep 04, 2024
Figure 1 for Towards a Unified View of Preference Learning for Large Language Models: A Survey
Figure 2 for Towards a Unified View of Preference Learning for Large Language Models: A Survey
Figure 3 for Towards a Unified View of Preference Learning for Large Language Models: A Survey
Figure 4 for Towards a Unified View of Preference Learning for Large Language Models: A Survey
Viaarxiv icon