Bei Li

Probing Preference Representations: A Multi-Dimensional Evaluation and Analysis Method for Reward Models

Nov 16, 2025
Via arXiv

Beyond English: Toward Inclusive and Scalable Multilingual Machine Translation with LLMs

Nov 10, 2025
Via arXiv

MRO: Enhancing Reasoning in Diffusion Language Models via Multi-Reward Optimization

Oct 24, 2025
Via arXiv

IIET: Efficient Numerical Transformer via Implicit Iterative Euler Method

Sep 26, 2025
Via arXiv

TCPO: Thought-Centric Preference Optimization for Effective Embodied Decision-making

Sep 10, 2025
Via arXiv

SageLM: A Multi-aspect and Explainable Large Language Model for Speech Judgement

Aug 28, 2025
Via arXiv

One Size Does Not Fit All: A Distribution-Aware Sparsification for More Precise Model Merging

Aug 08, 2025
Via arXiv

Safe Deployment of Offline Reinforcement Learning via Input Convex Action Correction

Jul 30, 2025
Via arXiv

GRAM: A Generative Foundation Reward Model for Reward Generalization

Jun 18, 2025
Via arXiv

TACTIC: Translation Agents with Cognitive-Theoretic Interactive Collaboration

Jun 11, 2025
Via arXiv