Bei Li

SpanNorm: Reconciling Training Stability and Performance in Deep Transformers

Jan 30, 2026

Causal Autoregressive Diffusion Language Model

Jan 29, 2026

LongCat-Flash-Thinking-2601 Technical Report

Jan 23, 2026

BAPO: Boundary-Aware Policy Optimization for Reliable Agentic Search

Jan 16, 2026

Probing Preference Representations: A Multi-Dimensional Evaluation and Analysis Method for Reward Models

Nov 16, 2025

Beyond English: Toward Inclusive and Scalable Multilingual Machine Translation with LLMs

Nov 10, 2025

MRO: Enhancing Reasoning in Diffusion Language Models via Multi-Reward Optimization

Oct 24, 2025

IIET: Efficient Numerical Transformer via Implicit Iterative Euler Method

Sep 26, 2025

TCPO: Thought-Centric Preference Optimization for Effective Embodied Decision-making

Sep 10, 2025

SageLM: A Multi-aspect and Explainable Large Language Model for Speech Judgement

Aug 28, 2025