Picture for Bolin Ding

Bolin Ding

BOTS: A Unified Framework for Bayesian Online Task Selection in LLM Reinforcement Finetuning

Add code
Oct 30, 2025
Viaarxiv icon

Omni-SafetyBench: A Benchmark for Safety Evaluation of Audio-Visual Large Language Models

Add code
Aug 10, 2025
Viaarxiv icon

Provoking Multi-modal Few-Shot LVLM via Exploration-Exploitation In-Context Learning

Add code
Jun 11, 2025
Viaarxiv icon

Respecting Temporal-Causal Consistency: Entity-Event Knowledge Graphs for Retrieval-Augmented Generation

Add code
Jun 06, 2025
Viaarxiv icon

Incentivizing Strong Reasoning from Weak Supervision

Add code
May 28, 2025
Viaarxiv icon

Incentivizing Reasoning from Weak Supervision

Add code
May 26, 2025
Viaarxiv icon

Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models

Add code
May 23, 2025
Viaarxiv icon

Enhancing Latent Computation in Transformers with Latent Tokens

Add code
May 19, 2025
Figure 1 for Enhancing Latent Computation in Transformers with Latent Tokens
Figure 2 for Enhancing Latent Computation in Transformers with Latent Tokens
Figure 3 for Enhancing Latent Computation in Transformers with Latent Tokens
Figure 4 for Enhancing Latent Computation in Transformers with Latent Tokens
Viaarxiv icon

Tree-based Models for Vertical Federated Learning: A Survey

Add code
Apr 03, 2025
Viaarxiv icon

RePO: ReLU-based Preference Optimization

Add code
Mar 10, 2025
Viaarxiv icon