Picture for Renren Jin

Renren Jin

SOUP: Token-level Single-sample Mix-policy Reinforcement Learning for Large Language Models

Add code
Jan 29, 2026
Viaarxiv icon

Revisiting Entropy in Reinforcement Learning for Large Reasoning Models

Add code
Nov 08, 2025
Viaarxiv icon

Joint Training And Decoding for Multilingual End-to-End Simultaneous Speech Translation

Add code
Mar 14, 2025
Viaarxiv icon

ProBench: Benchmarking Large Language Models in Competitive Programming

Add code
Feb 28, 2025
Viaarxiv icon

Large Language Model Safety: A Holistic Survey

Add code
Dec 23, 2024
Figure 1 for Large Language Model Safety: A Holistic Survey
Figure 2 for Large Language Model Safety: A Holistic Survey
Figure 3 for Large Language Model Safety: A Holistic Survey
Figure 4 for Large Language Model Safety: A Holistic Survey
Viaarxiv icon

Star-Agents: Automatic Data Optimization with LLM Agents for Instruction Tuning

Add code
Nov 21, 2024
Figure 1 for Star-Agents: Automatic Data Optimization with LLM Agents for Instruction Tuning
Figure 2 for Star-Agents: Automatic Data Optimization with LLM Agents for Instruction Tuning
Figure 3 for Star-Agents: Automatic Data Optimization with LLM Agents for Instruction Tuning
Figure 4 for Star-Agents: Automatic Data Optimization with LLM Agents for Instruction Tuning
Viaarxiv icon

Multilingual Large Language Models: A Systematic Survey

Add code
Nov 19, 2024
Viaarxiv icon

FuxiTranyu: A Multilingual Large Language Model Trained with Balanced Data

Add code
Aug 13, 2024
Figure 1 for FuxiTranyu: A Multilingual Large Language Model Trained with Balanced Data
Figure 2 for FuxiTranyu: A Multilingual Large Language Model Trained with Balanced Data
Figure 3 for FuxiTranyu: A Multilingual Large Language Model Trained with Balanced Data
Figure 4 for FuxiTranyu: A Multilingual Large Language Model Trained with Balanced Data
Viaarxiv icon

IRCAN: Mitigating Knowledge Conflicts in LLM Generation via Identifying and Reweighting Context-Aware Neurons

Add code
Jun 26, 2024
Figure 1 for IRCAN: Mitigating Knowledge Conflicts in LLM Generation via Identifying and Reweighting Context-Aware Neurons
Figure 2 for IRCAN: Mitigating Knowledge Conflicts in LLM Generation via Identifying and Reweighting Context-Aware Neurons
Figure 3 for IRCAN: Mitigating Knowledge Conflicts in LLM Generation via Identifying and Reweighting Context-Aware Neurons
Figure 4 for IRCAN: Mitigating Knowledge Conflicts in LLM Generation via Identifying and Reweighting Context-Aware Neurons
Viaarxiv icon

ConTrans: Weak-to-Strong Alignment Engineering via Concept Transplantation

Add code
May 22, 2024
Viaarxiv icon