Zhuorui Ye

RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments

Nov 10, 2025

Kimi K2: Open Agentic Intelligence

Jul 28, 2025

Strategic Planning and Rationalizing on Trees Make LLMs Better Debaters

May 20, 2025

Sing it, Narrate it: Quality Musical Lyrics Translation

Oct 29, 2024

Cooperative Strategic Planning Enhances Reasoning Capabilities in Large Language Models

Oct 25, 2024

Concept-Based Interpretable Reinforcement Learning with Limited to No Human Labels

Jul 22, 2024

Semantic Complete Scene Forecasting from a 4D Dynamic Point Cloud Sequence

Dec 15, 2023

Autonomous Tree-search Ability of Large Language Models

Oct 14, 2023