Linyi Yang

ReMA: Learning to Meta-think for LLMs with Multi-Agent Reinforcement Learning

Mar 12, 2025

DeepReview: Improving LLM-based Paper Review with Human-like Deep Thinking Process

Mar 11, 2025

LAG: LLM agents for Leaderboard Auto Generation on Demanding

Feb 25, 2025

ThinkBench: Dynamic Out-of-Distribution Evaluation for Robust LLM Reasoning

Feb 22, 2025

Direct Value Optimization: Improving Chain-of-Thought Reasoning in LLMs with Refined Values

Feb 19, 2025

Direct Preference Optimization Using Sparse Feature-Level Constraints

Nov 12, 2024

CycleResearcher: Improving Automated Research via Automated Review

Oct 28, 2024

CAP: Data Contamination Detection via Consistency Amplification

Oct 19, 2024

Locking Down the Finetuned LLMs Safety

Oct 14, 2024

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Oct 12, 2024