Yankai Lin

Learning to Focus: Causal Attention Distillation via Gradient-Guided Token Pruning

Jun 09, 2025
Via arXiv

MiniCPM4: Ultra-Efficient LLMs on End Devices

Jun 09, 2025
Via arXiv

LLaDA 1.5: Variance-Reduced Preference Optimization for Large Language Diffusion Models

May 25, 2025
Via arXiv

ToLeaP: Rethinking Development of Tool Learning with Large Language Models

May 17, 2025
Via arXiv

DeepCritic: Deliberate Critique with Large Language Models

May 01, 2025
Via arXiv

Learning to Generate Structured Output with Schema Reinforcement Learning

Feb 26, 2025
Via arXiv

AgentRM: Enhancing Agent Generalization with Reward Modeling

Feb 25, 2025
Via arXiv

Towards Thinking-Optimal Scaling of Test-Time Compute for LLM Reasoning

Feb 25, 2025
Via arXiv

TrendSim: Simulating Trending Topics in Social Media Under Poisoning Attacks with LLM-based Multi-agent System

Dec 14, 2024
Via arXiv

WorkflowLLM: Enhancing Workflow Orchestration Capability of Large Language Models

Nov 08, 2024
Via arXiv