Picture for Wei Lu

Wei Lu

LMI

PEAR: Phase Entropy Aware Reward for Efficient Reasoning

Add code
Oct 09, 2025
Viaarxiv icon

Language Models Can Learn from Verbal Feedback Without Scalar Rewards

Add code
Sep 26, 2025
Viaarxiv icon

P/D-Device: Disaggregated Large Language Model between Cloud and Devices

Add code
Aug 12, 2025
Viaarxiv icon

Aligning Large Language Model Agents with Rational and Moral Preferences: A Supervised Fine-Tuning Approach

Add code
Jul 28, 2025
Viaarxiv icon

A Multimodal Deviation Perceiving Framework for Weakly-Supervised Temporal Forgery Localization

Add code
Jul 22, 2025
Viaarxiv icon

Towards Reliable Forgetting: A Survey on Machine Unlearning Verification, Challenges, and Future Directions

Add code
Jun 18, 2025
Viaarxiv icon

Context-aware TFL: A Universal Context-aware Contrastive Learning Framework for Temporal Forgery Localization

Add code
Jun 10, 2025
Viaarxiv icon

Through the Valley: Path to Effective Long CoT Training for Small Language Models

Add code
Jun 09, 2025
Viaarxiv icon

Satori-SWE: Evolutionary Test-Time Scaling for Sample-Efficient Software Engineering

Add code
May 29, 2025
Viaarxiv icon

From Tens of Hours to Tens of Thousands: Scaling Back-Translation for Speech Recognition

Add code
May 22, 2025
Viaarxiv icon