Tianyu Pang

Why LLM Safety Guardrails Collapse After Fine-tuning: A Similarity Analysis Between Alignment and Fine-tuning Datasets

Jun 05, 2025
Via arXiv

Fostering Video Reasoning via Next-Event Prediction

May 28, 2025
Via arXiv

Reinforcing General Reasoning without Verifiers

May 27, 2025
Via arXiv

Adversarial Attacks against Closed-Source MLLMs via Feature Optimal Alignment

May 27, 2025
Via arXiv

Lifelong Safety Alignment for Language Models

May 26, 2025
Via arXiv

QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design

May 22, 2025
Via arXiv

BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms

May 21, 2025
Via arXiv

Optimizing Anytime Reasoning via Budget Relative Policy Optimization

May 19, 2025
Via arXiv

FlowReasoner: Reinforcing Query-Level Meta-Agents

Apr 21, 2025
Via arXiv

NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation

Apr 17, 2025
Via arXiv