Picture for Seanie Lee

Seanie Lee

THINKSAFE: Self-Generated Safety Alignment for Reasoning Models

Add code
Jan 30, 2026
Viaarxiv icon

Rethinking Reward Models for Multi-Domain Test-Time Scaling

Add code
Oct 02, 2025
Viaarxiv icon

HoliSafe: Holistic Safety Benchmarking and Modeling with Safety Meta Token for Vision-Language Model

Add code
Jun 05, 2025
Viaarxiv icon

Distilling LLM Agent into Small Models with Retrieval and Code Tools

Add code
May 23, 2025
Viaarxiv icon

Personalized Fine-Tuning with Controllable Synthetic Speech from LLM-Generated Transcripts for Dysarthric Speech Recognition

Add code
May 19, 2025
Viaarxiv icon

FedSVD: Adaptive Orthogonalization for Private Federated Learning with LoRA

Add code
May 19, 2025
Viaarxiv icon

Trajectory Balance with Asynchrony: Decoupling Exploration and Learning for Fast, Scalable LLM Post-Training

Add code
Mar 24, 2025
Viaarxiv icon

FedRand: Enhancing Privacy in Federated Learning with Randomized LoRA Subparameter Updates

Add code
Mar 11, 2025
Figure 1 for FedRand: Enhancing Privacy in Federated Learning with Randomized LoRA Subparameter Updates
Figure 2 for FedRand: Enhancing Privacy in Federated Learning with Randomized LoRA Subparameter Updates
Figure 3 for FedRand: Enhancing Privacy in Federated Learning with Randomized LoRA Subparameter Updates
Figure 4 for FedRand: Enhancing Privacy in Federated Learning with Randomized LoRA Subparameter Updates
Viaarxiv icon

SafeRoute: Adaptive Model Selection for Efficient and Accurate Safety Guardrails in Large Language Models

Add code
Feb 18, 2025
Viaarxiv icon

HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models

Add code
Oct 02, 2024
Viaarxiv icon