Picture for Viet Anh Nguyen

Viet Anh Nguyen

Adaptive Rollout Allocation for Online Reinforcement Learning with Verifiable Rewards

Add code
Feb 03, 2026
Viaarxiv icon

Provably Data-driven Multiple Hyper-parameter Tuning with Structured Loss Function

Add code
Feb 02, 2026
Viaarxiv icon

Exploring Diverse Generation Paths via Inference-time Stiefel Activation Steering

Add code
Jan 29, 2026
Viaarxiv icon

SCOPE: Spectral Concentration by Distributionally Robust Joint Covariance-Precision Estimation

Add code
Nov 18, 2025
Viaarxiv icon

Test-time Diverse Reasoning by Riemannian Activation Steering

Add code
Nov 11, 2025
Figure 1 for Test-time Diverse Reasoning by Riemannian Activation Steering
Figure 2 for Test-time Diverse Reasoning by Riemannian Activation Steering
Figure 3 for Test-time Diverse Reasoning by Riemannian Activation Steering
Figure 4 for Test-time Diverse Reasoning by Riemannian Activation Steering
Viaarxiv icon

Structured Pruning for Diverse Best-of-N Reasoning Optimization

Add code
Jun 09, 2025
Viaarxiv icon

Mixture-of-Personas Language Models for Population Simulation

Add code
Apr 07, 2025
Figure 1 for Mixture-of-Personas Language Models for Population Simulation
Figure 2 for Mixture-of-Personas Language Models for Population Simulation
Figure 3 for Mixture-of-Personas Language Models for Population Simulation
Figure 4 for Mixture-of-Personas Language Models for Population Simulation
Viaarxiv icon

Task-driven Layerwise Additive Activation Intervention

Add code
Feb 10, 2025
Figure 1 for Task-driven Layerwise Additive Activation Intervention
Figure 2 for Task-driven Layerwise Additive Activation Intervention
Figure 3 for Task-driven Layerwise Additive Activation Intervention
Figure 4 for Task-driven Layerwise Additive Activation Intervention
Viaarxiv icon

Probe-Free Low-Rank Activation Intervention

Add code
Feb 06, 2025
Figure 1 for Probe-Free Low-Rank Activation Intervention
Figure 2 for Probe-Free Low-Rank Activation Intervention
Figure 3 for Probe-Free Low-Rank Activation Intervention
Figure 4 for Probe-Free Low-Rank Activation Intervention
Viaarxiv icon

Risk-Aware Distributional Intervention Policies for Language Models

Add code
Jan 27, 2025
Viaarxiv icon