Picture for Alan Ritter

Alan Ritter

Investigating and Alleviating Harm Amplification in LLM Interactions

Add code
Jun 01, 2026
Viaarxiv icon

LaRA: Layer-wise Representation Analysis for Detecting Data Contamination in RL Post-Training

Add code
May 28, 2026
Viaarxiv icon

Learning to Route Languages for Multilingual Policy Optimization

Add code
May 25, 2026
Viaarxiv icon

Distribution-Aware Reward: Reinforcement Learning over Predictive Distributions for LLM Regression

Add code
May 20, 2026
Viaarxiv icon

Safe and Scalable Web Agent Learning via Recreated Websites

Add code
Mar 11, 2026
Viaarxiv icon

Do Vision-Language Models Respect Contextual Integrity in Location Disclosure?

Add code
Feb 04, 2026
Viaarxiv icon

Didactic to Constructive: Turning Expert Solutions into Learnable Reasoning

Add code
Feb 02, 2026
Viaarxiv icon

GeoRC: A Benchmark for Geolocation Reasoning Chains

Add code
Jan 29, 2026
Viaarxiv icon

Auditing Language Model Unlearning via Information Decomposition

Add code
Jan 21, 2026
Viaarxiv icon

Semantic Differentiation for Tackling Challenges in Watermarking Low-Entropy Constrained Generation Outputs

Add code
Jan 14, 2026
Viaarxiv icon