Picture for Surbhi Goel

Surbhi Goel

Model Agreement via Anchoring

Add code
Feb 26, 2026
Viaarxiv icon

Reliable Abstention under Adversarial Injections: Tight Lower Bounds and New Upper Bounds

Add code
Feb 23, 2026
Viaarxiv icon

Emergent Alignment via Competition

Add code
Sep 18, 2025
Viaarxiv icon

Conformal Language Model Reasoning with Coherent Factuality

Add code
May 21, 2025
Viaarxiv icon

Probabilistic Stability Guarantees for Feature Attributions

Add code
Apr 18, 2025
Viaarxiv icon

Collaborative Prediction: Tractable Information Aggregation via Agreement

Add code
Apr 08, 2025
Viaarxiv icon

A Theory of Learning with Autoregressive Chain of Thought

Add code
Mar 11, 2025
Viaarxiv icon

Testing Noise Assumptions of Learning Algorithms

Add code
Jan 15, 2025
Viaarxiv icon

Tractable Agreement Protocols

Add code
Nov 29, 2024
Viaarxiv icon

Progressive distillation induces an implicit curriculum

Add code
Oct 07, 2024
Viaarxiv icon