Picture for J. Zico Kolter

J. Zico Kolter

Carnegie Mellon University

Antidistillation Fingerprinting

Add code
Feb 03, 2026
Viaarxiv icon

When Should We Introduce Safety Interventions During Pretraining?

Add code
Jan 11, 2026
Viaarxiv icon

From Entropy to Epiplexity: Rethinking Information for Computationally Bounded Intelligence

Add code
Jan 06, 2026
Viaarxiv icon

Comparing AI Agents to Cybersecurity Professionals in Real-World Penetration Testing

Add code
Dec 10, 2025
Figure 1 for Comparing AI Agents to Cybersecurity Professionals in Real-World Penetration Testing
Figure 2 for Comparing AI Agents to Cybersecurity Professionals in Real-World Penetration Testing
Figure 3 for Comparing AI Agents to Cybersecurity Professionals in Real-World Penetration Testing
Figure 4 for Comparing AI Agents to Cybersecurity Professionals in Real-World Penetration Testing
Viaarxiv icon

Superhuman AI for Stratego Using Self-Play Reinforcement Learning and Test-Time Search

Add code
Nov 10, 2025
Viaarxiv icon

Evaluating Language Model Reasoning about Confidential Information

Add code
Aug 27, 2025
Viaarxiv icon

Security Challenges in AI Agent Deployment: Insights from a Large Scale Public Competition

Add code
Jul 28, 2025
Viaarxiv icon

OpenUnlearning: Accelerating LLM Unlearning via Unified Benchmarking of Methods and Metrics

Add code
Jun 14, 2025
Figure 1 for OpenUnlearning: Accelerating LLM Unlearning via Unified Benchmarking of Methods and Metrics
Figure 2 for OpenUnlearning: Accelerating LLM Unlearning via Unified Benchmarking of Methods and Metrics
Figure 3 for OpenUnlearning: Accelerating LLM Unlearning via Unified Benchmarking of Methods and Metrics
Figure 4 for OpenUnlearning: Accelerating LLM Unlearning via Unified Benchmarking of Methods and Metrics
Viaarxiv icon

Accelerating Diffusion Models in Offline RL via Reward-Aware Consistency Trajectory Distillation

Add code
Jun 09, 2025
Viaarxiv icon

Mean Flows for One-step Generative Modeling

Add code
May 19, 2025
Viaarxiv icon