Picture for Suhas Hariharan

Suhas Hariharan

Dataset Featurization: Uncovering Natural Language Features through Unsupervised Data Reconstruction

Add code
Feb 24, 2025
Viaarxiv icon

Rethinking CyberSecEval: An LLM-Aided Approach to Evaluation Critique

Add code
Nov 13, 2024
Figure 1 for Rethinking CyberSecEval: An LLM-Aided Approach to Evaluation Critique
Viaarxiv icon

GameBench: Evaluating Strategic Reasoning Abilities of LLM Agents

Add code
Jun 07, 2024
Viaarxiv icon