Jaehyung Kim

Shared Semantics, Divergent Mechanisms: Unsupervised Feature Discovery by Aligning Semantics and Mechanisms

Jun 06, 2026
Via arXiv

LaRA: Layer-wise Representation Analysis for Detecting Data Contamination in RL Post-Training

May 28, 2026
Via arXiv

The Amazing Agent Race: Strong Tool Users, Weak Navigators

Apr 11, 2026
Via arXiv

Diet Your LLM: Dimension-wise Global Pruning of LLMs via Merging Task-specific Importance Score

Mar 25, 2026
Via arXiv

RoboAlign: Learning Test-Time Reasoning for Language-Action Alignment in Vision-Language-Action Models

Mar 22, 2026
Via arXiv

InterPol: De-anonymizing LM Arena via Interpolated Preference Learning

Mar 16, 2026
Via arXiv

INDIBATOR: Diverse and Fact-Grounded Individuality for Multi-Agent Debate in Molecular Discovery

Feb 02, 2026
Via arXiv

Reasoning or Fluency? Dissecting Probabilistic Confidence in Best-of-N Selection

Jan 20, 2026
Via arXiv

Gap-K%: Measuring Top-1 Prediction Gap for Detecting Pretraining Data

Jan 16, 2026
Via arXiv

SPRInG: Continual LLM Personalization via Selective Parametric Adaptation and Retrieval-Interpolated Generation

Jan 15, 2026
Via arXiv