Lijie Hu

MedFM-Robust: Benchmarking Robustness of Medical Foundation Models

May 21, 2026
Via arXiv

Matryoshka Concept Bottleneck Models

May 20, 2026
Via arXiv

From Instance Selection to Fixed-Pool Data Recipe Search for Supervised Fine-Tuning

May 13, 2026
Via arXiv

UGID: Unified Graph Isomorphism for Debiasing Large Language Models

Mar 19, 2026
Via arXiv

Functional Subspace Watermarking for Large Language Models

Mar 19, 2026
Via arXiv

FaithSteer-BENCH: A Deployment-Aligned Stress-Testing Benchmark for Inference-Time Steering

Mar 18, 2026
Via arXiv

SWE-Skills-Bench: Do Agent Skills Actually Help in Real-World Software Engineering?

Mar 16, 2026
Via arXiv

Global Evolutionary Steering: Refining Activation Steering Control via Cross-Layer Consistency

Mar 12, 2026
Via arXiv

Word Recovery in Large Language Models Enables Character-Level Tokenization Robustness

Mar 11, 2026
Via arXiv

Beyond Scalars: Evaluating and Understanding LLM Reasoning via Geometric Progress and Stability

Mar 11, 2026
Via arXiv