Picture for Dianbo Liu

Dianbo Liu

Oggi

Quotient DAGs for Off-Policy Evaluation:Forward-Flow Importance Sampling and Exact Slate Propensities

Add code
May 28, 2026
Viaarxiv icon

JEDI: Joint Embedding Diffusion World Model for Online Model-Based Reinforcement Learning

Add code
May 13, 2026
Viaarxiv icon

Absurd World: A Simple Yet Powerful Method to Absurdify the Real-world for Probing LLM Reasoning Capabilities

Add code
May 10, 2026
Viaarxiv icon

Resolving the bias-precision paradox with stochastic causal representation learning for personalized medicine

Add code
May 07, 2026
Viaarxiv icon

Early Quantization Shrinks Codebook: A Simple Fix for Diversity-Preserving Tokenization

Add code
Mar 17, 2026
Viaarxiv icon

VQKV: High-Fidelity and High-Ratio Cache Compression via Vector-Quantization

Add code
Mar 17, 2026
Viaarxiv icon

Expected Return Causes Outcome-Level Mode Collapse in Reinforcement Learning and How to Fix It with Inverse Probability Scaling

Add code
Jan 29, 2026
Viaarxiv icon

AI-generated data contamination erodes pathological variability and diagnostic reliability

Add code
Jan 21, 2026
Viaarxiv icon

Bridging Mechanistic Interpretability and Prompt Engineering with Gradient Ascent for Interpretable Persona Control

Add code
Jan 06, 2026
Viaarxiv icon

How does My Model Fail? Automatic Identification and Interpretation of Physical Plausibility Failure Modes with Matryoshka Transcoders

Add code
Nov 18, 2025
Figure 1 for How does My Model Fail? Automatic Identification and Interpretation of Physical Plausibility Failure Modes with Matryoshka Transcoders
Figure 2 for How does My Model Fail? Automatic Identification and Interpretation of Physical Plausibility Failure Modes with Matryoshka Transcoders
Figure 3 for How does My Model Fail? Automatic Identification and Interpretation of Physical Plausibility Failure Modes with Matryoshka Transcoders
Figure 4 for How does My Model Fail? Automatic Identification and Interpretation of Physical Plausibility Failure Modes with Matryoshka Transcoders
Viaarxiv icon