Picture for Jia Li

Jia Li

IBCircuit: Towards Holistic Circuit Discovery with Information Bottleneck

Add code
Feb 26, 2026
Viaarxiv icon

DHP: Efficient Scaling of MLLM Training with Dynamic Hybrid Parallelism

Add code
Feb 25, 2026
Viaarxiv icon

Train Short, Inference Long: Training-free Horizon Extension for Autoregressive Video Generation

Add code
Feb 17, 2026
Viaarxiv icon

WorldTree: Towards 4D Dynamic Worlds from Monocular Video using Tree-Chains

Add code
Feb 12, 2026
Viaarxiv icon

Time-to-Event Transformer to Capture Timing Attention of Events in EHR Time Series

Add code
Feb 11, 2026
Viaarxiv icon

Improving Variable-Length Generation in Diffusion Language Models via Length Regularization

Add code
Feb 07, 2026
Viaarxiv icon

Principled Synthetic Data Enables the First Scaling Laws for LLMs in Recommendation

Add code
Feb 07, 2026
Viaarxiv icon

Exposing Weaknesses of Large Reasoning Models through Graph Algorithm Problems

Add code
Feb 06, 2026
Viaarxiv icon

ARGaze: Autoregressive Transformers for Online Egocentric Gaze Estimation

Add code
Feb 04, 2026
Viaarxiv icon

Kimi K2.5: Visual Agentic Intelligence

Add code
Feb 02, 2026
Viaarxiv icon