Picture for Jing Huang

Jing Huang

Agentic Reward Modeling: Verifying GUI Agent via Online Proactive Interaction

Add code
Jan 31, 2026
Viaarxiv icon

UCPO: Uncertainty-Aware Policy Optimization

Add code
Jan 30, 2026
Viaarxiv icon

OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models

Add code
Jan 29, 2026
Viaarxiv icon

Trajectory2Task: Training Robust Tool-Calling Agents with Synthesized Yet Verifiable Data for Complex User Intents

Add code
Jan 28, 2026
Viaarxiv icon

Self-Distilled Reasoner: On-Policy Self-Distillation for Large Language Models

Add code
Jan 26, 2026
Viaarxiv icon

Diffusion Epistemic Uncertainty with Asymmetric Learning for Diffusion-Generated Image Detection

Add code
Jan 21, 2026
Viaarxiv icon

TRIM: Hybrid Inference via Targeted Stepwise Routing in Multi-Step Reasoning Tasks

Add code
Jan 15, 2026
Viaarxiv icon

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Add code
Jan 15, 2026
Viaarxiv icon

MobileDreamer: Generative Sketch World Model for GUI Agent

Add code
Jan 07, 2026
Viaarxiv icon

Digital Twin AI: Opportunities and Challenges from Large Language Models to World Models

Add code
Jan 04, 2026
Viaarxiv icon