Wenjie Wang

LiveAgentBench: Comprehensive Benchmarking of Agentic Systems Across 104 Real-World Challenges

Mar 03, 2026
Via arXiv

NextAds: Towards Next-generation Personalized Video Advertising

Mar 02, 2026
Via arXiv

Silent Sabotage During Fine-Tuning: Few-Shot Rationale Poisoning of Compact Medical LLMs

Feb 28, 2026
Via arXiv

Toward Personalized LLM-Powered Agents: Foundations, Evaluation, and Future Directions

Feb 26, 2026
Via arXiv

A Trajectory-Based Safety Audit of Clawdbot (OpenClaw)

Feb 16, 2026
Via arXiv

DeepGen 1.0: A Lightweight Unified Multimodal Model for Advancing Image Generation and Editing

Feb 13, 2026
Via arXiv

Synthetic Interaction Data for Scalable Personalization in Large Language Models

Feb 12, 2026
Via arXiv

Binary Flow Matching: Prediction-Loss Space Alignment for Robust Learning

Feb 11, 2026
Via arXiv

MeDocVL: A Visual Language Model for Medical Document Understanding and Parsing

Feb 06, 2026
Via arXiv

GenArena: How Can We Achieve Human-Aligned Evaluation for Visual Generation Tasks?

Feb 05, 2026
Via arXiv