Picture for Zhengyuan Yang

Zhengyuan Yang

Toward Generalist Autonomous Research via Hypothesis-Tree Refinement

Add code
Jun 10, 2026
Viaarxiv icon

3D-CoS: A New 3D Reconstruction Paradigm Based on VLM Code Synthesis

Add code
Jun 09, 2026
Viaarxiv icon

Residual Decoder Adapter: ID-Preserving Tokenizer Adaption for Autoregressive Text Rendering

Add code
Jun 01, 2026
Viaarxiv icon

Planning with the Views via Scene Self-Exploration

Add code
May 28, 2026
Viaarxiv icon

SceneCode: Executable World Programs for Editable Indoor Scenes with Articulated Objects

Add code
May 19, 2026
Viaarxiv icon

TextGround4M: A Prompt-Aligned Dataset for Layout-Aware Text Rendering

Add code
Apr 27, 2026
Viaarxiv icon

MM-WebAgent: A Hierarchical Multimodal Web Agent for Webpage Generation

Add code
Apr 16, 2026
Viaarxiv icon

FlowInOne:Unifying Multimodal Generation as Image-in, Image-out Flow Matching

Add code
Apr 08, 2026
Viaarxiv icon

RAGEN-2: Reasoning Collapse in Agentic RL

Add code
Apr 07, 2026
Viaarxiv icon

BizGenEval: A Systematic Benchmark for Commercial Visual Content Generation

Add code
Mar 26, 2026
Viaarxiv icon