
Xiaotao Gu

UI2Code^N: A Visual Language Model for Test-Time Scalable Interactive UI-to-Code Generation

Nov 14, 2025
Via arXiv

WebVIA: A Web-based Vision-Language Agentic Framework for Interactive and Verifiable UI-to-Code Generation

Nov 09, 2025
Via arXiv

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Aug 08, 2025
Via arXiv

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Jul 02, 2025
Via arXiv

VPO: Aligning Text-to-Video Generation Models with Prompt Optimization

Mar 26, 2025
Via arXiv

Concat-ID: Towards Universal Identity-Preserving Video Synthesis

Mar 18, 2025
Via arXiv

StepMathAgent: A Step-Wise Agent for Evaluating Mathematical Processes through Tree-of-Error

Mar 13, 2025
Via arXiv

LongSafety: Evaluating Long-Context Safety of Large Language Models

Feb 24, 2025
Via arXiv

HPSS: Heuristic Prompting Strategy Search for LLM Evaluators

Feb 18, 2025
Via arXiv

MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models

Jan 06, 2025
Via arXiv