Jiaqi Liao

Gym-V: A Unified Vision Environment System for Agentic Vision Research

Mar 17, 2026
Via arXiv

SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models

Oct 14, 2025
Via arXiv

ViStoryBench: Comprehensive Benchmark Suite for Story Visualization

May 30, 2025
Via arXiv

VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models

May 29, 2025
Via arXiv

Step1X-Edit: A Practical Framework for General Image Editing

Apr 24, 2025
Via arXiv

LangBridge: Interpreting Image as a Combination of Language Embeddings

Mar 26, 2025
Via arXiv

ImageGen-CoT: Enhancing Text-to-Image In-context Learning with Chain-of-Thought Reasoning

Mar 25, 2025
Via arXiv

WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation

Mar 10, 2025
Via arXiv

AutoFeedback: An LLM-based Framework for Efficient and Accurate API Request Generation

Oct 09, 2024
Via arXiv

Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation

Oct 07, 2024
Via arXiv