Picture for Zhongqi Yue

Zhongqi Yue

Thinking with Images as Continuous Actions: Numerical Visual Chain-of-Thought

Add code
Feb 27, 2026
Viaarxiv icon

Selftok: Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning

Add code
May 18, 2025
Viaarxiv icon

Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning

Add code
May 12, 2025
Viaarxiv icon

Reasoning Physical Video Generation with Diffusion Timestep Tokens via Reinforcement Learning

Add code
Apr 22, 2025
Viaarxiv icon

Generative Multimodal Pretraining with Discrete Diffusion Timestep Tokens

Add code
Apr 20, 2025
Figure 1 for Generative Multimodal Pretraining with Discrete Diffusion Timestep Tokens
Figure 2 for Generative Multimodal Pretraining with Discrete Diffusion Timestep Tokens
Figure 3 for Generative Multimodal Pretraining with Discrete Diffusion Timestep Tokens
Figure 4 for Generative Multimodal Pretraining with Discrete Diffusion Timestep Tokens
Viaarxiv icon

Benchmarking Multimodal CoT Reward Model Stepwise by Visual Program

Add code
Apr 09, 2025
Figure 1 for Benchmarking Multimodal CoT Reward Model Stepwise by Visual Program
Figure 2 for Benchmarking Multimodal CoT Reward Model Stepwise by Visual Program
Figure 3 for Benchmarking Multimodal CoT Reward Model Stepwise by Visual Program
Figure 4 for Benchmarking Multimodal CoT Reward Model Stepwise by Visual Program
Viaarxiv icon

Mastering Collaborative Multi-modal Data Selection: A Focus on Informativeness, Uniqueness, and Representativeness

Add code
Dec 09, 2024
Figure 1 for Mastering Collaborative Multi-modal Data Selection: A Focus on Informativeness, Uniqueness, and Representativeness
Figure 2 for Mastering Collaborative Multi-modal Data Selection: A Focus on Informativeness, Uniqueness, and Representativeness
Figure 3 for Mastering Collaborative Multi-modal Data Selection: A Focus on Informativeness, Uniqueness, and Representativeness
Figure 4 for Mastering Collaborative Multi-modal Data Selection: A Focus on Informativeness, Uniqueness, and Representativeness
Viaarxiv icon

AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea

Add code
Nov 24, 2024
Figure 1 for AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea
Figure 2 for AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea
Figure 3 for AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea
Figure 4 for AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea
Viaarxiv icon

Few-shot Learner Parameterization by Diffusion Time-steps

Add code
Mar 05, 2024
Figure 1 for Few-shot Learner Parameterization by Diffusion Time-steps
Figure 2 for Few-shot Learner Parameterization by Diffusion Time-steps
Figure 3 for Few-shot Learner Parameterization by Diffusion Time-steps
Figure 4 for Few-shot Learner Parameterization by Diffusion Time-steps
Viaarxiv icon

Exploring Diffusion Time-steps for Unsupervised Representation Learning

Add code
Jan 21, 2024
Viaarxiv icon