Jiaqi Wang

Michael Pokorny

$\text{G}^2$RPO: Granular GRPO for Precise Reward in Flow Models

Oct 02, 2025
Via arXiv

CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning

Sep 26, 2025
Via arXiv

SPARK: Synergistic Policy And Reward Co-Evolving Framework

Sep 26, 2025
Via arXiv

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Sep 26, 2025
Via arXiv

LTA-thinker: Latent Thought-Augmented Training Framework for Large Language Models on Complex Reasoning

Sep 16, 2025
Via arXiv

ConvergeWriter: Data-Driven Bottom-Up Article Construction

Sep 16, 2025
Via arXiv

Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

Aug 28, 2025
Via arXiv

CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning

Aug 27, 2025
Via arXiv

DiCache: Let Diffusion Model Determine Its Own Cache

Aug 24, 2025
Via arXiv

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Aug 08, 2025
Via arXiv