Picture for Jiayi Guo

Jiayi Guo

When Do LLM Agents Treat Surface Noise Differently from Semantic Noise? A 68-Cell Measurement Study with a Held-Out Trace-Level Validation

Add code
May 26, 2026
Viaarxiv icon

DyCoRM: Dynamic Criterion-Aware Reward Modeling for Text-to-Image Generation

Add code
May 25, 2026
Viaarxiv icon

InsightTok: Improving Text and Face Fidelity in Discrete Tokenization for Autoregressive Image Generation

Add code
May 14, 2026
Viaarxiv icon

Refinement via Regeneration: Enlarging Modification Space Boosts Image Refinement in Unified Multimodal Models

Add code
Apr 28, 2026
Viaarxiv icon

Purging the Gray Zone: Latent-Geometric Denoising for Precise Knowledge Boundary Awareness

Add code
Apr 15, 2026
Viaarxiv icon

Starting Off on the Wrong Foot: Pitfalls in Data Preparation

Add code
Mar 18, 2026
Viaarxiv icon

PreciseCache: Precise Feature Caching for Efficient and High-fidelity Video Generation

Add code
Mar 03, 2026
Viaarxiv icon

Elastic Diffusion Transformer

Add code
Feb 15, 2026
Viaarxiv icon

FastVMT: Eliminating Redundancy in Video Motion Transfer

Add code
Feb 05, 2026
Viaarxiv icon

ENAT: Rethinking Spatial-temporal Interactions in Token-based Image Synthesis

Add code
Nov 11, 2024
Figure 1 for ENAT: Rethinking Spatial-temporal Interactions in Token-based Image Synthesis
Figure 2 for ENAT: Rethinking Spatial-temporal Interactions in Token-based Image Synthesis
Figure 3 for ENAT: Rethinking Spatial-temporal Interactions in Token-based Image Synthesis
Figure 4 for ENAT: Rethinking Spatial-temporal Interactions in Token-based Image Synthesis
Viaarxiv icon