Picture for Jiatao Gu

Jiatao Gu

Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge

Add code
Jan 13, 2026
Viaarxiv icon

One Layer Is Enough: Adapting Pretrained Visual Encoders for Image Generation

Add code
Dec 16, 2025
Viaarxiv icon

DenseAnnotate: Enabling Scalable Dense Caption Collection for Images and 3D Scenes via Spoken Descriptions

Add code
Nov 16, 2025
Viaarxiv icon

DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation

Add code
Jun 26, 2025
Viaarxiv icon

Vid2Sim: Generalizable, Video-based Reconstruction of Appearance, Geometry and Physics for Mesh-free Simulation

Add code
Jun 06, 2025
Figure 1 for Vid2Sim: Generalizable, Video-based Reconstruction of Appearance, Geometry and Physics for Mesh-free Simulation
Figure 2 for Vid2Sim: Generalizable, Video-based Reconstruction of Appearance, Geometry and Physics for Mesh-free Simulation
Figure 3 for Vid2Sim: Generalizable, Video-based Reconstruction of Appearance, Geometry and Physics for Mesh-free Simulation
Figure 4 for Vid2Sim: Generalizable, Video-based Reconstruction of Appearance, Geometry and Physics for Mesh-free Simulation
Viaarxiv icon

STARFlow: Scaling Latent Normalizing Flows for High-resolution Image Synthesis

Add code
Jun 06, 2025
Figure 1 for STARFlow: Scaling Latent Normalizing Flows for High-resolution Image Synthesis
Figure 2 for STARFlow: Scaling Latent Normalizing Flows for High-resolution Image Synthesis
Figure 3 for STARFlow: Scaling Latent Normalizing Flows for High-resolution Image Synthesis
Figure 4 for STARFlow: Scaling Latent Normalizing Flows for High-resolution Image Synthesis
Viaarxiv icon

Reversal Blessing: Thinking Backward May Outpace Thinking Forward in Multi-choice Questions

Add code
Feb 25, 2025
Figure 1 for Reversal Blessing: Thinking Backward May Outpace Thinking Forward in Multi-choice Questions
Figure 2 for Reversal Blessing: Thinking Backward May Outpace Thinking Forward in Multi-choice Questions
Figure 3 for Reversal Blessing: Thinking Backward May Outpace Thinking Forward in Multi-choice Questions
Figure 4 for Reversal Blessing: Thinking Backward May Outpace Thinking Forward in Multi-choice Questions
Viaarxiv icon

Zero-1-to-G: Taming Pretrained 2D Diffusion Model for Direct 3D Generation

Add code
Jan 09, 2025
Figure 1 for Zero-1-to-G: Taming Pretrained 2D Diffusion Model for Direct 3D Generation
Figure 2 for Zero-1-to-G: Taming Pretrained 2D Diffusion Model for Direct 3D Generation
Figure 3 for Zero-1-to-G: Taming Pretrained 2D Diffusion Model for Direct 3D Generation
Figure 4 for Zero-1-to-G: Taming Pretrained 2D Diffusion Model for Direct 3D Generation
Viaarxiv icon

3D Shape Tokenization

Add code
Dec 24, 2024
Figure 1 for 3D Shape Tokenization
Figure 2 for 3D Shape Tokenization
Figure 3 for 3D Shape Tokenization
Figure 4 for 3D Shape Tokenization
Viaarxiv icon

DSplats: 3D Generation by Denoising Splats-Based Multiview Diffusion Models

Add code
Dec 11, 2024
Figure 1 for DSplats: 3D Generation by Denoising Splats-Based Multiview Diffusion Models
Figure 2 for DSplats: 3D Generation by Denoising Splats-Based Multiview Diffusion Models
Figure 3 for DSplats: 3D Generation by Denoising Splats-Based Multiview Diffusion Models
Figure 4 for DSplats: 3D Generation by Denoising Splats-Based Multiview Diffusion Models
Viaarxiv icon