Bohan Li

MathAgent: Adversarial Evolution of Constraint Graphs for Mathematical Reasoning Data Synthesis

Apr 13, 2026
Via arXiv

PAM: A Pose-Appearance-Motion Engine for Sim-to-Real HOI Video Generation

Mar 23, 2026
Via arXiv

The Interspeech 2026 Audio Reasoning Challenge: Evaluating Reasoning Process Quality for Audio Reasoning Models and Agents

Feb 15, 2026
Via arXiv

Diffusion Knows Transparency: Repurposing Video Diffusion for Transparent Object Depth and Normal Estimation

Dec 29, 2025
Via arXiv

BSCodec: A Band-Split Neural Codec for High-Quality Universal Audio Reconstruction

Nov 08, 2025
Via arXiv

Scaling Up Occupancy-centric Driving Scene Generation: Dataset and Method

Oct 27, 2025
Via arXiv

One View, Many Worlds: Single-Image to 3D Object Meets Generative Domain Randomization for One-Shot 6D Pose Estimation

Sep 09, 2025
Via arXiv

Next Tokens Denoising for Speech Synthesis

Jul 30, 2025
Via arXiv

Light of Normals: Unified Feature Representation for Universal Photometric Stereo

Jun 24, 2025
Via arXiv

Unifying Appearance Codes and Bilateral Grids for Driving Scene Gaussian Splatting

Jun 06, 2025
Via arXiv