Bohan Li

Diffusion Knows Transparency: Repurposing Video Diffusion for Transparent Object Depth and Normal Estimation

Dec 29, 2025
Via arXiv

BSCodec: A Band-Split Neural Codec for High-Quality Universal Audio Reconstruction

Nov 08, 2025
Via arXiv

Scaling Up Occupancy-centric Driving Scene Generation: Dataset and Method

Oct 27, 2025
Via arXiv

One View, Many Worlds: Single-Image to 3D Object Meets Generative Domain Randomization for One-Shot 6D Pose Estimation

Sep 09, 2025
Via arXiv

Next Tokens Denoising for Speech Synthesis

Jul 30, 2025
Via arXiv

Light of Normals: Unified Feature Representation for Universal Photometric Stereo

Jun 24, 2025
Via arXiv

Unifying Appearance Codes and Bilateral Grids for Driving Scene Gaussian Splatting

Jun 06, 2025
Via arXiv

Towards General Discrete Speech Codec for Complex Acoustic Environments: A Study of Reconstruction and Downstream Task Consistency

May 28, 2025
Via arXiv

Challenger: Affordable Adversarial Driving Video Generation

May 21, 2025
Via arXiv

Communication-Efficient Diffusion Denoising Parallelization via Reuse-then-Predict Mechanism

May 20, 2025
Via arXiv