Picture for Wenbo Hu

Wenbo Hu

MRAG-Bench: Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models

Add code
Oct 10, 2024
Viaarxiv icon

StereoCrafter: Diffusion-based Generation of Long and High-fidelity Stereoscopic 3D from Monocular Videos

Add code
Sep 11, 2024
Figure 1 for StereoCrafter: Diffusion-based Generation of Long and High-fidelity Stereoscopic 3D from Monocular Videos
Figure 2 for StereoCrafter: Diffusion-based Generation of Long and High-fidelity Stereoscopic 3D from Monocular Videos
Figure 3 for StereoCrafter: Diffusion-based Generation of Long and High-fidelity Stereoscopic 3D from Monocular Videos
Figure 4 for StereoCrafter: Diffusion-based Generation of Long and High-fidelity Stereoscopic 3D from Monocular Videos
Viaarxiv icon

DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos

Add code
Sep 03, 2024
Figure 1 for DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
Figure 2 for DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
Figure 3 for DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
Figure 4 for DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
Viaarxiv icon

ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis

Add code
Sep 03, 2024
Viaarxiv icon

CV-VAE: A Compatible Video VAE for Latent Generative Video Models

Add code
May 30, 2024
Viaarxiv icon

Matryoshka Query Transformer for Large Vision-Language Models

Add code
May 29, 2024
Viaarxiv icon

Mani-GS: Gaussian Splatting Manipulation with Triangular Mesh

Add code
May 28, 2024
Viaarxiv icon

Rip-NeRF: Anti-aliasing Radiance Fields with Ripmap-Encoded Platonic Solids

Add code
May 03, 2024
Viaarxiv icon

VALOR-EVAL: Holistic Coverage and Faithfulness Evaluation of Large Vision-Language Models

Add code
Apr 22, 2024
Viaarxiv icon

Inverse Rendering of Glossy Objects via the Neural Plenoptic Function and Radiance Fields

Add code
Mar 24, 2024
Viaarxiv icon