Wenbo Hu

TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models

Mar 07, 2025

Revisiting PCA for time series reduction in temporal dimension

Dec 27, 2024

SATA: A Paradigm for LLM Jailbreak via Simple Assistive Task Linkage

Dec 19, 2024

Unveiling Uncertainty: A Deep Dive into Calibration and Performance of Multimodal Large Language Models

Dec 19, 2024

NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images

Dec 04, 2024

Verbalized Representation Learning for Interpretable Few-Shot Generalization

Nov 27, 2024

TimeFormer: Capturing Temporal Relationships of Deformable 3D Gaussians for Robust Reconstruction

Nov 18, 2024

MRAG-Bench: Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models

Oct 10, 2024

StereoCrafter: Diffusion-based Generation of Long and High-fidelity Stereoscopic 3D from Monocular Videos

Sep 11, 2024

ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis

Sep 03, 2024