Picture for In So Kweon

In So Kweon

Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality

Add code
Oct 07, 2024
Figure 1 for Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality
Figure 2 for Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality
Figure 3 for Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality
Figure 4 for Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality
Viaarxiv icon

360 in the Wild: Dataset for Depth Prediction and View Synthesis

Add code
Jun 27, 2024
Figure 1 for 360 in the Wild: Dataset for Depth Prediction and View Synthesis
Figure 2 for 360 in the Wild: Dataset for Depth Prediction and View Synthesis
Figure 3 for 360 in the Wild: Dataset for Depth Prediction and View Synthesis
Figure 4 for 360 in the Wild: Dataset for Depth Prediction and View Synthesis
Viaarxiv icon

Exploring the Spectrum of Visio-Linguistic Compositionality and Recognition

Add code
Jun 13, 2024
Figure 1 for Exploring the Spectrum of Visio-Linguistic Compositionality and Recognition
Figure 2 for Exploring the Spectrum of Visio-Linguistic Compositionality and Recognition
Figure 3 for Exploring the Spectrum of Visio-Linguistic Compositionality and Recognition
Figure 4 for Exploring the Spectrum of Visio-Linguistic Compositionality and Recognition
Viaarxiv icon

Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting

Add code
Jun 04, 2024
Figure 1 for Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting
Figure 2 for Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting
Figure 3 for Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting
Figure 4 for Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting
Viaarxiv icon

MTMMC: A Large-Scale Real-World Multi-Modal Camera Tracking Benchmark

Add code
Mar 29, 2024
Viaarxiv icon

Stable Surface Regularization for Fast Few-Shot NeRF

Add code
Mar 29, 2024
Viaarxiv icon

Towards Understanding Dual BN In Hybrid Adversarial Training

Add code
Mar 28, 2024
Figure 1 for Towards Understanding Dual BN In Hybrid Adversarial Training
Figure 2 for Towards Understanding Dual BN In Hybrid Adversarial Training
Figure 3 for Towards Understanding Dual BN In Hybrid Adversarial Training
Figure 4 for Towards Understanding Dual BN In Hybrid Adversarial Training
Viaarxiv icon

ImageNet-D: Benchmarking Neural Network Robustness on Diffusion Synthetic Object

Add code
Mar 27, 2024
Figure 1 for ImageNet-D: Benchmarking Neural Network Robustness on Diffusion Synthetic Object
Figure 2 for ImageNet-D: Benchmarking Neural Network Robustness on Diffusion Synthetic Object
Figure 3 for ImageNet-D: Benchmarking Neural Network Robustness on Diffusion Synthetic Object
Figure 4 for ImageNet-D: Benchmarking Neural Network Robustness on Diffusion Synthetic Object
Viaarxiv icon

DifAugGAN: A Practical Diffusion-style Data Augmentation for GAN-based Single Image Super-resolution

Add code
Nov 30, 2023
Figure 1 for DifAugGAN: A Practical Diffusion-style Data Augmentation for GAN-based Single Image Super-resolution
Figure 2 for DifAugGAN: A Practical Diffusion-style Data Augmentation for GAN-based Single Image Super-resolution
Figure 3 for DifAugGAN: A Practical Diffusion-style Data Augmentation for GAN-based Single Image Super-resolution
Figure 4 for DifAugGAN: A Practical Diffusion-style Data Augmentation for GAN-based Single Image Super-resolution
Viaarxiv icon

Blurry Video Compression: A Trade-off between Visual Enhancement and Data Compression

Add code
Nov 08, 2023
Viaarxiv icon