Picture for Yukai Shi

Yukai Shi

Simulating the Real World: A Unified Survey of Multimodal Generative Models

Add code
Mar 06, 2025
Figure 1 for Simulating the Real World: A Unified Survey of Multimodal Generative Models
Figure 2 for Simulating the Real World: A Unified Survey of Multimodal Generative Models
Figure 3 for Simulating the Real World: A Unified Survey of Multimodal Generative Models
Figure 4 for Simulating the Real World: A Unified Survey of Multimodal Generative Models
Viaarxiv icon

CrossFuse: Learning Infrared and Visible Image Fusion by Cross-Sensor Top-K Vision Alignment and Beyond

Add code
Feb 20, 2025
Viaarxiv icon

OccScene: Semantic Occupancy-based Cross-task Mutual Learning for 3D Scene Generation

Add code
Dec 15, 2024
Figure 1 for OccScene: Semantic Occupancy-based Cross-task Mutual Learning for 3D Scene Generation
Figure 2 for OccScene: Semantic Occupancy-based Cross-task Mutual Learning for 3D Scene Generation
Figure 3 for OccScene: Semantic Occupancy-based Cross-task Mutual Learning for 3D Scene Generation
Figure 4 for OccScene: Semantic Occupancy-based Cross-task Mutual Learning for 3D Scene Generation
Viaarxiv icon

UniG: Modelling Unitary 3D Gaussians for View-consistent 3D Reconstruction

Add code
Oct 17, 2024
Figure 1 for UniG: Modelling Unitary 3D Gaussians for View-consistent 3D Reconstruction
Figure 2 for UniG: Modelling Unitary 3D Gaussians for View-consistent 3D Reconstruction
Figure 3 for UniG: Modelling Unitary 3D Gaussians for View-consistent 3D Reconstruction
Figure 4 for UniG: Modelling Unitary 3D Gaussians for View-consistent 3D Reconstruction
Viaarxiv icon

Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content

Add code
Oct 10, 2024
Viaarxiv icon

CrossDehaze: Scaling Up Image Dehazing with Cross-Data Vision Alignment and Augmentation

Add code
Jul 20, 2024
Figure 1 for CrossDehaze: Scaling Up Image Dehazing with Cross-Data Vision Alignment and Augmentation
Figure 2 for CrossDehaze: Scaling Up Image Dehazing with Cross-Data Vision Alignment and Augmentation
Figure 3 for CrossDehaze: Scaling Up Image Dehazing with Cross-Data Vision Alignment and Augmentation
Figure 4 for CrossDehaze: Scaling Up Image Dehazing with Cross-Data Vision Alignment and Augmentation
Viaarxiv icon

Diff-Mosaic: Augmenting Realistic Representations in Infrared Small Target Detection via Diffusion Prior

Add code
Jun 02, 2024
Viaarxiv icon

IDF-CR: Iterative Diffusion Process for Divide-and-Conquer Cloud Removal in Remote-sensing Images

Add code
Mar 18, 2024
Figure 1 for IDF-CR: Iterative Diffusion Process for Divide-and-Conquer Cloud Removal in Remote-sensing Images
Figure 2 for IDF-CR: Iterative Diffusion Process for Divide-and-Conquer Cloud Removal in Remote-sensing Images
Figure 3 for IDF-CR: Iterative Diffusion Process for Divide-and-Conquer Cloud Removal in Remote-sensing Images
Figure 4 for IDF-CR: Iterative Diffusion Process for Divide-and-Conquer Cloud Removal in Remote-sensing Images
Viaarxiv icon

CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility

Add code
Mar 18, 2024
Viaarxiv icon

SIRST-5K: Exploring Massive Negatives Synthesis with Self-supervised Learning for Robust Infrared Small Target Detection

Add code
Mar 08, 2024
Figure 1 for SIRST-5K: Exploring Massive Negatives Synthesis with Self-supervised Learning for Robust Infrared Small Target Detection
Figure 2 for SIRST-5K: Exploring Massive Negatives Synthesis with Self-supervised Learning for Robust Infrared Small Target Detection
Figure 3 for SIRST-5K: Exploring Massive Negatives Synthesis with Self-supervised Learning for Robust Infrared Small Target Detection
Figure 4 for SIRST-5K: Exploring Massive Negatives Synthesis with Self-supervised Learning for Robust Infrared Small Target Detection
Viaarxiv icon