Picture for Mingmin Chi

Mingmin Chi

OSV: One Step is Enough for High-Quality Image to Video Generation

Add code
Sep 17, 2024
Figure 1 for OSV: One Step is Enough for High-Quality Image to Video Generation
Figure 2 for OSV: One Step is Enough for High-Quality Image to Video Generation
Figure 3 for OSV: One Step is Enough for High-Quality Image to Video Generation
Figure 4 for OSV: One Step is Enough for High-Quality Image to Video Generation
Viaarxiv icon

Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection

Add code
Sep 16, 2024
Viaarxiv icon

VI3DRM:Towards meticulous 3D Reconstruction from Sparse Views via Photo-Realistic Novel View Synthesis

Add code
Sep 12, 2024
Figure 1 for VI3DRM:Towards meticulous 3D Reconstruction from Sparse Views via Photo-Realistic Novel View Synthesis
Figure 2 for VI3DRM:Towards meticulous 3D Reconstruction from Sparse Views via Photo-Realistic Novel View Synthesis
Figure 3 for VI3DRM:Towards meticulous 3D Reconstruction from Sparse Views via Photo-Realistic Novel View Synthesis
Figure 4 for VI3DRM:Towards meticulous 3D Reconstruction from Sparse Views via Photo-Realistic Novel View Synthesis
Viaarxiv icon

DualAnoDiff: Dual-Interrelated Diffusion Model for Few-Shot Anomaly Image Generation

Add code
Aug 24, 2024
Figure 1 for DualAnoDiff: Dual-Interrelated Diffusion Model for Few-Shot Anomaly Image Generation
Figure 2 for DualAnoDiff: Dual-Interrelated Diffusion Model for Few-Shot Anomaly Image Generation
Figure 3 for DualAnoDiff: Dual-Interrelated Diffusion Model for Few-Shot Anomaly Image Generation
Figure 4 for DualAnoDiff: Dual-Interrelated Diffusion Model for Few-Shot Anomaly Image Generation
Viaarxiv icon

MDT-A2G: Exploring Masked Diffusion Transformers for Co-Speech Gesture Generation

Add code
Aug 06, 2024
Figure 1 for MDT-A2G: Exploring Masked Diffusion Transformers for Co-Speech Gesture Generation
Figure 2 for MDT-A2G: Exploring Masked Diffusion Transformers for Co-Speech Gesture Generation
Figure 3 for MDT-A2G: Exploring Masked Diffusion Transformers for Co-Speech Gesture Generation
Figure 4 for MDT-A2G: Exploring Masked Diffusion Transformers for Co-Speech Gesture Generation
Viaarxiv icon

Effective Motion Modeling for UAV-platform Multiple Object Tracking with Re-Margin Loss

Add code
Jul 15, 2024
Figure 1 for Effective Motion Modeling for UAV-platform Multiple Object Tracking with Re-Margin Loss
Figure 2 for Effective Motion Modeling for UAV-platform Multiple Object Tracking with Re-Margin Loss
Figure 3 for Effective Motion Modeling for UAV-platform Multiple Object Tracking with Re-Margin Loss
Figure 4 for Effective Motion Modeling for UAV-platform Multiple Object Tracking with Re-Margin Loss
Viaarxiv icon

AdapNet: Adaptive Noise-Based Network for Low-Quality Image Retrieval

Add code
May 28, 2024
Figure 1 for AdapNet: Adaptive Noise-Based Network for Low-Quality Image Retrieval
Figure 2 for AdapNet: Adaptive Noise-Based Network for Low-Quality Image Retrieval
Figure 3 for AdapNet: Adaptive Noise-Based Network for Low-Quality Image Retrieval
Figure 4 for AdapNet: Adaptive Noise-Based Network for Low-Quality Image Retrieval
Viaarxiv icon

Single-temporal Supervised Remote Change Detection for Domain Generalization

Add code
Apr 19, 2024
Viaarxiv icon

Leveraging Fine-Grained Information and Noise Decoupling for Remote Sensing Change Detection

Add code
Apr 17, 2024
Viaarxiv icon

Learning Unified Reference Representation for Unsupervised Multi-class Anomaly Detection

Add code
Mar 18, 2024
Viaarxiv icon