Picture for Yu-Gang Jiang

Yu-Gang Jiang

DuMo: Dual Encoder Modulation Network for Precise Concept Erasure

Add code
Jan 02, 2025
Figure 1 for DuMo: Dual Encoder Modulation Network for Precise Concept Erasure
Figure 2 for DuMo: Dual Encoder Modulation Network for Precise Concept Erasure
Figure 3 for DuMo: Dual Encoder Modulation Network for Precise Concept Erasure
Figure 4 for DuMo: Dual Encoder Modulation Network for Precise Concept Erasure
Viaarxiv icon

AIM: Additional Image Guided Generation of Transferable Adversarial Attacks

Add code
Jan 02, 2025
Viaarxiv icon

4D Gaussian Splatting: Modeling Dynamic Scenes with Native 4D Primitives

Add code
Dec 30, 2024
Viaarxiv icon

STNMamba: Mamba-based Spatial-Temporal Normality Learning for Video Anomaly Detection

Add code
Dec 28, 2024
Viaarxiv icon

VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks

Add code
Dec 24, 2024
Viaarxiv icon

Comprehensive Multi-Modal Prototypes are Simple and Effective Classifiers for Vast-Vocabulary Object Detection

Add code
Dec 23, 2024
Viaarxiv icon

CreatiLayout: Siamese Multimodal Diffusion Transformer for Creative Layout-to-Image Generation

Add code
Dec 05, 2024
Figure 1 for CreatiLayout: Siamese Multimodal Diffusion Transformer for Creative Layout-to-Image Generation
Figure 2 for CreatiLayout: Siamese Multimodal Diffusion Transformer for Creative Layout-to-Image Generation
Figure 3 for CreatiLayout: Siamese Multimodal Diffusion Transformer for Creative Layout-to-Image Generation
Figure 4 for CreatiLayout: Siamese Multimodal Diffusion Transformer for Creative Layout-to-Image Generation
Viaarxiv icon

Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning

Add code
Dec 04, 2024
Figure 1 for Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning
Figure 2 for Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning
Figure 3 for Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning
Figure 4 for Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning
Viaarxiv icon

SparseGrasp: Robotic Grasping via 3D Semantic Gaussian Splatting from Sparse Multi-View RGB Images

Add code
Dec 03, 2024
Viaarxiv icon

DiffPatch: Generating Customizable Adversarial Patches using Diffusion Model

Add code
Dec 02, 2024
Viaarxiv icon