Picture for Xinyang Jiang

Xinyang Jiang

One Object, Multiple Lies: A Benchmark for Cross-task Adversarial Attack on Unified Vision-Language Models

Add code
Jul 10, 2025
Viaarxiv icon

Prompt Disentanglement via Language Guidance and Representation Alignment for Domain Generalization

Add code
Jul 03, 2025
Viaarxiv icon

TRAIL: Transferable Robust Adversarial Images via Latent diffusion

Add code
May 22, 2025
Figure 1 for TRAIL: Transferable Robust Adversarial Images via Latent diffusion
Figure 2 for TRAIL: Transferable Robust Adversarial Images via Latent diffusion
Figure 3 for TRAIL: Transferable Robust Adversarial Images via Latent diffusion
Figure 4 for TRAIL: Transferable Robust Adversarial Images via Latent diffusion
Viaarxiv icon

Exploring Interpretability for Visual Prompt Tuning with Hierarchical Concepts

Add code
Mar 08, 2025
Viaarxiv icon

VoLUT: Efficient Volumetric streaming enhanced by LUT-based super-resolution

Add code
Feb 17, 2025
Viaarxiv icon

Real-Time Neural-Enhancement for Online Cloud Gaming

Add code
Jan 12, 2025
Figure 1 for Real-Time Neural-Enhancement for Online Cloud Gaming
Figure 2 for Real-Time Neural-Enhancement for Online Cloud Gaming
Figure 3 for Real-Time Neural-Enhancement for Online Cloud Gaming
Figure 4 for Real-Time Neural-Enhancement for Online Cloud Gaming
Viaarxiv icon

HPT++: Hierarchically Prompting Vision-Language Models with Multi-Granularity Knowledge Generation and Improved Structure Modeling

Add code
Aug 27, 2024
Figure 1 for HPT++: Hierarchically Prompting Vision-Language Models with Multi-Granularity Knowledge Generation and Improved Structure Modeling
Figure 2 for HPT++: Hierarchically Prompting Vision-Language Models with Multi-Granularity Knowledge Generation and Improved Structure Modeling
Figure 3 for HPT++: Hierarchically Prompting Vision-Language Models with Multi-Granularity Knowledge Generation and Improved Structure Modeling
Figure 4 for HPT++: Hierarchically Prompting Vision-Language Models with Multi-Granularity Knowledge Generation and Improved Structure Modeling
Viaarxiv icon

ActPrompt: In-Domain Feature Adaptation via Action Cues for Video Temporal Grounding

Add code
Aug 13, 2024
Viaarxiv icon

DiffPhysBA: Diffusion-based Physical Backdoor Attack against Person Re-Identification in Real-World

Add code
May 30, 2024
Figure 1 for DiffPhysBA: Diffusion-based Physical Backdoor Attack against Person Re-Identification in Real-World
Figure 2 for DiffPhysBA: Diffusion-based Physical Backdoor Attack against Person Re-Identification in Real-World
Figure 3 for DiffPhysBA: Diffusion-based Physical Backdoor Attack against Person Re-Identification in Real-World
Figure 4 for DiffPhysBA: Diffusion-based Physical Backdoor Attack against Person Re-Identification in Real-World
Viaarxiv icon

Compression-Realized Deep Structural Network for Video Quality Enhancement

Add code
May 10, 2024
Figure 1 for Compression-Realized Deep Structural Network for Video Quality Enhancement
Figure 2 for Compression-Realized Deep Structural Network for Video Quality Enhancement
Figure 3 for Compression-Realized Deep Structural Network for Video Quality Enhancement
Figure 4 for Compression-Realized Deep Structural Network for Video Quality Enhancement
Viaarxiv icon